Skip to content

minhhuy24072002/Coding-Challenge-S21

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ACM Research Coding Challenge (Spring 2021)

No Collaboration Policy

You may not collaborate with anyone on this challenge. You are allowed to use Internet documentation. If you do use existing code (either from Github, Stack Overflow, or other sources), please cite your sources in the README.

Submission Procedure

Please follow the below instructions on how to submit your answers.

  1. Create a public fork of this repo and name it ACM-Research-Coding-Challenge-S21. To fork this repo, click the button on the top right and click the "Fork" button.
  2. Clone the fork of the repo to your computer using git clone [the URL of your clone]. You may need to install Git for this (Google it).
  3. Complete the Challenge based on the instructions below.
  4. Submit your solution by filling out this form.

Question One

Genome analysis is the identification of genomic features such as gene expression or DNA sequences in an individual's genetic makeup. A genbank file (.gb) format contains information about an individual's DNA sequence. The following dataset in Genome.gb contains a complete genome sequence of Tomato Curly Stunt Virus.

With this file, create a circular genome map and output it as a JPG/PNG/JPEG format. We're not looking for any complex maps, just be sure to highlight the features and their labels.

You may use any programming language you feel most comfortable. We recommend Python because it is the easiest to implement. You're allowed to use any library you want to implement this, just document which ones you used in this README file. Try to complete this as soon as possible.

Regardless if you can or cannot answer the question, provide a short explanation of how you got your solution or how you think it can be solved in your README.md file. However, we highly recommend giving the challenge a try, you just might learn something new!

In this project a mainly use biopython. With biopython I can read genbank file with SeqIO then extract information about gene name, start, end, strand,... With GenomeDiagram I can draw linear or circular genome graph. However, I have change something to make my graph more understandable. Firstly, I set the color differently for each of gene for the graph to help viewer distinguish between different genes. Secondly, I add an inner circle to show to data of start and end of genes so that the viewer have more informations. Finally, I add EcoRI recognition, an strandless feature, to give more information to the viewer.

The libray that I use: biopython https://biopython.org/wiki/Download reportlab https://pypi.org/project/reportlab/

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%