Project 2 DNA

Project 2: DNA Analysis
Due Dates:
Checkpoint 1 1/7/14 10%
Final Due Date 1/12/14

Students will write a program that uses arrays and files to analyze DNA sequences and determine if they represent proteins.

Special thanks to Stuart Reges and Marty Stepp of UW for use of this assignment.
I. Background
Deoxyribonucleic acid (DNA) is a complex biochemical macromolecule that carries genetic information for cellular life forms and some viruses. DNA is also the mechanism through which genetic information from parents is passed on during reproduction. DNA consists of long chains of chemical compounds called nucleotides. Four nucleotides are present in DNA: Adenine (A), Cytosine (C), Guanine (G), and Thymine (T). Certain regions of the DNA are called genes. Most genes encode instructions for building proteins (they're called "protein-coding" genes). These proteins are responsible for carrying out most of the life processes of the organism. Nucleotides in a gene are organized into codons. Codons are groups of three nucleotides and are written as the first letters of their nucleotides (e.g., TAC or GGA). Each codon uniquely encodes a single amino acid, a building block of proteins.

The sequences of DNA that encode proteins occur between a start codon (which we will assume to be ATG) and a stop codon (which is any of TAA, TAG, or TGA). Not all regions of DNA are genes; large portions that do not lie between a valid start and stop codon are called intergenic DNA and have other (possibly unknown) function. Computational biologists examine large DNA data files to find patterns and important information, such as which regions are genes. Sometimes they are interested in the percentages of mass accounted for by each of the four nucleotide types. Often high percentages of Cytosine (C) and Guanine (G) are indicators of important genetic data.

In this assignment, you will write a program the reads named nucleotide sequences from an input file and performs analysis on the

Project 2 DNA

You May Also Find These Documents Helpful

Btec Level 3 Unit 25 D2

Btec Level 3 Unit 25 D2

Homework04

Homework04

Dna Sci/230

Dna Sci/230

Cell Physiology Study Guide

Cell Physiology Study Guide

GE Hw 2

GE Hw 2

Ways in which living organisms differ from each other

Ways in which living organisms differ from each other

Dna Chip

Dna Chip

Resuscitation of extinct species

Resuscitation of extinct species

The Human Genome Project (Hgp) and Bioinformatics

The Human Genome Project (Hgp) and Bioinformatics

DNA COMPUTING

DNA COMPUTING

Ib Diploma Biology Notes

Ib Diploma Biology Notes

Study Guide Exam 4

Study Guide Exam 4

Dna Computing

Dna Computing

Bioinformatics

Bioinformatics

Time Pass

Time Pass

Related Topics