USER:ESTEVEZJ SANDBOX GENOMICS

Genomics is a discipline in genetics that applies recombinant DNA, DNA sequencing methods, and bioinformatics to sequence, assemble, and analyze the function and structure of genomes (the complete set of DNA within a single cell of an organism).^[1]^[2] The field includes efforts to determine the entire DNA sequence of organisms and fine-scale genetic mapping. The field also includes studies of intragenomic phenomena such as heterosis, epistasis, pleiotropy and other interactions between loci and alleles within the genome.^[3] In contrast, the investigation of the roles and functions of single genes is a primary focus of molecular biology or genetics and is a common topic of modern medical and biological research. Research of single genes does not fall into the definition of genomics unless the aim of this genetic, pathway, and functional information analysis is to elucidate its effect on, place in, and response to the entire genome's networks.^[4]

History

Etymology

While the word 'genome' (from the German Genom, attributed to Hans Winkler) was in use in English as early as 1926, ^[5] the term 'genomics' was coined by Dr. Tom Roderick, a geneticist at the Jackson Laboratory ( Bar Harbor, ME) over beer at a meeting held in Maryland on the mapping of the human genome in 1986. ^[6]

Early sequencing efforts

Following James D. Watson and Francis Crick's discovery of the structure of DNA in 1953 and Fred Sanger's publication of the Amino acid sequence of insulin in 1955, nucleic acid sequencing became a major target of early molecular biologists.^[7] In 1964, Robert W. Holley and colleagues published the first nucleic acid sequence ever determined, the ribonucleotide sequence of alanine transfer RNA.^[8]^[9] Extending this work, Marshall Nirenberg and Philip Leder revealed the triplet nature of the genetic code and were able to determine the sequences of 54 out of 64 codons in their experiments.^[10] In 1972, Walter Fiers and his team at the Laboratory of Molecular Biology of the University of Ghent ( Ghent, Belgium) were the first to determine the sequence of a gene: the gene for Bacteriophage MS2 coat protein.^[11] Fiers' group expanded on their MS2 coat protein work, determining the complete nucleotide-sequence of bacteriophage MS2-RNA (whose genome encodes just four genes in 3569 base pairs [bp]) and Simian virus 40 in 1976 and 1978, respectively.^[12]^[13]

DNA sequencing technology developed

Frederick Sanger

Walter Gilbert

Frederick Sanger and Walter Gilbert shared half of the 1980 Nobel Prize in chemistry for independently developing methods for the sequencing of DNA.

In addition to his seminal work on the amino acid sequence of insulin, Frederick Sanger and his colleagues played a key role in the development of DNA sequencing techniques that enabled the establishment of comprehensive genome sequencing projects.^[3] In 1975, he and Alan Coulson published a sequencing procedure using DNA polymerase with radiolabelled nucleotides that he called the Plus and Minus technique.^[14]^[15] This involved two closely related methods that generated short oligonucleotides with defined 3' termini. These could be fractionated by electrophoresis on a polyacrylamide gel and visualised using autoradiography. The procedure could sequence up to 80 nucleotides in one go and was a big improvement on what gone before but was still very laborious. Nevertheless, in 1977 his group was able to sequence most of the 5,386 nucleotides of the single-stranded bacteriophage φX174, completing the first fully sequenced DNA-based genome.^[16] The refinement of the Plus and Minus method resulted in the chain-termination, or Sanger method, which formed the basis of the techniques of DNA sequencing, genome mapping, data storage, and bioinformatic analysis most widely used in the following quarter-century of research.^[17]^[18] In the same year Walter Gilbert and Allan Maxam of Harvard University independently developed the Maxam-Gilbert method (also known as the chemical method) of DNA sequencing, involving the preferential cleavage of DNA at known bases, a less efficient method.^[19]^[20] For their groundbreaking work in the sequencing of nucleic acids, Gilbert and Sanger shared half the 1980 Nobel Prize in chemistry with Paul Berg ( recombinant DNA).

Complete genomes

The advent of these technologies resulted in a rapid intensification in the scope and speed of completion of genome sequencing projects. The first complete genome sequence of an eukaryotic organelle, the human mitochondrion (16,568 bp, about 16.6 kb [kilobase]), was reported in 1981,^[21] and the first chloroplast genomes followed in 1986.^[22]^[23] In 1992, the first eukaryotic chromosome, chromosome III of brewer's yeast Saccharomyces cerevisiae (315 kb) was sequenced.^[24] The first free-living organism to be sequenced was that of Haemophilus influenzae (1.8 Mb [megabase]) in 1995.^[25] The following year a consortium of researchers from laboratories across North America, Europe, and Japan announced the completion of the first complete genome sequence of an eukaryote, S. cerevisiae (12.1 Mb) was completed the following year and since then genomes have continued being sequenced at an exponentially growing pace.^[26] As of October 2011^[update], the complete sequences are available for: 2719 viruses, 1115 archaea and bacteria, and 36 eukaryotes, of which about half are fungi.^[27] ^[28]

Most of the microorganisms whose genomes have been completely sequenced are problematic pathogens, such as Haemophilus influenzae, which has resulted in a pronounced bias in their phylogenetic distribution compared to the breadth of microbial diversity.^[29]^[30] Of the other sequenced species, most were chosen because they were well-studied model organisms or promised to become good models. Yeast ( Saccharomyces cerevisiae) has long been an important model organism for the eukaryotic cell, while the fruit fly Drosophila melanogaster has been a very important tool (notably in early pre-molecular genetics). The worm Caenorhabditis elegans is an often used simple model for multicellular organisms. The zebrafish Brachydanio rerio is used for many developmental studies on the molecular level and the flower Arabidopsis thaliana is a model organism for flowering plants. The Japanese pufferfish ( Takifugu rubripes) and the spotted green pufferfish ( Tetraodon nigroviridis) are interesting because of their small and compact genomes, containing very little non-coding DNA compared to most species.^[31]^[32] The mammals dog ( Canis familiaris), ^[33] brown rat ( Rattus norvegicus), mouse ( Mus musculus), and chimpanzee ( Pan troglodytes) are all important model animals in medical research. ^[20]

A rough draft of the human genome was completed by the Human Genome Project in early 2001, creating much fanfare.^[34] This project, completed in 2003, sequenced the entire genome for one specific person, and by 2007 this sequence was declared "finished" (less than one error in 20,000 bases and all chromosomes assembled).^[34] In the years since then, the genomes of many other individual people have been sequenced, partly under the auspices of the 1000 Genomes Project, which announced the sequencing of 1,092 genomes in October 2012.^[35] Completion of this project was made possible by the development of dramatically more efficient sequencing technologies and required the commitment of significant bioinformatics resources from large international collaboration.^[36] The continued analysis of human genomic data has profound political and social repercussions for human societies.^[37]

Next-generation sequencing

^[38]

Size comparison of selected genomes.^[39] ^[40]
Latin Name	Common Name	Genome Size
Eukaryotes
Lilium longiflorum	Easter lily	90,000,000 Kb
Homo sapiens	Human	3,200,000 Kb
Oryza sativa	Rice	420,000 Kb
Drosophila melanogaster	Fruit fly	137,000 Kb
Arabidopsis thaliana	Mustard cress	115,428 Kb
Caenorhabditis elegans	Roundworm	97,000 Kb
Saccharomyces cerevisiae	Yeast	12,069 Kb
Eubacteria
Haemophilus influenzae	Pfeiffer's bacillus	1,830 Kb
Escherichia coli	Human colon bacterium	4,639 Kb
Helicobacter pylori	Stomach ulcer bacterium	1,667 Kb
Mycobacterium tuberculosis	Tuberculosis	4,411 Kb
Yersinia pestis	Plague	4,653 Kb
Archaea
Halobacterium	Salt-tolerant archaean	2,014 Kb
Methanobacterium thermoautotrophicum	Methane-producing archaean	1,751 Kb

Genome analysis

After an organism has been selected, genome projects involve three components: the sequencing of DNA, the assembly of that sequence to create a representation of the original chromosome, and the annotation and analysis of that representation.^[3]

Sequencing

Historically, sequencing was done in sequencing centers, centralized facilities (ranging from large independent institutions such as Joint Genome Institute which sequence dozens of terabases a year, to local molecular biology core facilities) which contain research laboratories with the costly instrumentation and technical support necessary. As sequencing technology continues to improve, however, a new generation of effective fast turnaround benchtop sequencers has come within reach of the average academic laboratory.^[41]^[42] On the whole, genome sequencing approaches fall into two broad categories, shotgun and high-throughput (aka next-generation) sequencing.^[3]

Shotgun sequencing

Shotgun sequencing (Sanger sequencing is used interchangably) is a sequencing method designed for analysis of DNA sequences longer than 1000 base pairs, up to and including entire chromosomes.^[43] It is named by analogy with the rapidly-expanding, quasi-random firing pattern of a shotgun. Since the chain termination method of DNA sequencing can only be used for fairly short strands (100 to 1000 basepairs), longer DNA sequences must be broken into random small segments which are then sequenced to obtain reads. Multiple overlapping reads for the target DNA are obtained by performing several rounds of this fragmentation and sequencing. Computer programs then use the overlapping ends of different reads to assemble them into a continuous sequence.^[43]^[44]

The technology underlying shotgun sequencing is the classical chain-termination method, which is based on the selective incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication.^[45]^[16] Developed by Frederick Sanger and colleagues in 1977, it was the most widely-used sequencing method for approximately 25 years. More recently, Sanger sequencing has been supplanted by "Next-Gen" sequencing methods, especially for large-scale, automated genome analyses. However, the Sanger method remains in wide use in 2012, primarily for smaller-scale projects and for obtaining especially long contiguous DNA sequence reads (>500 nucleotides).^[46] Chain-termination methods require a single-stranded DNA template, a DNA primer, a DNA polymerase, normal deoxynucleotidetriphosphates (dNTPs), and modified nucleotides (dideoxyNTPs) that terminate DNA strand elongation. These chain-terminating nucleotides lack a 3'- OH group required for the formation of a phosphodiester bond between two nucleotides, causing DNA polymerase to cease extension of DNA when a ddNTP is incorporated. The ddNTPs may be radioactively or fluorescently labelled for detection in automated sequencing machines.^[3] Typically, these automated DNA-sequencing instruments ( DNA sequencers) can sequence up to 96 DNA samples in a single batch (run) in up to 48 runs a day.^[47]

High-throughput sequencing

The high demand for low-cost sequencing has driven the development of high-throughput sequencing (or next-generation sequencing [NGS]) technologies that parallelize the sequencing process, producing thousands or millions of sequences at once.^[48]^[49] High-throughput sequencing technologies are intended to lower the cost of DNA sequencing beyond what is possible with standard dye-terminator methods. In ultra-high-throughput sequencing as many as 500,000 sequencing-by-synthesis operations may be run in parallel.^[50]^[51]

While NGS methods' high throughput are associated with the sequencing of large eukaryotic genomes, their scalability gives them applications in the sequencing of smaller prokaryotic genomes.^[47] In a December 2012 comparison of NGS technologies, researchers from the Integrated Microbial Genomes project found comparison ^[46]

454 pyrosequencing

Illumina (Solexa) sequencing

Solexa, now part of Illumina, developed a sequencing method based on reversible dye-terminators technology acquired from Manteia Predictive Medicine in 2004. This technology had been invented and developed in late 1996 at Glaxo-Welcome's Geneva Biomedical Research Institute (GBRI), by Dr. Pascal Mayer and Dr Laurent Farinelli.^[52] In this method, DNA molecules and primers are first attached on a slide and amplified with polymerase so that local clonal colonies, initially coined "DNA colonies", are formed. To determine the sequence, four types of reversible terminator bases (RT-bases) are added and non-incorporated nucleotides are washed away. Unlike pyrosequencing, the DNA chains are extended one nucleotide at a time and image acquisition can be performed at a delayed moment, allowing for very large arrays of DNA colonies to be captured by sequential images taken from a single camera.

Decoupling the enzymatic reaction and the image capture allows for optimal throughput and theoretically unlimited sequencing capacity. With an optimal configuration, the ultimately reachable instrument throughput is thus dictated solely by the analogic-to-digital conversion rate of the camera, multiplied by the number of cameras and divided by the number of pixels per DNA colony required for visualizing them optimally (approximately 10 pixels/colony). In 2012, with cameras operating at more than 10 MHz A/D conversion rates and available optics, fluidics and enzymatics, throughput can be multiples of 1 million nucleotides/second, corresponding roughly to 1 human genome equivalent at 1x coverage per hour per instrument, and 1 human genome re-sequenced (at approx. 30x) per day per instrument (equipped with a single camera). The camera takes images of the fluorescently labeled nucleotides, then the dye along with the terminal 3' blocker is chemically removed from the DNA, allowing the next cycle.^[53]

Single cell genomics

metagenomes and, ^[54]^[55]
Origins of, ^[56]
Medical applications of, ^[57]

Assembly

Overlapping reads form contigs; contigs and gaps of known length form scaffolds.

Paired end reads of next generation sequencing data mapped to a reference genome.

Multiple, fragmented sequence reads must be assembled together on the basis of their overlapping areas.

When are genomes finished?

genome standards ^[58]
Coverage?

Challenges

challenges reviewed:
- NGS and, ^[59]
- Mammalian assembly and, ^[60]
Assembler performance compared (2011) ^[61]^[62]

Algorithms

Scaffolding

De-novo vs. mapping assembly

Finishing

Annotation

The DNA sequence alone is of little value without additional analysis.^[3] Genome annotation is the process of attaching biological information to sequences, and consists of three main steps:^[63]

identifying portions of the genome that do not code for proteins
identifying elements on the genome, a process called gene prediction, and
attaching biological information to these elements.

Automatic annotation tools try to perform these steps in silico, as opposed to manual annotation (a.k.a. curation) which involves human expertise and potential experimental verification.^[64] Ideally, these approaches co-exist and complement each other in the same annotation pipeline (also see below).

Traditionally, the basic level of annotation is using BLAST for finding similarities, and then annotating genomes based on homolouges.^[3] More recently, additional information is added to the annotation platform. The additional information allows manual annotators to deconvolute discrepancies between genes that are given the same annotation. Some databases use genome context information, similarity scores, experimental data, and integrations of other resources to provide genome annotations through their Subsystems approach. Other databases (e.g. Ensembl) rely on both curated data sources as well as a range of different software tools in their automated genome annotation pipeline.^[65] Structural annotation consists of the identification of genomic elements, primarily ORFs and their localisation, or gene structure. Functional annotation consists of attaching biological information to genomic elements.

Sequencing pipelines and databases

The need for reproducibility and efficient management of large amount of data associated with genome projects mean that computational pipelines have important applications in genomics.^[66]

Research areas

Functional genomics

Functional genomics is a field of molecular biology that attempts to make use of the vast wealth of data produced by genomic projects (such as genome sequencing projects) to describe gene (and protein) functions and interactions. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. Functional genomics attempts to answer questions about the function of DNA at the levels of genes, RNA transcripts, and protein products. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional “gene-by-gene” approach.

A major branch of genomics is still concerned with sequencing the genomes of various organisms, but the knowledge of full genomes has created the possibility for the field of functional genomics, mainly concerned with patterns of gene expression during various conditions. The most important tools here are microarrays and bioinformatics.

Evolutionary genomics

Structural genomics

Structural genomics seeks to describe the 3-dimensional structure of every protein encoded by a given genome.^[67]^[68] This genome-based approach allows for a high-throughput method of structure determination by a combination of experimental and modeling approaches. The principal difference between structural genomics and traditional structural prediction is that structural genomics attempts to determine the structure of every protein encoded by the genome, rather than focusing on one particular protein. With full-genome sequences available, structure prediction can be done more quickly through a combination of experimental and modeling approaches, especially because the availability of large number of sequenced genomes and previously-solved protein structures allows scientists to model protein structure on the structures of previously solved homologs. Structural genomics involves taking a large number of approaches to structure determination, including experimental methods using genomic sequences or modeling-based approaches based on sequence or structural homology to a protein of known structure or based on chemical and physical principles for a protein with no homology to any known structure. As opposed to traditional structural biology, the determination of a protein structure through a structural genomics effort often (but not always) comes before anything is known regarding the protein function. This raises new challenges in structural bioinformatics, i.e. determining protein function from its 3D structure.^[69]

Epigenomics

Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome.^[70] Epigenetic modifications are reversible modifications on a cell’s DNA or histones that affect gene expression without altering the DNA sequence (Russell 2010 p. 475). Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as in differentiation/development and tumorigenesis ^[70]. The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays ^[71]

Metagenomics

Metagenomics is the study of metagenomes, genetic material recovered directly from environmental samples. The broad field may also be referred to as environmental genomics, ecogenomics or community genomics. While traditional microbiology and microbial genome sequencing rely upon cultivated clonal cultures, early environmental gene sequencing cloned specific genes (often the 16S rRNA gene) to produce a profile of diversity in a natural sample. Such work revealed that the vast majority of microbial biodiversity had been missed by cultivation-based methods.^[72] Recent studies use "shotgun" Sanger sequencing or massively parallel pyrosequencing to get largely unbiased samples of all genes from all the members of the sampled communities.^[73] Because of its power to reveal the previously hidden diversity of microscopic life, metagenomics offers a powerful lens for viewing the microbial world that has the potential to revolutionize understanding of the entire living world.^[74]^[75]

Study systems

Viruses and bacteriophages

Bacteriophages have played and continue to play a key role in bacterial genetics and molecular biology. Historically, they were used to define gene structure and gene regulation. Also the first genome to be sequenced was a bacteriophage. However, bacteriophage research did not lead the genomics revolution, which is clearly dominated by bacterial genomics. Only very recently has the study of bacteriophage genomes become prominent, thereby enabling researchers to understand the mechanisms underlying phage evolution. Bacteriophage genome sequences can be obtained through direct sequencing of isolated bacteriophages, but can also be derived as part of microbial genomes. Analysis of bacterial genomes has shown that a substantial amount of microbial DNA consists of prophage sequences and prophage-like elements.^[76] A detailed database mining of these sequences offers insights into the role of prophages in shaping the bacterial genome.^[77]^[78]

Microbes

^[79] ^[80]

Cyanobacteria

At present there are 24 cyanobacteria for which a total genome sequence is available. 15 of these cyanobacteria come from the marine environment. These are six Prochlorococcus strains, seven marine Synechococcus strains, Trichodesmium erythraeum IMS101 and Crocosphaera watsonii WH8501. Several studies have demonstrated how these sequences could be used very successfully to infer important ecological and physiological characteristics of marine cyanobacteria. However, there are many more genome projects currently in progress, amongst those there are further Prochlorococcus and marine Synechococcus isolates, Acaryochloris and Prochloron, the N₂-fixing filamentous cyanobacteria Nodularia spumigena, Lyngbya aestuarii and Lyngbya majuscula, as well as bacteriophages infecting marine cyanobaceria. Thus, the growing body of genome information can also be tapped in a more general way to address global problems by applying a comparative approach. Some new and exciting examples of progress in this field are the identification of genes for regulatory RNAs, insights into the evolutionary origin of photosynthesis, or estimation of the contribution of horizontal gene transfer to the genomes that have been analyzed.^[81]

Animals

Human genomics

Applications of genomics

Genomics has provided applications in many fields, including medicine, biotechnology, anthropology and other social sciences.^[37]

Genomic medicine

^[82]

^[83] ^[84] ^[85] ^[86]

Synthetic biology and bioengineering

^[87]

Social aspects

'Astrological' genomics

^[88]

Race and genomics

^[89]^[90]^[91]

'Omics

References

^ National Human Genome Research Institute (2010-11-08). "A Brief Guide to Genomics". Genome.gov. Retrieved 2011-12-03.
^ Concepts of genetics (10th ed.). San Francisco: Pearson Education. 2012. ISBN 9780321724120.
^ ^a ^b ^c ^d ^e ^f ^g Pevsner, Jonathan (2009). Bioinformatics and functional genomics (2nd ed.). Hoboken, N.J: Wiley-Blackwell. ISBN 9780470085851.
^ National Human Genome Research Institute (2010-11-08). "FAQ About Genetic and Genomic Science". Genome.gov. Retrieved 2011-12-03.
^ "Genome, n.". Oxford English Dictionary (Third ed.). Oxford University Press. 2008. Retrieved 2012-12-01.(subscription required)
^ Yadav, S. P. (2007). "The wholeness in suffix -omics, -omes, and the word om". Journal of Biomolecular Techniques : JBT. 18 (5): 277. PMC 2392988. PMID 18166670.
^ Ankeny, Rachel A. (June 2003). "Sequencing the genome from nematode to human: changing methods, changing science". Endeavour. 27 (2): 87–92. doi: 10.1016/S0160-9327(03)00061-9. ISSN 0160-9327. PMID 12798815. Retrieved 2012-06-18.
^ Holley RW, Everett GA, Madison JT, Zamir A. (1965 May). "Nucleotide Sequences In The Yeast Alanine Transfer Ribonucleic Acid" (PDF). J Biol Chem. 240 (5): 2122–8. doi: 10.1016/S0021-9258(18)97435-1. PMID 14299636. {{ cite journal}}: Check date values in: |date= ( help)CS1 maint: multiple names: authors list ( link)
^ Holley RW, Apgar J, Everett GA, Madison JT, Marquisee M, Merrill SH, Penswick JR, Zamir A (1965-03-19). "Structure Of A Ribonucleic Acid". Science. 147 (3664): 1462–5. Bibcode: 1965Sci...147.1462H. doi: 10.1126/science.147.3664.1462. PMID 14263761. S2CID 40989800.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)
^ Nirenberg M, Leder P, Bernfield M, Brimacombe R, Trupin J, Rottman F, O'Neal C (May 1965). "RNA codewords and protein synthesis, VII. On the general nature of the RNA code". Proc. Natl. Acad. Sci. U.S.A. 53 (5): 1161–8. Bibcode: 1965PNAS...53.1161N. doi: 10.1073/pnas.53.5.1161. PMC 301388. PMID 5330357.{{ cite journal}}: CS1 maint: date and year ( link) CS1 maint: multiple names: authors list ( link)
^ Min Jou W, Haegeman G, Ysebaert M, Fiers W (1972). "Nucleotide sequence of the gene coding for the bacteriophage MS2 coat protein". Nature. 237 (5350): 82–88. Bibcode: 1972Natur.237...82J. doi: 10.1038/237082a0. PMID 4555447. S2CID 4153893.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)
^ Fiers W, Contreras R, Duerinck F, Haegeman G, Iserentant D, Merregaert J, Min Jou W, Molemans F, Raeymaekers A, Van den Berghe A, Volckaert G, Ysebaert M (1976). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature. 260 (5551): 500–507. Bibcode: 1976Natur.260..500F. doi: 10.1038/260500a0. PMID 1264203. S2CID 4289674.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)
^ Fiers, W.; Contreras, R.; Haegeman, G.; Rogiers, R.; Van De Voorde, A.; Van Heuverswyn, H.; Van Herreweghe, J.; Volckaert, G.; Ysebaert, M. (1978-05-11). "Complete nucleotide sequence of SV40 DNA". Nature. 273 (5658): 113–120. Bibcode: 1978Natur.273..113F. doi: 10.1038/273113a0. ISSN 0028-0836. PMID 205802. S2CID 1634424. Retrieved 2012-12-20. {{ cite journal}}: Text "113-120" ignored ( help)
^ Tamarin, Robert H (2004). Principles of genetics (7 ed.). London: McGraw Hill. ISBN 0071243208-9780071243209. {{ cite book}}: Check |isbn= value: length ( help)
^ Sanger, F. (1980), Nobel lecture: Determination of nucleotide sequences in DNA (PDF), Nobelprize.org, retrieved 2010-10-18
^ ^a ^b Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA, Hutchison CA, Slocombe PM, Smith M (1977). "Nucleotide sequence of bacteriophage phi X174 DNA". Nature. 265 (5596): 687–695. Bibcode: 1977Natur.265..687S. doi: 10.1038/265687a0. PMID 870828. S2CID 4206886.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)
^ Kaiser, Olaf; Bartels, Daniela; Bekel, Thomas; Goesmann, Alexander; Kespohl, Sebastian; Pühler, Alfred; Meyer, Folker (December 2003). "Whole genome shotgun sequencing guided by bioinformatics pipelines—an optimized approach for an established technique" (PDF). Journal of Biotechnology. 106 (2–3): 121–133. doi: 10.1016/j.jbiotec.2003.08.008. ISSN 0168-1656. PMID 14651855. Retrieved 2012-12-20.
^ Sanger, F.; Nicklen, S.; Coulson, A. R. (1977). "DNA sequencing with chain-terminating inhibitors". Proceedings of the National Academy of Sciences of the United States of America. 74 (12): 5463–5467. Bibcode: 1977PNAS...74.5463S. doi: 10.1073/pnas.74.12.5463. PMC 431765. PMID 271968.
^ Maxam, A. M.; Gilbert, W. (February 1977). "A new method for sequencing DNA". Proceedings of the National Academy of Sciences of the United States of America. 74 (2): 560–564. Bibcode: 1977PNAS...74..560M. doi: 10.1073/pnas.74.2.560. ISSN 0027-8424. PMC 392330. PMID 265521.
^ ^a ^b Darden, Lindley (2010). "Molecular Biology". In Edward N. Zalta (ed.) (ed.). The Stanford Encyclopedia of Philosophy (Fall 2010 ed.). Retrieved 2012-12-20. {{ cite book}}: |editor= has generic name ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)
^ Anderson, S.; Bankier, A. T.; Barrell, B. G.; De Bruijn, M. H. L.; Coulson, A. R.; Drouin, J.; Eperon, I. C.; Nierlich, D. P.; Roe, B. A.; Sanger, F.; Schreier, P. H.; Smith, A. J. H.; Staden, R.; Young, I. G. (1981). "Sequence and organization of the human mitochondrial genome". Nature. 290 (5806): 457–465. Bibcode: 1981Natur.290..457A. doi: 10.1038/290457a0. PMID 7219534. S2CID 4355527.(subscription required)
^ Shinozaki, K.; Ohme, M.; Tanaka, M.; Wakasugi, T.; Hayashida, N.; Matsubayashi, T.; Zaita, N.; Chunwongse, J.; Obokata, J.; Yamaguchi-Shinozaki, K.; Ohto, C.; Torazawa, K.; Meng, B. Y.; Sugita, M.; Deno, H.; Kamogashira, T.; Yamada, K.; Kusuda, J.; Takaiwa, F.; Kato, A.; Tohdoh, N.; Shimada, H.; Sugiura, M. (1986). "The complete nucleotide sequence of the tobacco chloroplast genome: Its gene organization and expression". The EMBO Journal. 5 (9): 2043–2049. doi: 10.1002/j.1460-2075.1986.tb04464.x. PMC 1167080. PMID 16453699.
^ Ohyama, K.; Fukuzawa, H.; Kohchi, T.; Shirai, H.; Sano, T.; Sano, S.; Umesono, K.; Shiki, Y.; Takeuchi, M.; Chang, Z.; Aota, S. I.; Inokuchi, H.; Ozeki, H. (1986). "Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA". Nature. 322 (6079): 572. Bibcode: 1986Natur.322..572O. doi: 10.1038/322572a0. S2CID 4311952.
^ Oliver, S. G.; Van Der Aart, Q. J. M.; Agostoni-Carbone, M. L.; Aigle, M.; Alberghina, L.; Alexandraki, D.; Antoine, G.; Anwar, R.; Ballesta, J. P. G.; Benit, P.; Berben, G.; Bergantino, E.; Biteau, N.; Bolle, P. A.; Bolotin-Fukuhara, M.; Brown, A.; Brown, A. J. P.; Buhler, J. M.; Carcano, C.; Carignani, G.; Cederberg, H.; Chanet, R.; Contreras, R.; Crouzet, M.; Daignan-Fornier, B.; Defoor, E.; Delgado, M.; Demolder, J.; Doira, C.; et al. (1992). "The complete DNA sequence of yeast chromosome III". Nature. 357 (6373): 38–46. Bibcode: 1992Natur.357...38O. doi: 10.1038/357038a0. PMID 1574125. S2CID 4271784.
^ Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM; et al. (1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science. 269 (5223): 496–512. Bibcode: 1995Sci...269..496F. doi: 10.1126/science.7542800. PMID 7542800. {{ cite journal}}: Explicit use of et al. in: |author= ( help)CS1 maint: multiple names: authors list ( link)
^ Goffeau, A.; Barrell, B. G.; Bussey, H.; Davis, R. W.; Dujon, B.; Feldmann, H.; Galibert, F.; Hoheisel, J. D.; Jacq, C.; Johnston, M.; Louis, E. J.; Mewes, H. W.; Murakami, Y.; Philippsen, P.; Tettelin, H.; Oliver, S. G. (Oct 1996). "Life with 6000 genes". Science. 274 (5287): 546, 563–7. Bibcode: 1996Sci...274..546G. doi: 10.1126/science.274.5287.546. ISSN 0036-8075. PMID 8849441. S2CID 16763139.(subscription required)
^ "Complete genomes: Viruses". NCBI. 2011-11-17. Retrieved 2011-11-18.
^ "Genome Project Statistics". Entrez Genome Project. 2011-10-07. Retrieved 2011-11-18.
^ Zimmer, Carl (2009-12-29). "Scientists Start a Genomic Catalog of Earth's Abundant Microbes". The New York Times. ISSN 0362-4331. Retrieved 2012-12-21.}
^ Wu, D.; Hugenholtz, P.; Mavromatis, K.; Pukall, R. D.; Dalin, E.; Ivanova, N. N.; Kunin, V.; Goodwin, L.; Wu, M.; Tindall, B. J.; Hooper, S. D.; Pati, A.; Lykidis, A.; Spring, S.; Anderson, I. J.; d'Haeseleer, P.; Zemla, A.; Singer, M.; Lapidus, A.; Nolan, M.; Copeland, A.; Han, C.; Chen, F.; Cheng, J. F.; Lucas, S.; Kerfeld, C.; Lang, E.; Gronow, S.; Chain, P.; Bruce, D. (2009). "A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea". Nature. 462 (7276): 1056–1060. Bibcode: 2009Natur.462.1056W. doi: 10.1038/nature08656. PMC 3073058. PMID 20033048.
^ "Human gene number slashed". BBC. 2004-10-20. Retrieved 2012-12-21.
^ Yue, G. H.; Lo, L. C.; Zhu, Z. Y.; Lin, G.; Feng, F. (2006). "The complete nucleotide sequence of the mitochondrial genome of Tetraodon nigroviridis". DNA Sequence : The Journal of DNA Sequencing and Mapping. 17 (2): 115–121. doi: 10.1080/10425170600700378. PMID 17076253. S2CID 21797344.
^ National Human Genome Research Institute (2004-07-14). "Dog Genome Assembled: Canine Genome Now Available to Research Community Worldwide". Genome.gov. Retrieved 2012-01-20.
^ ^a ^b McElheny, Victor (2010). Drawing the map of life : inside the Human Genome Project. New York NY: Basic Books. ISBN 9780465043330.
^ McVean, G. A.; Abecasis, D. M.; Auton, R. M.; Brooks, G. A. R.; Depristo, D. R.; Durbin, A.; Handsaker, A. G.; Kang, P.; Marth, E. E.; McVean, P.; Gabriel, S. B.; Gibbs, R. A.; Green, E. D.; Hurles, M. E.; Knoppers, B. M.; Korbel, J. O.; Lander, E. S.; Lee, C.; Lehrach, H.; Mardis, E. R.; Marth, G. T.; McVean, G. A.; Nickerson, D. A.; Schmidt, J. P.; Sherry, S. T.; Wang, J.; Wilson, R. K.; Gibbs (Principal Investigator), R. A.; Dinh, H.; et al. (2012). "An integrated map of genetic variation from 1,092 human genomes". Nature. 491 (7422): 56–65. Bibcode: 2012Natur.491...56T. doi: 10.1038/nature11632. PMC 3498066. PMID 23128226.
^ Nielsen, R. (2010). "Genomics: In search of rare human variants". Nature. 467 (7319): 1050–1051. Bibcode: 2010Natur.467.1050N. doi: 10.1038/4671050a. PMID 20981085. S2CID 5889828.
^ ^a ^b Barnes, Barry (2008). Genomes and what to make of them. Chicago: University of Chicago Press. ISBN 9780226172958-0226172953. {{ cite book}}: Check |isbn= value: length ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help) Cite error: The named reference "barnes2008" was defined multiple times with different content (see the help page).
^ Pontin, Jason (February 2011). "A Decade of Genomics". Technology Review. 114 (1): 8. ISSN 1099-274X.
^ Culver, Kenneth W. (2002-11-08). "Genomics". In Richard Robinson (ed.) (ed.). Genetics. Macmillan Science Library. Macmillan Reference USA. ISBN 0028656067. {{ cite encyclopedia}}: |editor= has generic name ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)
^ Pierce, Benjamin A. (2008-12-05). Genetics. Macmillan. ISBN 9781429233248.
^ ^a ^b Monya Baker (2012-09-14). "Benchtop sequencers ship off" (Blog). Nature News Blog. Retrieved 2012-12-22.
^ Quail, M.; Smith, M. E.; Coupland, P.; Otto, T. D.; Harris, S. R.; Connor, T. R.; Bertoni, A.; Swerdlow, H. P.; Gu, Y. (2012). "A tale of three next generation sequencing platforms: Comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencers". BMC Genomics. 13: 341. doi: 10.1186/1471-2164-13-341. PMC 3431227. PMID 22827831.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)
^ ^a ^b Staden, R (1979 Jun 11). "A strategy of DNA sequencing employing computer programs". Nucleic Acids Research. 6 (7): 2601–10. doi: 10.1093/nar/6.7.2601. PMC 327874. PMID 461197. {{ cite journal}}: Check date values in: |date= ( help)
^ Anderson, S. (1981). "Shotgun DNA sequencing using cloned DNase I-generated fragments". Nucleic Acids Research. 9 (13): 3015–3027. doi: 10.1093/nar/9.13.3015. PMC 327328. PMID 6269069.
^ Sanger F, Coulson AR (May 1975). "A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase". J. Mol. Biol. 94 (3): 441–8. doi: 10.1016/0022-2836(75)90213-2. PMID 1100841.{{ cite journal}}: CS1 maint: date and year ( link)
^ ^a ^b Mavromatis, K.; Land, M. L.; Brettin, T. S.; Quest, D. J.; Copeland, A.; Clum, A.; Goodwin, L.; Woyke, T.; Lapidus, A.; Klenk, H. P.; Cottingham, R. W.; Kyrpides, N. C. (2012). Liu, Zhanjiang (ed.). "The Fast Changing Landscape of Sequencing Technologies and Their Impact on Microbial Genome Assemblies and Annotation". PLOS ONE. 7 (12): e48837. Bibcode: 2012PLoSO...748837M. doi: 10.1371/journal.pone.0048837. PMC 3520994. PMID 23251337.
^ ^a ^b Illumina, Inc. (2012-02-28). An Introduction to Next-Generation Sequencing Technology (PDF). San Diego, California, USA: Illumina, Inc. p. 12. Retrieved 2012-12-28.
^ Hall N (May 2007). "Advanced sequencing technologies and their wider impact in microbiology". J. Exp. Biol. 210 (Pt 9): 1518–25. doi: 10.1242/jeb.001370. PMID 17449817. S2CID 25688677.{{ cite journal}}: CS1 maint: date and year ( link)
^ Church GM (January 2006). "Genomes for all". Sci. Am. 294 (1): 46–54. Bibcode: 2006SciAm.294a..46C. doi: 10.1038/scientificamerican0106-46. PMID 16468433.{{ cite journal}}: CS1 maint: date and year ( link)
^ Ten Bosch, J. R.; Grody, W. W. (2008). "Keeping Up with the Next Generation". The Journal of Molecular Diagnostics. 10 (6): 484–492. doi: 10.2353/jmoldx.2008.080027. PMC 2570630. PMID 18832462.
^ Tucker, T.; Marra, M.; Friedman, J. M. (2009). "Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine". The American Journal of Human Genetics. 85 (2): 142–154. doi: 10.1016/j.ajhg.2009.06.022. PMC 2725244. PMID 19679224.
^ Kawashima, Eric H. (2005-05-12), Method of nucleic acid amplification, retrieved 2012-12-22 {{ citation}}: Unknown parameter |coauthors= ignored (|author= suggested) ( help)
^ Mardis ER (2008). "Next-generation DNA sequencing methods" (PDF). Annu Rev Genomics Hum Genet. 9: 387–402. doi: 10.1146/annurev.genom.9.081307.164359. PMID 18576944.
^ Gilbert, David (2009-04-29). "DNA of Uncultured Organisms Sequenced Using Novel Single-Cell Approach" (Press Release). DOE Joint Genome Institute. Retrieved 2012-12-23.
^ Woyke, T.; Xie, G.; Copeland, A.; González, J. M.; Han, C.; Kiss, H.; Saw, J. H.; Senin, P.; Yang, C.; Chatterji, S.; Cheng, J. F.; Eisen, J. A.; Sieracki, M. E.; Stepanauskas, R. (2009). Ahmed, Niyaz (ed.). "Assembling the Marine Metagenome, One Cell at a Time". PLOS ONE. 4 (4): e5299. Bibcode: 2009PLoSO...4.5299W. doi: 10.1371/journal.pone.0005299. PMC 2668756. PMID 19390573.
^ Zhang, L.; Cui, X.; Schmitt, K.; Hubert, R.; Navidi, W.; Arnheim, N. (1992-07-01). "Whole genome amplification from a single cell: implications for genetic analysis". Proceedings of the National Academy of Sciences of the United States of America. 89 (13): 5847–5851. Bibcode: 1992PNAS...89.5847Z. doi: 10.1073/pnas.89.13.5847. ISSN 0027-8424. PMC 49394. PMID 1631067.
^ Stein, Richard A. (2009-06-01). "Single-Cell Genomics Clarifies Big Picture". Genetic Engineering & Biotechnology News. Vol. 29, no. 11. Retrieved 2012-12-23.
^ Chain, P. S. G.; Grafham, D. V.; Fulton, R. S.; Fitzgerald, M. G.; Hostetler, J.; Muzny, D.; Ali, J.; Birren, B.; Bruce, D. C.; Buhay, C.; Cole, J. R.; Ding, Y.; Dugan, S.; Field, D.; Garrity, G. M.; Gibbs, R.; Graves, T.; Han, C. S.; Harrison, S. H.; Highlander, S.; Hugenholtz, P.; Khouri, H. M.; Kodira, C. D.; Kolker, E.; Kyrpides, N. C.; Lang, D.; Lapidus, A.; Malfatti, S. A.; Markowitz, V.; Metha, T. (2009). "Genome Project Standards in a New Era of Sequencing". Science. 326 (5950): 236–237. Bibcode: 2009Sci...326..236C. doi: 10.1126/science.1180614. PMC 3854948. PMID 19815760.
^ Pop, M.; Salzberg, S. L. (March 2008). "Bioinformatics challenges of new sequencing technology". Trends in Genetics : TIG. 24 (3): 142–149. doi: 10.1016/j.tig.2007.12.007. ISSN 0168-9525. PMC 2680276. PMID 18262676.
^ Batzoglou, Serafim (2005-01-15). "Algorithmic challenges in mammalian whole-genome assembly" (PDF). In Lynn B. Jorde, Shankar Subramaniam, Michael J. Dunn (eds.) (ed.). Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics. Chichester, UK: John Wiley & Sons, Ltd. ISBN 0470849746, 9780470849743, 047001153X, 9780470011539. Retrieved 2012-12-23. {{ cite book}}: |editor= has generic name ( help); Check |isbn= value: invalid character ( help)CS1 maint: multiple names: editors list ( link)
^ Narzisi, G.; Mishra, B. (2011). Aerts, Stein (ed.). "Comparing De Novo Genome Assembly: The Long and Short of It". PLOS ONE. 6 (4): e19175. Bibcode: 2011PLoSO...619175N. doi: 10.1371/journal.pone.0019175. PMC 3084767. PMID 21559467.
^ Zhang, Wenyu; Chen, Jiajia; Yang, Yang; Tang, Yifei; Shang, Jing; Shen, Bairong (2011-03-14). "A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies". PLOS ONE. 6 (3): e17915. Bibcode: 2011PLoSO...617915Z. doi: 10.1371/journal.pone.0017915. PMC 3056720. PMID 21423806. S2CID 4648109.
^ Stein, L. (2001). "Genome Annotation: From Sequence to Biology". Nature Reviews Genetics. 2 (7): 493–503. doi: 10.1038/35080529. PMID 11433356. S2CID 12044602.
^ Brent, Michael R (January 2008). "Steady progress and recent breakthroughs in the accuracy of automated genome annotation" (PDF). Nature Reviews. Genetics. 9 (1): 62–73. doi: 10.1038/nrg2220. ISSN 1471-0064. PMID 18087260. S2CID 20412451.
^ Flicek, P.; Ahmed, I.; Amode, M. R.; Barrell, D.; Beal, K.; Brent, S.; Carvalho-Silva, D.; Clapham, P.; Coates, G.; Fairley, S.; Fitzgerald, S.; Gil, L.; Garcia-Giron, C.; Gordon, L.; Hourlier, T.; Hunt, S.; Juettemann, T.; Kahari, A. K.; Keenan, S.; Komorowska, M.; Kulesha, E.; Longden, I.; Maurel, T.; McLaren, W. M.; Muffato, M.; Nag, R.; Overduin, B.; Pignatelli, M.; Pritchard, B.; Pritchard, E. (2012). "Ensembl 2013". Nucleic Acids Research. 41 (D1): D48–D55. doi: 10.1093/nar/gks1236. PMC 3531136. PMID 23203987.
^ Keith, J. M. (2008). Keith, Jonathan M (ed.). Bioinformatics. Methods in Molecular Biology. Vol. 453. pp. v–vi. doi: 10.1007/978-1-60327-429-6. ISBN 978-1-60327-428-9. PMID 18720577.
^ Marsden, R. L.; Lewis, T. A.; Orengo, C. A. (2007). "Towards a comprehensive structural coverage of completed genomes: A structural genomics viewpoint". BMC Bioinformatics. 8: 86. doi: 10.1186/1471-2105-8-86. PMC 1829165. PMID 17349043.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)
^ Brenner, S. E.; Levitt, I. (2008). "Expectations from structural genomics". Protein Science. 9 (1): 197–200. doi: 10.1110/ps.9.1.197. PMC 2144435. PMID 10739263.
^ Brenner, S. E. (2001). "A tour of structural genomics" (PDF). Nature Reviews Genetics. 2 (10): 801–809. doi: 10.1038/35093574. PMID 11584296. S2CID 5656447. Retrieved 2012-12-07.
^ ^a ^b Francis, Richard C (2011). Epigenetics : the ultimate mystery of inheritance. New York: W.W. Norton. ISBN 9780393070057.
^ Laird, P. W. (2010). "Principles and challenges of genome-wide DNA methylation analysis". Nature Reviews Genetics. 11 (3): 191–203. doi: 10.1038/nrg2732. PMID 20125086. S2CID 6780101.
^ Hugenholtz, Philip; Goebel, Brett M.; Pace, Norman R. (1 September 1998). "Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity". J. Bacteriol. 180 (18): 4765–74. doi: 10.1128/JB.180.18.4765-4774.1998. PMC 107498. PMID 9733676.
^ Eisen, JA (2007). "Environmental Shotgun Sequencing: Its Potential and Challenges for Studying the Hidden World of Microbes". PLOS Biology. 5 (3): e82. doi: 10.1371/journal.pbio.0050082. PMC 1821061. PMID 17355177.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)
^ Marco, D, ed. (2010). Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN 978-1-904455-54-7.
^ Marco, D, ed. (2011). Metagenomics: Current Innovations and Future Trends. Caister Academic Press. ISBN 978-1-904455-87-5.
^ Canchaya, Carlos; Proux, Caroline; Fournous, Ghislain; Bruttin, Anne; BrüSsow, Harald (June 2003). "Prophage Genomics". Microbiology and Molecular Biology Reviews. 67 (2): 238–276. doi: 10.1128/MMBR.67.2.238-276.2003. ISSN 1092-2172. PMC 156470. PMID 12794192.
^ McGrath S and van Sinderen D, ed. (2007). Bacteriophage: Genetics and Molecular Biology (1st ed.). Caister Academic Press. ISBN 978-1-904455-14-1.
^ Fouts, Derrick E. (November 2006). "Phage_Finder: Automated identification and classification of prophage regions in complete bacterial genome sequences". Nucleic Acids Research. 34 (20): 5839–5851. doi: 10.1093/nar/gkl732. ISSN 0305-1048. PMC 1635311. PMID 17062630. S2CID 13318164.
^ Office of Science (2005-08). Genomics:GTL Roadmap (PDF). US Department of Energy. {{ cite conference}}: Check date values in: |date= ( help)
^ Microbiology in the 21st Century: Where Are We and Where Are We Going?. American Academy of Microbiology. 2004.
^ Herrero A and Flores E, ed. (2008). The Cyanobacteria: Molecular Biology, Genomics and Evolution (1st ed.). Caister Academic Press. ISBN 978-1-904455-15-8.
^ Guttmacher, Alan E.; Collins, Francis S. (2002-11-07). "Genomic medicine--a primer". The New England Journal of Medicine. 347 (19): 1512–20. doi: 10.1056/NEJMra012240. ISSN 0028-4793. PMID 12421895. ProQuest 220135424. Retrieved 2012-12-07.
^ Phimister, Elizabeth G.; Feero, W. Gregory; Guttmacher, Alan E. (2012-02-23). "Realizing Genomic Medicine". The New England Journal of Medicine. 366 (8): 757–9. doi: 10.1056/NEJMe1200749. ISSN 0028-4793. PMID 22356329. ProQuest 923268241. Retrieved 2012-12-07.
^ Bodurtha, J.; Strauss Jf, 3rd (2012-01-05). "Genomic Medicine: Genomics and Perinatal Care". The New England Journal of Medicine. 366 (1): 64–73. doi: 10.1056/NEJMra1105043. ISSN 0028-4793. PMC 4877696. PMID 22216843. ProQuest 914354372.{{ cite journal}}: CS1 maint: numeric names: authors list ( link)
^ Hudson, K. L. (2011-09-15). "Genomic Medicine: Genomics, Health Care, and Society". The New England Journal of Medicine. 365 (11): 1033–1041. doi: 10.1056/NEJMra1010517. ISSN 0028-4793. PMID 21916641. ProQuest 890081959. Retrieved 2012-12-07.
^ O'Donnell, C. J.; Nabel, E. G. (2011-12-01). "Genomic Medicine: Genomics of Cardiovascular Disease". The New England Journal of Medicine. 365 (22): 2098–109. doi: 10.1056/NEJMra1105239. ISSN 0028-4793. PMID 22129254. ProQuest 907232641. Retrieved 2012-12-07.
^ Church, George M (2012). Regenesis : how synthetic biology will reinvent nature and ourselves. New York: Basic Books. ISBN 9780465021758-0465021751. {{ cite book}}: Check |isbn= value: length ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)
^ Vitti, Joseph J.; Cho, Mildred K.; Tishkoff, Sarah A.; Sabeti, Pardis C. (March 2012). "Human evolutionary genomics: ethical and interpretive issues". Trends in Genetics. 28 (3): 137–145. doi: 10.1016/j.tig.2011.12.001. ISSN 0168-9525. PMC 5210170. PMID 22265990.
^ Wade, PeterLewis (October 2005). "Genomics and Race". Anthropology Today. 21 (5): 20–21. doi: 10.1111/j.0268-540X.2005.00382.x. ISSN 0268-540X.
^ Holden, Constance (2008-11-07). "The Touchy Subject of 'Race'". Science. 322 (5903): 839. doi: 10.1126/science.322.5903.839a. ISSN 1095-9203 0036-8075, 1095-9203. PMID 18988817. S2CID 32192135. Retrieved 2012-12-07. {{ cite journal}}: Check |issn= value ( help)
^ Cooper, Richard S.; Kaufman, Jay S.; Ward, Ryk (2003-03-20). "Race and genomics". The New England Journal of Medicine. 348 (12): 1166–70. doi: 10.1056/NEJMsb022863. ISSN 0028-4793. PMID 12646675. ProQuest 220163870. Retrieved 2012-12-07.

External links

Annual Review of Genomics and Human Genetics
BMC Genomics: A BMC journal on Genomics
Genomics journal
NHGRI: US government's genome institute
JCVI Comprehensive Microbial Resource
KoreaGenome.org: The first Korean Genome published and the sequence is available freely.
GenomicsNetwork: Looks at the development and use of the science and technologies of genomics.
Institute for Genome Sciences: Genomics research.
MIT OpenCourseWare HST.512 Genomic Medicine A free, self-study course in genomic medicine. Resources include audio lectures and selected lecture notes.

[gov2010-1] National Human Genome Research Institute (2010-11-08). "A Brief Guide to Genomics". Genome.gov. Retrieved 2011-12-03.

[klug2012-2] Concepts of genetics (10th ed.). San Francisco: Pearson Education. 2012. ISBN 9780321724120.

[pevsner2009-3] ^ ^a ^b ^c ^d ^e ^f ^g Pevsner, Jonathan (2009). Bioinformatics and functional genomics (2nd ed.). Hoboken, N.J: Wiley-Blackwell. ISBN 9780470085851.

[gov2010a-4] National Human Genome Research Institute (2010-11-08). "FAQ About Genetic and Genomic Science". Genome.gov. Retrieved 2011-12-03.

[oxford-5] "Genome, n.". Oxford English Dictionary (Third ed.). Oxford University Press. 2008. Retrieved 2012-12-01.(subscription required)

[yadav2007-6] Yadav, S. P. (2007). "The wholeness in suffix -omics, -omes, and the word om". Journal of Biomolecular Techniques : JBT. 18 (5): 277. PMC 2392988. PMID 18166670.

[ankeny2003-7] Ankeny, Rachel A. (June 2003). "Sequencing the genome from nematode to human: changing methods, changing science". Endeavour. 27 (2): 87–92. doi: 10.1016/S0160-9327(03)00061-9. ISSN 0160-9327. PMID 12798815. Retrieved 2012-06-18.

[holley1965a-8] Holley RW, Everett GA, Madison JT, Zamir A. (1965 May). "Nucleotide Sequences In The Yeast Alanine Transfer Ribonucleic Acid" (PDF). J Biol Chem. 240 (5): 2122–8. doi: 10.1016/S0021-9258(18)97435-1. PMID 14299636. {{ cite journal}}: Check date values in: |date= ( help)CS1 maint: multiple names: authors list ( link)

[holley1965b-9] Holley RW, Apgar J, Everett GA, Madison JT, Marquisee M, Merrill SH, Penswick JR, Zamir A (1965-03-19). "Structure Of A Ribonucleic Acid". Science. 147 (3664): 1462–5. Bibcode: 1965Sci...147.1462H. doi: 10.1126/science.147.3664.1462. PMID 14263761. S2CID 40989800.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)

[pmid5330357-10] Nirenberg M, Leder P, Bernfield M, Brimacombe R, Trupin J, Rottman F, O'Neal C (May 1965). "RNA codewords and protein synthesis, VII. On the general nature of the RNA code". Proc. Natl. Acad. Sci. U.S.A. 53 (5): 1161–8. Bibcode: 1965PNAS...53.1161N. doi: 10.1073/pnas.53.5.1161. PMC 301388. PMID 5330357.{{ cite journal}}: CS1 maint: date and year ( link) CS1 maint: multiple names: authors list ( link)

[min1972-11] Min Jou W, Haegeman G, Ysebaert M, Fiers W (1972). "Nucleotide sequence of the gene coding for the bacteriophage MS2 coat protein". Nature. 237 (5350): 82–88. Bibcode: 1972Natur.237...82J. doi: 10.1038/237082a0. PMID 4555447. S2CID 4153893.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)

[fiers1976-12] Fiers W, Contreras R, Duerinck F, Haegeman G, Iserentant D, Merregaert J, Min Jou W, Molemans F, Raeymaekers A, Van den Berghe A, Volckaert G, Ysebaert M (1976). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature. 260 (5551): 500–507. Bibcode: 1976Natur.260..500F. doi: 10.1038/260500a0. PMID 1264203. S2CID 4289674.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)

[fiers1978-13] Fiers, W.; Contreras, R.; Haegeman, G.; Rogiers, R.; Van De Voorde, A.; Van Heuverswyn, H.; Van Herreweghe, J.; Volckaert, G.; Ysebaert, M. (1978-05-11). "Complete nucleotide sequence of SV40 DNA". Nature. 273 (5658): 113–120. Bibcode: 1978Natur.273..113F. doi: 10.1038/273113a0. ISSN 0028-0836. PMID 205802. S2CID 1634424. Retrieved 2012-12-20. {{ cite journal}}: Text "113-120" ignored ( help)

[tamarin2004-14] Tamarin, Robert H (2004). Principles of genetics (7 ed.). London: McGraw Hill. ISBN 0071243208-9780071243209. {{ cite book}}: Check |isbn= value: length ( help)

[lecture80-15] Sanger, F. (1980), Nobel lecture: Determination of nucleotide sequences in DNA (PDF), Nobelprize.org, retrieved 2010-10-18

[sanger1977-16] Sanger F, Air GM, Barrell BG, Brown NL, Coulson AR, Fiddes CA, Hutchison CA, Slocombe PM, Smith M (1977). "Nucleotide sequence of bacteriophage phi X174 DNA". Nature. 265 (5596): 687–695. Bibcode: 1977Natur.265..687S. doi: 10.1038/265687a0. PMID 870828. S2CID 4206886.{{ cite journal}}: CS1 maint: multiple names: authors list ( link)

[kaiser2003-17] Kaiser, Olaf; Bartels, Daniela; Bekel, Thomas; Goesmann, Alexander; Kespohl, Sebastian; Pühler, Alfred; Meyer, Folker (December 2003). "Whole genome shotgun sequencing guided by bioinformatics pipelines—an optimized approach for an established technique" (PDF). Journal of Biotechnology. 106 (2–3): 121–133. doi: 10.1016/j.jbiotec.2003.08.008. ISSN 0168-1656. PMID 14651855. Retrieved 2012-12-20.

[sanger1977a-18] Sanger, F.; Nicklen, S.; Coulson, A. R. (1977). "DNA sequencing with chain-terminating inhibitors". Proceedings of the National Academy of Sciences of the United States of America. 74 (12): 5463–5467. Bibcode: 1977PNAS...74.5463S. doi: 10.1073/pnas.74.12.5463. PMC 431765. PMID 271968.

[gilbert1977-19] Maxam, A. M.; Gilbert, W. (February 1977). "A new method for sequencing DNA". Proceedings of the National Academy of Sciences of the United States of America. 74 (2): 560–564. Bibcode: 1977PNAS...74..560M. doi: 10.1073/pnas.74.2.560. ISSN 0027-8424. PMC 392330. PMID 265521.

[darden2010-20] Darden, Lindley (2010). "Molecular Biology". In Edward N. Zalta (ed.) (ed.). The Stanford Encyclopedia of Philosophy (Fall 2010 ed.). Retrieved 2012-12-20. {{ cite book}}: |editor= has generic name ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)

[anderson1981-21] Anderson, S.; Bankier, A. T.; Barrell, B. G.; De Bruijn, M. H. L.; Coulson, A. R.; Drouin, J.; Eperon, I. C.; Nierlich, D. P.; Roe, B. A.; Sanger, F.; Schreier, P. H.; Smith, A. J. H.; Staden, R.; Young, I. G. (1981). "Sequence and organization of the human mitochondrial genome". Nature. 290 (5806): 457–465. Bibcode: 1981Natur.290..457A. doi: 10.1038/290457a0. PMID 7219534. S2CID 4355527.(subscription required)

[shinozaki1986-22] Shinozaki, K.; Ohme, M.; Tanaka, M.; Wakasugi, T.; Hayashida, N.; Matsubayashi, T.; Zaita, N.; Chunwongse, J.; Obokata, J.; Yamaguchi-Shinozaki, K.; Ohto, C.; Torazawa, K.; Meng, B. Y.; Sugita, M.; Deno, H.; Kamogashira, T.; Yamada, K.; Kusuda, J.; Takaiwa, F.; Kato, A.; Tohdoh, N.; Shimada, H.; Sugiura, M. (1986). "The complete nucleotide sequence of the tobacco chloroplast genome: Its gene organization and expression". The EMBO Journal. 5 (9): 2043–2049. doi: 10.1002/j.1460-2075.1986.tb04464.x. PMC 1167080. PMID 16453699.

[ohyama1986-23] Ohyama, K.; Fukuzawa, H.; Kohchi, T.; Shirai, H.; Sano, T.; Sano, S.; Umesono, K.; Shiki, Y.; Takeuchi, M.; Chang, Z.; Aota, S. I.; Inokuchi, H.; Ozeki, H. (1986). "Chloroplast gene organization deduced from complete sequence of liverwort Marchantia polymorpha chloroplast DNA". Nature. 322 (6079): 572. Bibcode: 1986Natur.322..572O. doi: 10.1038/322572a0. S2CID 4311952.

[oliver1992-24] Oliver, S. G.; Van Der Aart, Q. J. M.; Agostoni-Carbone, M. L.; Aigle, M.; Alberghina, L.; Alexandraki, D.; Antoine, G.; Anwar, R.; Ballesta, J. P. G.; Benit, P.; Berben, G.; Bergantino, E.; Biteau, N.; Bolle, P. A.; Bolotin-Fukuhara, M.; Brown, A.; Brown, A. J. P.; Buhler, J. M.; Carcano, C.; Carignani, G.; Cederberg, H.; Chanet, R.; Contreras, R.; Crouzet, M.; Daignan-Fornier, B.; Defoor, E.; Delgado, M.; Demolder, J.; Doira, C.; et al. (1992). "The complete DNA sequence of yeast chromosome III". Nature. 357 (6373): 38–46. Bibcode: 1992Natur.357...38O. doi: 10.1038/357038a0. PMID 1574125. S2CID 4271784.

[fleischmann1995-25] Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM; et al. (1995). "Whole-genome random sequencing and assembly of Haemophilus influenzae Rd". Science. 269 (5223): 496–512. Bibcode: 1995Sci...269..496F. doi: 10.1126/science.7542800. PMID 7542800. {{ cite journal}}: Explicit use of et al. in: |author= ( help)CS1 maint: multiple names: authors list ( link)

[goffeau1996-26] Goffeau, A.; Barrell, B. G.; Bussey, H.; Davis, R. W.; Dujon, B.; Feldmann, H.; Galibert, F.; Hoheisel, J. D.; Jacq, C.; Johnston, M.; Louis, E. J.; Mewes, H. W.; Murakami, Y.; Philippsen, P.; Tettelin, H.; Oliver, S. G. (Oct 1996). "Life with 6000 genes". Science. 274 (5287): 546, 563–7. Bibcode: 1996Sci...274..546G. doi: 10.1126/science.274.5287.546. ISSN 0036-8075. PMID 8849441. S2CID 16763139.(subscription required)

[viruses2011-27] "Complete genomes: Viruses". NCBI. 2011-11-17. Retrieved 2011-11-18.

[entrez2011-28] "Genome Project Statistics". Entrez Genome Project. 2011-10-07. Retrieved 2011-11-18.

[zimmer2009a-29] Zimmer, Carl (2009-12-29). "Scientists Start a Genomic Catalog of Earth's Abundant Microbes". The New York Times. ISSN 0362-4331. Retrieved 2012-12-21.}

[geba2009-30] Wu, D.; Hugenholtz, P.; Mavromatis, K.; Pukall, R. D.; Dalin, E.; Ivanova, N. N.; Kunin, V.; Goodwin, L.; Wu, M.; Tindall, B. J.; Hooper, S. D.; Pati, A.; Lykidis, A.; Spring, S.; Anderson, I. J.; d'Haeseleer, P.; Zemla, A.; Singer, M.; Lapidus, A.; Nolan, M.; Copeland, A.; Han, C.; Chen, F.; Cheng, J. F.; Lucas, S.; Kerfeld, C.; Lang, E.; Gronow, S.; Chain, P.; Bruce, D. (2009). "A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea". Nature. 462 (7276): 1056–1060. Bibcode: 2009Natur.462.1056W. doi: 10.1038/nature08656. PMC 3073058. PMID 20033048.

[bbc2004-31] "Human gene number slashed". BBC. 2004-10-20. Retrieved 2012-12-21.

[yue2001-32] Yue, G. H.; Lo, L. C.; Zhu, Z. Y.; Lin, G.; Feng, F. (2006). "The complete nucleotide sequence of the mitochondrial genome of Tetraodon nigroviridis". DNA Sequence : The Journal of DNA Sequencing and Mapping. 17 (2): 115–121. doi: 10.1080/10425170600700378. PMID 17076253. S2CID 21797344.

[nhgriDog2004-33] National Human Genome Research Institute (2004-07-14). "Dog Genome Assembled: Canine Genome Now Available to Research Community Worldwide". Genome.gov. Retrieved 2012-01-20.

[mcelheny2010-34] McElheny, Victor (2010). Drawing the map of life : inside the Human Genome Project. New York NY: Basic Books. ISBN 9780465043330.

[1000genomes2012-35] McVean, G. A.; Abecasis, D. M.; Auton, R. M.; Brooks, G. A. R.; Depristo, D. R.; Durbin, A.; Handsaker, A. G.; Kang, P.; Marth, E. E.; McVean, P.; Gabriel, S. B.; Gibbs, R. A.; Green, E. D.; Hurles, M. E.; Knoppers, B. M.; Korbel, J. O.; Lander, E. S.; Lee, C.; Lehrach, H.; Mardis, E. R.; Marth, G. T.; McVean, G. A.; Nickerson, D. A.; Schmidt, J. P.; Sherry, S. T.; Wang, J.; Wilson, R. K.; Gibbs (Principal Investigator), R. A.; Dinh, H.; et al. (2012). "An integrated map of genetic variation from 1,092 human genomes". Nature. 491 (7422): 56–65. Bibcode: 2012Natur.491...56T. doi: 10.1038/nature11632. PMC 3498066. PMID 23128226.

[neilsen2012-36] Nielsen, R. (2010). "Genomics: In search of rare human variants". Nature. 467 (7319): 1050–1051. Bibcode: 2010Natur.467.1050N. doi: 10.1038/4671050a. PMID 20981085. S2CID 5889828.

[barnes2008-37] Barnes, Barry (2008). Genomes and what to make of them. Chicago: University of Chicago Press. ISBN 9780226172958-0226172953. {{ cite book}}: Check |isbn= value: length ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help) Cite error: The named reference "barnes2008" was defined multiple times with different content (see the help page).

[pontin2011-38] Pontin, Jason (February 2011). "A Decade of Genomics". Technology Review. 114 (1): 8. ISSN 1099-274X.

[culver2002-39] Culver, Kenneth W. (2002-11-08). "Genomics". In Richard Robinson (ed.) (ed.). Genetics. Macmillan Science Library. Macmillan Reference USA. ISBN 0028656067. {{ cite encyclopedia}}: |editor= has generic name ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)

[pierce2008-40] Pierce, Benjamin A. (2008-12-05). Genetics. Macmillan. ISBN 9781429233248.

[baker2012blog-41] Monya Baker (2012-09-14). "Benchtop sequencers ship off" (Blog). Nature News Blog. Retrieved 2012-12-22.

[quail2012-42] Quail, M.; Smith, M. E.; Coupland, P.; Otto, T. D.; Harris, S. R.; Connor, T. R.; Bertoni, A.; Swerdlow, H. P.; Gu, Y. (2012). "A tale of three next generation sequencing platforms: Comparison of Ion torrent, pacific biosciences and illumina MiSeq sequencers". BMC Genomics. 13: 341. doi: 10.1186/1471-2164-13-341. PMC 3431227. PMID 22827831.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)

[Staden-43] Staden, R (1979 Jun 11). "A strategy of DNA sequencing employing computer programs". Nucleic Acids Research. 6 (7): 2601–10. doi: 10.1093/nar/6.7.2601. PMC 327874. PMID 461197. {{ cite journal}}: Check date values in: |date= ( help)

[anderson1981a-44] Anderson, S. (1981). "Shotgun DNA sequencing using cloned DNase I-generated fragments". Nucleic Acids Research. 9 (13): 3015–3027. doi: 10.1093/nar/9.13.3015. PMC 327328. PMID 6269069.

[sanger1975-45] Sanger F, Coulson AR (May 1975). "A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase". J. Mol. Biol. 94 (3): 441–8. doi: 10.1016/0022-2836(75)90213-2. PMID 1100841.{{ cite journal}}: CS1 maint: date and year ( link)

[mavro2012-46] Mavromatis, K.; Land, M. L.; Brettin, T. S.; Quest, D. J.; Copeland, A.; Clum, A.; Goodwin, L.; Woyke, T.; Lapidus, A.; Klenk, H. P.; Cottingham, R. W.; Kyrpides, N. C. (2012). Liu, Zhanjiang (ed.). "The Fast Changing Landscape of Sequencing Technologies and Their Impact on Microbial Genome Assemblies and Annotation". PLOS ONE. 7 (12): e48837. Bibcode: 2012PLoSO...748837M. doi: 10.1371/journal.pone.0048837. PMC 3520994. PMID 23251337.

[illumina2012-47] Illumina, Inc. (2012-02-28). An Introduction to Next-Generation Sequencing Technology (PDF). San Diego, California, USA: Illumina, Inc. p. 12. Retrieved 2012-12-28.

[hall2007-48] Hall N (May 2007). "Advanced sequencing technologies and their wider impact in microbiology". J. Exp. Biol. 210 (Pt 9): 1518–25. doi: 10.1242/jeb.001370. PMID 17449817. S2CID 25688677.{{ cite journal}}: CS1 maint: date and year ( link)

[church2005-49] Church GM (January 2006). "Genomes for all". Sci. Am. 294 (1): 46–54. Bibcode: 2006SciAm.294a..46C. doi: 10.1038/scientificamerican0106-46. PMID 16468433.{{ cite journal}}: CS1 maint: date and year ( link)

[tenBosch2008-50] Ten Bosch, J. R.; Grody, W. W. (2008). "Keeping Up with the Next Generation". The Journal of Molecular Diagnostics. 10 (6): 484–492. doi: 10.2353/jmoldx.2008.080027. PMC 2570630. PMID 18832462.

[Tucker2009-51] Tucker, T.; Marra, M.; Friedman, J. M. (2009). "Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine". The American Journal of Human Genetics. 85 (2): 142–154. doi: 10.1016/j.ajhg.2009.06.022. PMC 2725244. PMID 19679224.

[DNA_colony_patents-52] Kawashima, Eric H. (2005-05-12), Method of nucleic acid amplification, retrieved 2012-12-22 {{ citation}}: Unknown parameter |coauthors= ignored (|author= suggested) ( help)

[mardis2008-53] Mardis ER (2008). "Next-generation DNA sequencing methods" (PDF). Annu Rev Genomics Hum Genet. 9: 387–402. doi: 10.1146/annurev.genom.9.081307.164359. PMID 18576944.

[gilbert-54] Gilbert, David (2009-04-29). "DNA of Uncultured Organisms Sequenced Using Novel Single-Cell Approach" (Press Release). DOE Joint Genome Institute. Retrieved 2012-12-23.

[woyke2009-55] Woyke, T.; Xie, G.; Copeland, A.; González, J. M.; Han, C.; Kiss, H.; Saw, J. H.; Senin, P.; Yang, C.; Chatterji, S.; Cheng, J. F.; Eisen, J. A.; Sieracki, M. E.; Stepanauskas, R. (2009). Ahmed, Niyaz (ed.). "Assembling the Marine Metagenome, One Cell at a Time". PLOS ONE. 4 (4): e5299. Bibcode: 2009PLoSO...4.5299W. doi: 10.1371/journal.pone.0005299. PMC 2668756. PMID 19390573.

[zhang1992-56] Zhang, L.; Cui, X.; Schmitt, K.; Hubert, R.; Navidi, W.; Arnheim, N. (1992-07-01). "Whole genome amplification from a single cell: implications for genetic analysis". Proceedings of the National Academy of Sciences of the United States of America. 89 (13): 5847–5851. Bibcode: 1992PNAS...89.5847Z. doi: 10.1073/pnas.89.13.5847. ISSN 0027-8424. PMC 49394. PMID 1631067.

[stein2009-57] Stein, Richard A. (2009-06-01). "Single-Cell Genomics Clarifies Big Picture". Genetic Engineering & Biotechnology News. Vol. 29, no. 11. Retrieved 2012-12-23.

[finishing-58] Chain, P. S. G.; Grafham, D. V.; Fulton, R. S.; Fitzgerald, M. G.; Hostetler, J.; Muzny, D.; Ali, J.; Birren, B.; Bruce, D. C.; Buhay, C.; Cole, J. R.; Ding, Y.; Dugan, S.; Field, D.; Garrity, G. M.; Gibbs, R.; Graves, T.; Han, C. S.; Harrison, S. H.; Highlander, S.; Hugenholtz, P.; Khouri, H. M.; Kodira, C. D.; Kolker, E.; Kyrpides, N. C.; Lang, D.; Lapidus, A.; Malfatti, S. A.; Markowitz, V.; Metha, T. (2009). "Genome Project Standards in a New Era of Sequencing". Science. 326 (5950): 236–237. Bibcode: 2009Sci...326..236C. doi: 10.1126/science.1180614. PMC 3854948. PMID 19815760.

[pop2008-59] Pop, M.; Salzberg, S. L. (March 2008). "Bioinformatics challenges of new sequencing technology". Trends in Genetics : TIG. 24 (3): 142–149. doi: 10.1016/j.tig.2007.12.007. ISSN 0168-9525. PMC 2680276. PMID 18262676.

[batzoglou2005-60] Batzoglou, Serafim (2005-01-15). "Algorithmic challenges in mammalian whole-genome assembly" (PDF). In Lynn B. Jorde, Shankar Subramaniam, Michael J. Dunn (eds.) (ed.). Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics. Chichester, UK: John Wiley & Sons, Ltd. ISBN 0470849746, 9780470849743, 047001153X, 9780470011539. Retrieved 2012-12-23. {{ cite book}}: |editor= has generic name ( help); Check |isbn= value: invalid character ( help)CS1 maint: multiple names: editors list ( link)

[narzisi2011-61] Narzisi, G.; Mishra, B. (2011). Aerts, Stein (ed.). "Comparing De Novo Genome Assembly: The Long and Short of It". PLOS ONE. 6 (4): e19175. Bibcode: 2011PLoSO...619175N. doi: 10.1371/journal.pone.0019175. PMC 3084767. PMID 21559467.

[zhang2011-62] Zhang, Wenyu; Chen, Jiajia; Yang, Yang; Tang, Yifei; Shang, Jing; Shen, Bairong (2011-03-14). "A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies". PLOS ONE. 6 (3): e17915. Bibcode: 2011PLoSO...617915Z. doi: 10.1371/journal.pone.0017915. PMC 3056720. PMID 21423806. S2CID 4648109.

[Stein2001-63] Stein, L. (2001). "Genome Annotation: From Sequence to Biology". Nature Reviews Genetics. 2 (7): 493–503. doi: 10.1038/35080529. PMID 11433356. S2CID 12044602.

[brent2008-64] Brent, Michael R (January 2008). "Steady progress and recent breakthroughs in the accuracy of automated genome annotation" (PDF). Nature Reviews. Genetics. 9 (1): 62–73. doi: 10.1038/nrg2220. ISSN 1471-0064. PMID 18087260. S2CID 20412451.

[ensmbl2013-65] Flicek, P.; Ahmed, I.; Amode, M. R.; Barrell, D.; Beal, K.; Brent, S.; Carvalho-Silva, D.; Clapham, P.; Coates, G.; Fairley, S.; Fitzgerald, S.; Gil, L.; Garcia-Giron, C.; Gordon, L.; Hourlier, T.; Hunt, S.; Juettemann, T.; Kahari, A. K.; Keenan, S.; Komorowska, M.; Kulesha, E.; Longden, I.; Maurel, T.; McLaren, W. M.; Muffato, M.; Nag, R.; Overduin, B.; Pignatelli, M.; Pritchard, B.; Pritchard, E. (2012). "Ensembl 2013". Nucleic Acids Research. 41 (D1): D48–D55. doi: 10.1093/nar/gks1236. PMC 3531136. PMID 23203987.

[bioinfMethods-66] Keith, J. M. (2008). Keith, Jonathan M (ed.). Bioinformatics. Methods in Molecular Biology. Vol. 453. pp. v–vi. doi: 10.1007/978-1-60327-429-6. ISBN 978-1-60327-428-9. PMID 18720577.

[marsden2007-67] Marsden, R. L.; Lewis, T. A.; Orengo, C. A. (2007). "Towards a comprehensive structural coverage of completed genomes: A structural genomics viewpoint". BMC Bioinformatics. 8: 86. doi: 10.1186/1471-2105-8-86. PMC 1829165. PMID 17349043.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)

[brenner2000-68] Brenner, S. E.; Levitt, I. (2008). "Expectations from structural genomics". Protein Science. 9 (1): 197–200. doi: 10.1110/ps.9.1.197. PMC 2144435. PMID 10739263.

[brenner2001-69] Brenner, S. E. (2001). "A tour of structural genomics" (PDF). Nature Reviews Genetics. 2 (10): 801–809. doi: 10.1038/35093574. PMID 11584296. S2CID 5656447. Retrieved 2012-12-07.

[francis2011-70] Francis, Richard C (2011). Epigenetics : the ultimate mystery of inheritance. New York: W.W. Norton. ISBN 9780393070057.

[laird2010-71] Laird, P. W. (2010). "Principles and challenges of genome-wide DNA methylation analysis". Nature Reviews Genetics. 11 (3): 191–203. doi: 10.1038/nrg2732. PMID 20125086. S2CID 6780101.

[Hugenholz1998-72] Hugenholtz, Philip; Goebel, Brett M.; Pace, Norman R. (1 September 1998). "Impact of Culture-Independent Studies on the Emerging Phylogenetic View of Bacterial Diversity". J. Bacteriol. 180 (18): 4765–74. doi: 10.1128/JB.180.18.4765-4774.1998. PMC 107498. PMID 9733676.

[Eisen2007-73] Eisen, JA (2007). "Environmental Shotgun Sequencing: Its Potential and Challenges for Studying the Hidden World of Microbes". PLOS Biology. 5 (3): e82. doi: 10.1371/journal.pbio.0050082. PMC 1821061. PMID 17355177.{{ cite journal}}: CS1 maint: unflagged free DOI ( link)

[MarcoD-74] Marco, D, ed. (2010). Metagenomics: Theory, Methods and Applications. Caister Academic Press. ISBN 978-1-904455-54-7.

[MarcoD2011-75] Marco, D, ed. (2011). Metagenomics: Current Innovations and Future Trends. Caister Academic Press. ISBN 978-1-904455-87-5.

[canchaya2003-76] Canchaya, Carlos; Proux, Caroline; Fournous, Ghislain; Bruttin, Anne; BrüSsow, Harald (June 2003). "Prophage Genomics". Microbiology and Molecular Biology Reviews. 67 (2): 238–276. doi: 10.1128/MMBR.67.2.238-276.2003. ISSN 1092-2172. PMC 156470. PMID 12794192.

[McGrath-77] McGrath S and van Sinderen D, ed. (2007). Bacteriophage: Genetics and Molecular Biology (1st ed.). Caister Academic Press. ISBN 978-1-904455-14-1.

[fouts2005-78] Fouts, Derrick E. (November 2006). "Phage_Finder: Automated identification and classification of prophage regions in complete bacterial genome sequences". Nucleic Acids Research. 34 (20): 5839–5851. doi: 10.1093/nar/gkl732. ISSN 0305-1048. PMC 1635311. PMID 17062630. S2CID 13318164.

[doe2005-79] Office of Science (2005-08). Genomics:GTL Roadmap (PDF). US Department of Energy. {{ cite conference}}: Check date values in: |date= ( help)

[asm2004-80] Microbiology in the 21st Century: Where Are We and Where Are We Going?. American Academy of Microbiology. 2004.

[Herrero-81] Herrero A and Flores E, ed. (2008). The Cyanobacteria: Molecular Biology, Genomics and Evolution (1st ed.). Caister Academic Press. ISBN 978-1-904455-15-8.

[guttmacher2002-82] Guttmacher, Alan E.; Collins, Francis S. (2002-11-07). "Genomic medicine--a primer". The New England Journal of Medicine. 347 (19): 1512–20. doi: 10.1056/NEJMra012240. ISSN 0028-4793. PMID 12421895. ProQuest 220135424. Retrieved 2012-12-07.

[phimster2012-83] Phimister, Elizabeth G.; Feero, W. Gregory; Guttmacher, Alan E. (2012-02-23). "Realizing Genomic Medicine". The New England Journal of Medicine. 366 (8): 757–9. doi: 10.1056/NEJMe1200749. ISSN 0028-4793. PMID 22356329. ProQuest 923268241. Retrieved 2012-12-07.

[feero2012-84] Bodurtha, J.; Strauss Jf, 3rd (2012-01-05). "Genomic Medicine: Genomics and Perinatal Care". The New England Journal of Medicine. 366 (1): 64–73. doi: 10.1056/NEJMra1105043. ISSN 0028-4793. PMC 4877696. PMID 22216843. ProQuest 914354372.{{ cite journal}}: CS1 maint: numeric names: authors list ( link)

[feero2011-85] Hudson, K. L. (2011-09-15). "Genomic Medicine: Genomics, Health Care, and Society". The New England Journal of Medicine. 365 (11): 1033–1041. doi: 10.1056/NEJMra1010517. ISSN 0028-4793. PMID 21916641. ProQuest 890081959. Retrieved 2012-12-07.

[feero2011a-86] O'Donnell, C. J.; Nabel, E. G. (2011-12-01). "Genomic Medicine: Genomics of Cardiovascular Disease". The New England Journal of Medicine. 365 (22): 2098–109. doi: 10.1056/NEJMra1105239. ISSN 0028-4793. PMID 22129254. ProQuest 907232641. Retrieved 2012-12-07.

[church2012-87] Church, George M (2012). Regenesis : how synthetic biology will reinvent nature and ourselves. New York: Basic Books. ISBN 9780465021758-0465021751. {{ cite book}}: Check |isbn= value: length ( help); Unknown parameter |coauthors= ignored (|author= suggested) ( help)

[vitti2012-88] Vitti, Joseph J.; Cho, Mildred K.; Tishkoff, Sarah A.; Sabeti, Pardis C. (March 2012). "Human evolutionary genomics: ethical and interpretive issues". Trends in Genetics. 28 (3): 137–145. doi: 10.1016/j.tig.2011.12.001. ISSN 0168-9525. PMC 5210170. PMID 22265990.

[wade2005-89] Wade, PeterLewis (October 2005). "Genomics and Race". Anthropology Today. 21 (5): 20–21. doi: 10.1111/j.0268-540X.2005.00382.x. ISSN 0268-540X.

[holden2008-90] Holden, Constance (2008-11-07). "The Touchy Subject of 'Race'". Science. 322 (5903): 839. doi: 10.1126/science.322.5903.839a. ISSN 1095-9203 0036-8075, 1095-9203. PMID 18988817. S2CID 32192135. Retrieved 2012-12-07. {{ cite journal}}: Check |issn= value ( help)

[cooper2003-91] Cooper, Richard S.; Kaufman, Jay S.; Ward, Ryk (2003-03-20). "Race and genomics". The New England Journal of Medicine. 348 (12): 1166–70. doi: 10.1056/NEJMsb022863. ISSN 0028-4793. PMID 12646675. ProQuest 220163870. Retrieved 2012-12-07.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

[69]

[70]

[71]

[72]

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83]

[84]

[85]

[86]

[87]

[88]

[89]

[90]

[91]

v t e Genomics
Fields	Cognitive genomics Computational genomics Comparative genomics Functional genomics Genome project Human Genome Project Metagenomics Human Microbiome Project Pangenomics Personal genomics Population genomics Sociogenomics Structural genomics
Bioinformatics	Biochip Cheminformatics Chemogenomics Connectomics Human Connectome Project Epigenomics Human Epigenome Project Glycomics Immunomics Lipidomics Metabolomics Microbiomics Nutrigenomics Paleopolyploidy Pharmacogenetics Pharmacogenomics Systems biology Toxicogenomics Transcriptomics
Structural biology	Proteomics Human proteome project Call-map proteomics Structure-based drug design Expression proteomics
Research tools	2-D electrophoresis Mass spectrometer Electrospray ionization Matrix-assisted laser desorption ionization Matrix-assisted laser desorption ionization-time of flight mass spectrometer Microfluidic-based tools Isotope affinity tags Chromosome conformation capture
Organizations	DNA Data Bank of Japan (JP) European Molecular Biology Laboratory (EU) National Institutes of Health (USA) Wellcome Sanger Institute (UK)
List Category

v t e Genetics
Introduction Outline History Timeline Index Glossary
Key components	Chromosome DNA RNA Genome Heredity Nucleotide Mutation Genetic variation Allele Amino acid
Fields	Classical Conservation Cytogenetics Ecological Immunogenetics Microbial Molecular Population Quantitative
Archaeogenetics of	Africa the Americas the British Isles Europe Italy the Middle East South Asia
Related topics	Behavioural genetics Epigenetics Geneticist Genome editing Genomics Genetic code Genetic engineering Genetic diversity Genetic monitoring Genetic genealogy Heredity He Jiankui affair Medical genetics Missing heritability problem Molecular evolution Plant genetics Population genomics Reverse genetics
Lists	List of genetic codes List of genetics research organizations
Category

v t e Genomics
Fields	Cognitive genomics Computational genomics Comparative genomics Functional genomics Genome project Human Genome Project Metagenomics Human Microbiome Project Pangenomics Personal genomics Population genomics Sociogenomics Structural genomics
Bioinformatics	Biochip Cheminformatics Chemogenomics Connectomics Human Connectome Project Epigenomics Human Epigenome Project Glycomics Immunomics Lipidomics Metabolomics Microbiomics Nutrigenomics Paleopolyploidy Pharmacogenetics Pharmacogenomics Systems biology Toxicogenomics Transcriptomics
Structural biology	Proteomics Human proteome project Call-map proteomics Structure-based drug design Expression proteomics
Research tools	2-D electrophoresis Mass spectrometer Electrospray ionization Matrix-assisted laser desorption ionization Matrix-assisted laser desorption ionization-time of flight mass spectrometer Microfluidic-based tools Isotope affinity tags Chromosome conformation capture
Organizations	DNA Data Bank of Japan (JP) European Molecular Biology Laboratory (EU) National Institutes of Health (USA) Wellcome Sanger Institute (UK)
List Category

v t e Genetics
Introduction Outline History Timeline Index Glossary
Key components	Chromosome DNA RNA Genome Heredity Nucleotide Mutation Genetic variation Allele Amino acid
Fields	Classical Conservation Cytogenetics Ecological Immunogenetics Microbial Molecular Population Quantitative
Archaeogenetics of	Africa the Americas the British Isles Europe Italy the Middle East South Asia
Related topics	Behavioural genetics Epigenetics Geneticist Genome editing Genomics Genetic code Genetic engineering Genetic diversity Genetic monitoring Genetic genealogy Heredity He Jiankui affair Medical genetics Missing heritability problem Molecular evolution Plant genetics Population genomics Reverse genetics
Lists	List of genetic codes List of genetics research organizations
Category

History

Etymology

Early sequencing efforts

DNA sequencing technology developed

Complete genomes

Next-generation sequencing

Genome analysis

Sequencing

Shotgun sequencing

High-throughput sequencing

454 pyrosequencing

Illumina (Solexa) sequencing

Single cell genomics

Assembly

Challenges

Algorithms

Scaffolding

De-novo vs. mapping assembly

Finishing

Annotation

Sequencing pipelines and databases

Research areas

Functional genomics

Evolutionary genomics

Structural genomics

Epigenomics

Metagenomics

Study systems

Viruses and bacteriophages

Microbes

Cyanobacteria

Animals

Human genomics

Applications of genomics

Genomic medicine

Synthetic biology and bioengineering

Social aspects

'Astrological' genomics

Race and genomics

'Omics

See also

References

External links

History

Etymology

Early sequencing efforts

DNA sequencing technology developed

Complete genomes

Next-generation sequencing

Genome analysis

Sequencing

Shotgun sequencing

High-throughput sequencing

454 pyrosequencing

Illumina (Solexa) sequencing

Single cell genomics

Assembly

Challenges

Algorithms

Scaffolding

De-novo vs. mapping assembly

Finishing

Annotation

Sequencing pipelines and databases

Research areas

Functional genomics

Evolutionary genomics

Structural genomics

Epigenomics

Metagenomics

Study systems

Viruses and bacteriophages

Microbes

Cyanobacteria

Animals

Human genomics

Applications of genomics

Genomic medicine

Synthetic biology and bioengineering

Social aspects