Genome Sequence Databases › Научные исследования в СПбГУ

Standard

Genome Sequence Databases : Sequencing and Assembly. / Lapidus, A. L.

Encyclopedia of Microbiology. Elsevier, 2009. стр. 196-210.

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › глава/раздел › научная › Рецензирование

Author

Lapidus, A. L. / Genome Sequence Databases : Sequencing and Assembly. Encyclopedia of Microbiology. Elsevier, 2009. стр. 196-210

BibTeX

@inbook{d4bbf33252ed450aba34eab1ee2a7f7f,

title = "Genome Sequence Databases: Sequencing and Assembly",

abstract = "From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three-dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality (<1 error/10 000 bp), validated through a number of computer and laboratory experiments.",

keywords = "Contig, DNA sequencing, Genome finishing, Misassembly, Read, Scaffold, Whole-Genome Shotgun assembly",

author = "Lapidus, {A. L.}",

year = "2009",

month = jan,

day = "1",

doi = "10.1016/B978-012373944-5.00028-6",

language = "English",

isbn = "9780123739445",

pages = "196--210",

booktitle = "Encyclopedia of Microbiology",

publisher = "Elsevier",

address = "Netherlands",

}

RIS

TY - CHAP

T1 - Genome Sequence Databases

T2 - Sequencing and Assembly

AU - Lapidus, A. L.

PY - 2009/1/1

Y1 - 2009/1/1

N2 - From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three-dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality (<1 error/10 000 bp), validated through a number of computer and laboratory experiments.

AB - From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three-dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality (<1 error/10 000 bp), validated through a number of computer and laboratory experiments.

KW - Contig

KW - DNA sequencing

KW - Genome finishing

KW - Misassembly

KW - Read

KW - Scaffold

KW - Whole-Genome Shotgun assembly

UR - http://www.scopus.com/inward/record.url?scp=85069185830&partnerID=8YFLogxK

U2 - 10.1016/B978-012373944-5.00028-6

DO - 10.1016/B978-012373944-5.00028-6

M3 - Chapter

AN - SCOPUS:85069185830

SN - 9780123739445

SP - 196

EP - 210

BT - Encyclopedia of Microbiology

PB - Elsevier

ER -

ID: 61628327

Genome Sequence Databases: Sequencing and Assembly

Standard

Harvard

APA

Vancouver

Author

BibTeX

RIS