Genome sequence data: management, storage, and visualization
Biotechniques 46(5):333-336 (2009) PMID 19480628
Over the last few years there has been a revolution in DNA sequencing technology that has brought down the cost of DNA sequencing and made the sequencing of an increasing number of genomes both feasible and cost effective. There has also been a dramatic shift in the type of sequence data being generated, with vast numbers of short reads or pairs of short reads replacing the traditional relatively long reads produced by Sanger sequencing. These changes in data quantity and format have led to a rethinking of sequence data management, storage, and visualization, and provide a challenge for bioinformatics. The vast amount of sequence data that will be generated over the next few years will require a change in what data are stored and how users query the information.
DOI: 10.2144/000113134
Version: za2963e q8zae q8zbb q8zc6 q8zd7 q8zed q8zfc q8zg2