RefSeq数据库最新版的统计数据

RefSeq的最新的版本号34,更新日期为2009/03/06,总共收录10021870条序列。总共有8054个物种,其中哺乳动物有283,植物150。

RefSeq Release 34 Statistics
Release date Mar 06, 2009
Number of Accessions Included: 10021870

Directory: complete

Number of taxids: 8054

Number of Accessions and total length per molecule type:

Genomic: 1552002              108912317855
RNA:          1778051              2880256975
Protein:     6691817              2299682138

RefSeq Status Counts:

Status                   RNA           Protein
—————————————–
Reviewed           62469        200174
Validated           23349        95117
Provisional       1239070  4134714
Predicted            21095       1405615
Inferred              692             1048
Model                   419736    387073
Unknown             11640       2490

详细:ftp://ftp.ncbi.nih.gov/refseq/release/release-statistics/