GigaBlog

GigaData news: Macaque DOIs published in Nature Biotechnology
This week marks another success for the fledgling practice of data citation, with two datasets from our GigaScience database published in Nature Biotechnology. The genomes sequenced by our colleagues at the BGI for the Cynomolgus and Chinese rhesus macaques were initially released in our first batch of datasets with DOIs at our launch in July, and were amongst the first (at the time) unpublished genomes released in this way. Data citation is an important concept, allowing data producers to obtain an early form of credit for releasing their work, speeding up research by encouraging early data release, and allowing the impact and reuse of data to be tracked.
After the recent success of our first dataset being published in the New England of Medicine (the genome of the recent outbreak strain of E. coli O104:H4), this is the first time one of our data DOIs has been accepted in a Nature journal. For data citation to work the assistance of journals is key, and Nature Biotechnology has been particularly helpful in promoting the scheme, arguing in an editorial as far back as 2009 that novel forms of credit for data producers were needed, and suggesting DOIs as an ideal solution for this. The Datacite consortium was set up in late 2010 to do exactly that, and we would like to thank them and the British Library for their help in issuing these DOIs.
Macaque species are the most
commonly used non-human primate models in medical research, and their genomes
will hopefully aid human disease research and drug discovery. Looking
at orthologues of human druggable protein domains in these species is aiding the
potential therapeutic exploitation of their ‘druggable genome’, and has already lead to BGI producing an exome sequencing
platform for the species. On top of their genome assemblies, the DOI landing
pages include links to functionally annotated and
coding sequence sets, as well as a link to a browser and database. After the release of
other datasets such as the CHO cell line genome, we are currently collecting
another large batch of datasets to be released, so watch this
space for further news and announcements.
References
1. Yan, G. et al. Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques. Nat Biotech advance online publication, (2011).
2. Credit where credit is overdue. Nat
Biotech 27, 579 (2009).
To cite the two datasets please use the following citations:
3. Yan, G; Zhang, G; Fang, X; Zhang, Y; Li, C; Ling, F; Cooper, DN; Li, O;
Li, Y; van Gool, AJ; Du, H; Chen, J; Chen, R; Zhang, P; Huang, Z;
Thompson, JR; Meng, Y; Bai, Y; Wang, J; Zhuo, M; Wang, T; Huang, Y; Wei,
L; Li, J; Wang, Z; Hu, H; Le, L; Stenson, PD; Li, B; Liu, X; Ball, EV;
An, N; Huang, Q; Zhang, Y; Fan, W; Zhang, X; Li, Y; Wang, W; Katze, MG;
Su, B; Nielsen, R; Yang, H; Wang, J; Wang, X; Wang, J (2011): Genomic
data from the Chinese Rhesus Macaque (Macaca mulatta lasiota).
GigaScience. doi:10.5524/100002
http://dx.doi.org/10.5524/100002
4. Yan, G; Zhang, G; Fang, X; Zhang, Y; Li, C; Ling, F; Cooper, DN; Li, O;
Li, Y; van Gool, AJ; Du, H; Chen, J; Chen, R; Zhang, P; Huang, Z;
Thompson, JR; Meng, Y; Bai, Y; Wang, J; Zhuo, M; Wang, T; Huang, Y; Wei,
L; Li, J; Wang, Z; Hu, H; Le, L; Stenson, PD; Li, B; Liu, X; Ball, EV;
An, N; Huang, Q; Zhang, Y; Fan, W; Zhang, X; Li, Y; Wang, W; Katze, MG;
Su, B; Nielsen, R; Yang, H; Wang, J; Wang, X; Wang, J (2011): Genomic
data from the Crab Eating Macaque/Cynomolgus Monkey (Macaca
fascicularis). GigaScience. doi:10.5524/100003
http://dx.doi.org/10.5524/100003
Posted at 02:26AM Oct 21, 2011 by ScottEdmunds in General | Comments[1]
Hi - this is great stuff. Even better is linking to the articles too:
Credit where credit is overdue. Nat Biotech 27, 579 (2009).
http://dx.doi.org/10.1038/nbt0709-579
Yan, G. et al. Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques. Nat Biotech
http://dx.doi.org/10.1038/nbt.1992
Posted by Ed Pentz on October 28, 2011 at 12:15 PM BST #