Somatic dataset statistics

Statistics on dataset development

Database release Release date Mutations count Ref. count Last Ref_ID Added ref. Added mutations Deleted mutations* PubMed search**
R3 - 10411 1048 1075 - - - -
R4 July 2000 14050 1320 1369 294 3798 159 Jan 1998 - Apr 2000
R5 June 2001 15121 1412 1480 111 1459 388 May - Dec 2000
R6 Jan 2002 16285 1485 1571 91 1549 385 Jan - June 2001
R7 Sept 2002 17689 1599 1715 144 1477 73 July 2001 - June 2002
R8 June 2003 18585 1680 1810 95 924 28 July 2002 - Feb 2003
R9 July 2004 19809 1769 1921 111 1196 40 March - Dec 2003
R10 July 2005 21587 1876 2055 107 1788 10 Jan - Dec 2004
R11 Nov 2006 23544 1995 2221 120 2014 57 Jan - Dec 2005
R12 Nov 2007 24810 2081 2349 86 1331 65 Jan - Dec 2006
R13*** Nov 2008 24806 2081 2349 - - 4 -
R14 Nov 2009 26597 2179 2483 98 1814 - Jan - Dec 2007
R15**** Nov 2010 27580 2218 2564 48 1021 38 2008 - 2009****
R16**** Nov 2012 29575 2279 2635 71 1995 - 2008 - 2011****
R17**** Nov 2013 29881 2285 2641 6 306 - 2010 - 2013****

* Data may be deleted if (1) they correspond to duplicate entries or (2) errors. Publication of the same set of samples in different papers by the same authors is a serious problem that has led to duplicates entries in the database in the past. We now perform systematic searches of the database under the author’s name to identify earlier entries that may correspond to the same dataset. We have also extensively reviewed the entire dataset in order to find and eliminate these duplicates. However, despite these efforts, some duplicates may remain in the database and their identification is an ongoing task.
** Papers edited in PubMed at the indicated dates were searched with selected keywords and reviewed to extract relevant data.
*** The dataset of somatic mutations has not been updated.
**** The update only include a selection of papers (see database developments).