From: A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation
Database | Year | Language | Content | Mode | Writers | Statistics |
---|---|---|---|---|---|---|
PE92 [17] | 1993 | Korean | Isolated characters | Offline | 500+ | 235,000 characters |
CEDAR [14] | 1994 | English | Words, characters, digits | Offline |  | 10,570 words, 27,835 characters, 21,179 digits |
NIST [6] | 1995 | English | Isolated digits | Offline | 3600 | 810,000 digits |
JPCD [171] | 1997 | Japanese | Characters | Online | 80 | 1227 character categories |
MNIST [7] | 1998 | English | Isolated digits | Offline |  | 70,000 digits |
Al-Isra [21] | 1999 | Arabic | Sentences | Offline | 500 | 500 sentences, 37,000 words, 10,000 digits |
IRONOFF [99] | 1999 | French | Words, characters, digits | Online |  | 50,000 words, 32,000 characters |
Firemaker [116] | 2000 | English | Paragraphs | Offline | 250 | 4 samples/writer |
GRUHD [29] | 2001 | Greek | Text, symbols | Offline | 1000 | 1760 forms, 667,583 symbols, 102,692 words, 123,256 digits |
 | 2002 | English | Sentences | Offline | 657 | 1539 forms, 5685 sentences, 115,320 words |
IFN/ENIT [19] | 2002 | Arabic | Words | Offline | 411 | 2265 forms, 26,449 city names |
Checks DB [13] | 2003 | Arabic | Check amounts | Offline |  | 7000 cheques, 29,498 subwords, 15,000 digits |
 | 2004 | Arabic | Sentences, check amounts | Offline | 100 | 105 forms |
ARABASE [139] | 2005 | Arabic | Sentences, words, letters | On/off | 400 | 400 forms |
IAM-OnDB [4] | 2005 | English | Sentences | Online | 221 | 1700 forms; 86,272 words |
Numerals DB [197] | 2005 | Bangla, Devanagari | Digits | Offline |  | 45,948 numerals |
IAUT/PHCN [24] | 2008 | Farsi | Isolated words | Offline | 380 | 1140 forms, 34,200 words |
RIMES [63] | 2008 | French | Sentences | Offline | 1300 | 12,723 pages |
IFN Farsi [23] | 2008 | Farsi | Words | Offline | 600 | 7271 words, 23,545 subwords |
CENPARMI-A [11] | 2008 | Arabic | Words, characters, digits | Offline | 328 | 13,439 digits, 21,426 characters, 11,375 words |
LMCA [22] | 2008 | Arabic | Words, characters, digits | Online | 55 | 30,000 digits, 100,000 characters, 500 words |
CENPARMI-U [164] | 2009 | Urdu | Words, characters, digits | Offline |  | 18,000 words |
FHT [162] | 2009 | Farsi | Sentences | Offline | 250 | 1000 forms, 106,600 words, 8050 sentences |
HCL2000 [177] | 2009 | Chinese | Characters | Offline | 1000 | 3755 characters |
CENPARMI-F [10] | 2009 | Farsi | Words, letters, digits | Offline | 400 | 432,357 images |
IAM-OnDO [52] | 2010 | English | Text, drawings, tables etc. | Online | 200 | 1000 documents |
RODRIGO [105] | 2010 | Spanish |  | Offline |  | 1853 pages |
ADAB [130] | 2011 | Arabic | Words | Online | 170 | 20,000+ words |
CASIA [188] | 2011 | Chinese | Text, characters | On/off | 1020 | 3.5M isolated characters, 1.35M characters in text |
SCUT-COUCH [181] | 2011 | Chinese | Characters | Online | 190 | 3.6M characters |
Indonesian TDB [109] | 2011 | Indonesian | Sentences | Offline | 200 | 200 forms |
AMHCD [204] | 2011 | Amazigh | Characters | Offline | 60 | 25,740 characters |
MRG-OHTC [211] | 2011 | Tibetan | Characters | Online | 130 | 910 character classes |
KHTD [201] | 2011 | Kannada | Sentences | Offline | 51 | 4000 lines, 26,000 words |
KHATT [146] | 2012 | Arabic | Sentences | Offline | 1000 | 1000 forms |
Devanagari DB [203] | 2012 | Devanagari | Digits, characters | Offline | 750 | 5137 isolated numerals |
UHSD [167] | 2012 | Urdu | Sentences | Offline | 200 | 400 forms |
QUWI [148] | 2013 | Arabic, English | Sentences | Offline | 1017 | 4068 forms |
HaFT [163] | 2013 | Farsi | Sentences | Offline | 600 | 1800 images |
CVL [27] | 2013 | English, German | Sentences | Offline | 311 | 2163 forms |
Tamil DB [202] | 2013 | Tamil | Words | Offline | 500 | 265,00 city names |
AHTID-MW [153] | 2015 | Arabic | Text lines | Offline | 53 | 3710 lines |