Skip to main content

Table 1 An overview of the databases discussed in the paper

From: A comprehensive survey of handwritten document benchmarks: structure, usage and evaluation

Database

Year

Language

Content

Mode

Writers

Statistics

PE92 [17]

1993

Korean

Isolated characters

Offline

500+

235,000 characters

CEDAR [14]

1994

English

Words, characters, digits

Offline

 

10,570 words, 27,835 characters, 21,179 digits

NIST [6]

1995

English

Isolated Digits

Offline

3600

810,000 digits

JPCD [171]

1997

Japanese

Characters

Online

80

1227 character categories

MNIST [7]

1998

English

Isolated digits

Offline

 

70,000 digits

Al-Isra [21]

1999

Arabic

Sentences

Offline

500

500 sentences, 37,000 words, 10,000 digits

IRONOFF [99]

1999

French

Words, characters, digits

Online

 

50,000 words, 32,000 characters

Firemaker [116]

2000

English

Paragraphs

Offline

250

4 samples/writers

GRUHD [29]

2001

Greek

Text, symbols

Offline

1000

1760 forms, 667,583 symbols, 102,692 words, 123,256 digits

IAM [2, 3]

2002

English

Sentences

Offline

657

1539 forms, 5685 sentences, 115,320 words

IFN/ENIT [19]

2002

Arabic

Words

Offline

411

2265 forms, 26,449 city names

Checks DB [13]

2003

Arabic

Check amounts

Offline

 

7000 cheques, 29,498 subwords, 15,000 digits

AHDB [20, 135]

2004

Arabic

Sentences,check amounts

Offline

100

105 Forms

ARABASE [139]

2005

Arabic

Sentences, words, letters

On/off

400

400 forms

IAM-OnDB [4]

2005

English

Sentences

Online

221

1700 forms; 86,272 words

Numerals DB [197]

2005

Bangla

Digits

Offline

 

45,948 numerals

  

Devanagari

    

IAUT/PHCN [24]

2008

Farsi

Isolated words

Offline

380

1140 forms, 34,200 words

RIMES [63]

2008

French

Sentences

Offline

1300

12,723 pages

IFN Farsi [23]

2008

Farsi

Words

Offline

600

7271 words, 23,545 subwords

CENPARMI-A [11]

2008

Arabic

Words, characters, digits

Offline

328

13,439 digits, 21,426 characters, 11,375 words

LMCA [22]

2008

Arabic

Words, characters, digits

Online

55

30,000 digits, 100,000 characters, 500 words

CENPARMI-U [164]

2009

Urdu

Words, characters, digits

Offline

 

18,000 words

FHT [162]

2009

Farsi

Sentences

Offline

250

1000 forms, 106,600 words, 8050 sentences

HCL2000 [177]

2009

Chinese

Characters

Offline

1000

3755 characters

CENPARMI-F [10]

2009

Farsi

Words, letters, digits

Offline

400

432,357 images

IAM-OnDO [52]

2010

English

Text, drawings, tables etc.

Online

200

1000 documents

RODRIGO [105]

2010

Spanish

 

Offline

 

1853 pages

ADAB [130]

2011

Arabic

Words

Online

170

20,000+ words

CASIA [188]

2011

Chinese

Text, characters

On/off

1020

3.5M isolated characters, 1.35M characters in text

SCUT-COUCH [181]

2011

Chinese

Characters

Online

190

3.6M characters

Indonesian TDB [109]

2011

Indonesian

Sentences

Offline

200

200 forms

AMHCD [204]

2011

Amazigh

Characters

Offline

60

25,740 characters

MRG-OHTC [211]

2011

Tibetan

Characters

Online

130

910 character classes

KHTD [201]

2011

Kannada

Sentecnes

Offline

51

4000 lines, 26,000 words

KHATT [146]

2012

Arabic

Sentences

Offline

1000

1000 forms

Devanagari DB [203]

2012

Devanagari

Digits, characters

Offline

750

5137 isoloated numerals

UHSD [167]

2012

Urdu

Sentences

Offline

200

400 forms

QUWI [148]

2013

Arabic

Sentences

Offline

1017

4068 forms

  

English

    

HaFT [163]

2013

Farsi

Sentences

Offline

600

1800 images

CVL [27]

2013

English

Sentences

Offline

311

2163 forms

  

German

    

Tamil DB [202]

2013

Tamil

Words

Offline

500

265,00 city names

AHTID-MW [153]

2015

Arabic

Text lines

Offline

53

3710 lines