First-person reading activity recognition by deep learning with synthetically generated images

EURASIP Journal on Image and Video Processing

Table 1 Main parameters in the generation processes

Process	Parameters	Values
–	Canvas of each page	\(210\sqrt {2} \times 210\) pixel by appropriate scaling
T	Blank space	10% at the top, bottom, left, and right in the canvases
	Columns	1 or 2
	Pages containing figures	80% of the entire generated pages
	Place to put figures	The top or the bottom in a column
	Category of figures	Mathematical figures, tables, and general pictures
	Figure size	Height: 12.5∼50%, or 100% of the column height
		Width: fixed at the same length of the column
	Size of headlines	10% of the page height
	Place to put headlines	Somewhere outside the figure areas
	Place to put texts	Entire areas in the column except the figures and headlines
	Text format	Japanese characters in random order (without any rules)
	Line-breaking	Done with prob. 1% every time putting a character
D	Distortion strength	α∈[0.1,0.3]
P	Homography matrix	Determined as shown in Fig. 4
R	Rotation angles	In [ − 10°,10°]
–	Resize	The long side shrinks to 256 pixel with keeping the ratio