Skip to main content

Table 2 Details of the datasets

From: Improving the quality of Persian clinical text with a novel spelling correction system

Dataset Source

Number of Reports

Number of Words

Number of Sentences

Average Length of Reports

Jan 2011–Feb 2015

22,504

7,538,840

396,781

335

Mar 2015–Jul 2018

15,888

4,782,288

239,114

301

Aug 2018–Jun 2023

40,251

14,007,348

1,253,736

348

Total

78,643

26,328,476

1,889,631

336