Distributed Proofreaders .December 31, 2020
Distributed Proofreaders (commonly abbreviated as DP or PGDP) is a web-based project that supports the development of e-texts for Project Gutenberg by allowing many people to work together in proofreading drafts of e-texts for errors. As of June 2020, the site had digitized 39,000 titles.
Public domain works, typically books with expired copyright, are scanned by volunteers, or sourced from digitization projects and the images are run through optical character recognition (OCR) software. Since OCR software is far from perfect, many errors often appear in the resulting text. To correct them, pages are made available to volunteers via the Internet; the original page image and the recognized text appear side by side. This process thereby distributes the time-consuming error-correction process, akin to distributed computing.