We have about 40 pages of English text. This is text from an OCR machine. You need to compare the ocr with the original text, which is available as a PDF. There are not many mistakes but every page has some corrections. One page has about 800 words. We need an output text which is fully corrected an