Power to the people!

As some of you might be aware, the National Library of Australia has been undertaking a huge newspaper digitisation project. Whilst OCR software has come a long way in providing access to these texts, nothing is perfect. It is, in fact, users of this material that are making the outputted OCR text even better by providing feedback and correcting the OCR transcriptions as they go. The Australian Newspaper Digitisation Program states that “Users have corrected over 2 million lines of electronic text in over 100,000 articles. Over 46,000 tags have been added to articles and many comments about information in articles added.”

There is an article in the latest issue of D-Lib Magazine by Rose Holley, the Manager of the Australian Newspaper Digitisation Program, that covers this in more detail.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s