any Python tools for reading Scantron-style data [closed]

人走茶凉 提交于 2019-12-10 19:39:49

问题


I am interested in doing some snail mail based surveys but I am looking for quick ways to digitize the surveys they send back.

So if I had a question and 5 boxes beneath it where you would indicate your opinion by checking the appropriate box, does anything exist where I could scan it and run it through a piece of software that spit out the responses.

Edit clarification:

I am inquiring about what I need to do after the paper has been digitized. I want to write some code that looks at an image file and recognizes which box has been marked in and outputs a representation of the respondents answers.

I would be looking at a page scanned from a desktop scanner or something similar.


回答1:


From what i see you don't really need ICR (intelligent character recognition, used for handwritten and handprinted texts), but what you need is OMR - optical mark recognition (capturing human-marked data from document forms such as surveys and tests).

The bad news is you would hardly find an opensource library for python. But there's a solution - you can use a cloud SDK, it's a website that let you upload an image and send you back an OCR'ed data. Try www.ocrsdk.com, it is a cloud based OCR SDK recently launched by ABBYY. It's now in closed beta so it's completely free to use.

It has both ICR and OMR api methods and a set of python code samples.




回答2:


I don't really see what this has to do with python, unless of course you've already digitized the results and are now looking to tally up the results. It sounds like you still need to scan the results in and as far as I know, python doesn't have any direct capabilities of doing something like that. You're going to have to get your hands on a scanner first, and only then can you use python to read through the data.




回答3:


The SDAPS project (repo) might be worth a look. It may not handle arbitrary scanned images, as it seems to expect an ODT or LaTeX document at the beginning of the process.

Overview

SDAPS is an open source (GPLv3, LPPL) optical mark recognition (OMR) program. It is written in python and has an integrated workflow with both LibreOffice and LaTeX to create questionnaires.

Workflow

With SDAPS you create the questionnaire using either LibreOffice or LaTeX. After this some processing is done to collect the information about the survey (questions, and answers) and a printable PDF is created. The filled out questionnaires only need to be scanned in (example). SDAPS will do the optical mark recognition and can create a PDF report (example) or export the data. Optionally it is possible to manually correct the results using a graphical user interface.


来源:https://stackoverflow.com/questions/8793702/any-python-tools-for-reading-scantron-style-data

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!