- What is clearOCR?
clearOCR is a modern OCR tool and OCR API for text extraction from PDFs, scans and document photos. It is built on a vision LLM approach and designed for high-quality OCR for Polish and English documents. clearOCR is developed by TeamQuest Sp. z o.o.
- Who operates clearOCR?
clearOCR is operated by TeamQuest Sp. z o.o., based in Warsaw, Poland.
- What file types does clearOCR support?
clearOCR supports PDF, JPG and PNG files. In the browser demo, file size and page limits may apply.
- Does clearOCR return plain text or preserve layout?
clearOCR focuses on text extraction, not visual layout reconstruction. The website returns plain text that can be copied or downloaded as TXT, while the API returns text together with additional metadata, such as the detected document language.
- Is clearOCR based on traditional OCR or AI?
clearOCR uses a modern vision LLM-based OCR approach. This helps with text extraction from scanned PDFs, document photos and business files, especially for Polish and English documents.
- Can the API return more than plain text?
Yes. In addition to text extraction, the API can return additional metadata, such as detected document language, and in selected scenarios text prepared for downstream processing, including bbcode-style or other structured text-oriented output formats.
- Can clearOCR detect the document language?
Yes. The API can return the detected document language together with the OCR result, which is useful in multilingual workflows, routing and automation.
- Can I test clearOCR without creating an account?
Yes. You can test clearOCR directly in the browser without logging in or configuring anything.
- What do I get after creating an account?
After creating an account, you get access to the OCR API, higher limits and a starter package of 1,000 free single-image OCR runs valid for 30 days.
- Does clearOCR support OCR for Polish and English documents?
Yes. clearOCR is designed for high-quality OCR for both Polish and English documents, including scanned PDFs, document photos and business files.
- How are uploaded files handled?
Files uploaded through the clearOCR website are processed only to complete the OCR request. By default, files and OCR results are removed after processing and download, with automated cleanup completing deletion within a maximum of 48 hours.
- Are uploaded files used for model training?
By default, uploaded files and OCR outputs are not used for training. Account, billing and technical data are retained only as long as needed to provide the service, protect the platform and meet legal obligations.