Zerox
OCR & Document Extraction using vision models. Contribute to getomni-ai/zerox development by creating an account on GitHub.
A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense! Zerox is available as both a Node and Python package. (Node.js SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, Google Gemini, etc.) The maintainFormat option tries to return the markdown in a consistent format by passing the output of a prior page in as additional context for the next page. This requires the requests to run synchronously, so it's a lot slower. But valuable if your documents have a lot of tabular data, or frequently have tables that cross pages. Zerox supports structured data extraction from documents using a schema. This allows you to pull specific information from documents in a structured format instead of getting the full markdown conversion. Use extractPerPage to extract data per page instead of from the whole document at once. Zerox supports a wide range of models across different providers: (Python SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, etc.) The pyzerox.zerox function is an asynchronous API that performs OCR (Optical Character Recognition) to markdown using vision models. It processes PDF files and converts them into markdown format. Make sure to set up the environment variables for the model and the model provider before using this API. Refer to the LiteLLM Documentation for setting up the environment and passing the correct model name. Note the output is manually wrapped for this documentation for better readability. This project is licensed under the MIT License. OCR Document Extraction using vision models There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page.
Unstructured
Transform complex, unstructured data into clean, AI-ready inputs. Connect to any source, process 64+ file types, and power your GenAI projects. Start
Based on the limited social mentions available, there's minimal specific user feedback about Unstructured as a software tool. The mentions primarily consist of YouTube references to "Unstructured AI" without detailed user opinions, and indirect references in discussions about unstructured data processing and RAG systems. One Hacker News post mentions building tools to simplify unstructured data search, suggesting there's demand in this space, but doesn't provide direct user sentiment about Unstructured itself. Without substantial user reviews or detailed social commentary, it's difficult to assess user satisfaction, pricing sentiment, or overall reputation for this tool.
Zerox
Unstructured
Zerox
Pricing found: $50.10, $48.71, $48.71, $48.71, $9.74
Unstructured
Pricing found: $0.03 / page
Only in Zerox (10)
Only in Unstructured (10)
Zerox
No data yet
Unstructured
Zerox
Unstructured