32 data tools compared — reviews, pricing & social mentions
The Document AI solutions suite includes pretrained models for document processing, Workbench for custom models, and Warehouse to search and store.
OCR & Document Extraction using vision models. Contribute to getomni-ai/zerox development by creating an account on GitHub.
Explore Azure Document Intelligence in Foundry Tools (formerly AI Document Intelligence). Transform documents with AI and OCR to extract text and stru
Build production-grade applications with a Postgres database, Authentication, instant APIs, Realtime, Functions, Storage and Vector embeddings. Start
Replace DIY complexity with the context engineering platform built for accuracy. Ship production-grade AI that is secure, scalable, and specialized.
Transform complex, unstructured data into clean, AI-ready inputs. Connect to any source, process 64+ file types, and power your GenAI projects. Start
Build with serverless PostgreSQL, a type-safe ORM for Node.js and TypeScript, visual database tools, and AI-ready workflows from Prisma.
The database you love, on a serverless platform designed to help you build reliable and scalable applications faster.
The web scraping API built for the AI era. Extract structured data from any website — no proxies, no selectors, no maintenance needed.
Dagster is the data orchestrator platform that helps you build, schedule, and monitor reliable data pipelines - fast, flexible, and built for teams.
ScrapingBee is the best web scraping API that handles proxies and headless browsers for you — so you can focus on extracting the data you need.
We've built our own browser automation approach from the ground up to avoid leaving even the most subtle fingerprints.
Improve the accuracy and efficiency of enterprise search and retrieval by reordering results based on semantic relevance.
dbt Labs empowers data teams to build reliable, governed data pipelines—accelerating analytics and AI initiatives with speed and confidence.
Convert images and PDFs to LaTeX, DOCX, Overleaf, Markdown, Excel, ChemDraw and more, with our AI-powered document conversion technology.
Neum AI is a best-in-class framework to build your data infrastructure for Retrieval Augmented Generation and Semantic Search. It provides a collectio
Effortlessly centralize all the data you need so your team can deliver better insights, faster. Start for free.
Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 32,000+ ready-made tools, code templates, or order a custom solut
The web context API for AI agents. Search, scrape, parse, and interact with the live web — turn any source into clean Markdown or structured data your
Build complex document processing pipelines with large language models. Declaratively extract structured data, link entities, rank information and mor
Amazon Textract is a machine learning (ML) service that uses optical character recognition (OCR) to automatically extract text, handwriting, and data
Agentic Document AI to power business workflows at enterprise scale, with unmatched accuracy.
Platform created by the community to programmatically author, schedule and monitor workflows.