Apache Airflow
Platform created by the community to programmatically author, schedule and monitor workflows.
Apache Airflow® has a modular architecture and uses a message queue to orchestrate an arbitrary number of workers. Airflow™ is ready to scale to infinity.

Apache Airflow® pipelines are defined in Python, so you can write code that instantiates pipelines dynamically.

Easily define your own operators and extend libraries to fit the level of abstraction that suits your environment.

Apache Airflow® pipelines are lean and explicit. Parametrization is built into its core using the powerful Jinja templating engine.

No more command-line or XML black-magic! Use standard Python features to create your workflows, including date time formats for scheduling and loops to dynamically generate tasks. This allows you to maintain full flexibility when building your workflows.

Monitor, schedule and manage your workflows via a robust and modern web application. No need to learn old, cron-like interfaces. You always have full insight into the status and logs of completed and ongoing tasks.

Apache Airflow® provides many plug-and-play operators that are ready to execute your tasks on Google Cloud Platform, Amazon Web Services, Microsoft Azure and many other third-party services. This makes Airflow easy to apply to current infrastructure and extend to next-gen technologies.

Anyone with Python knowledge can deploy a workflow. Apache Airflow® does not limit the scope of your pipelines; you can use it to build ML models, transfer data, manage your infrastructure, and more.

Whenever you want to share an improvement, you can do so by opening a PR. It's as simple as that: no barriers, no prolonged procedures. Airflow has many active users who willingly share their experiences. Have any questions? Check out our buzzing Slack.

Today we're launching the Apache Airflow Registry, a searchable catalog of every official Airflow provider and its modules, live at …
The interactive report is hosted by Astronomer.
The Apache Airflow community thanks Astronomer for running this survey, for sponsoring it …
We are thrilled to announce the first major release of airflowctl 0.1.0, the new secure, API-driven command-line interface (CLI) for Apache …

Apache Airflow Core includes the webserver, scheduler, CLI and other components needed for a minimal Airflow installation.

Apache Airflow CTL (airflowctl) is a command-line interface (CLI) for Apache Airflow that interacts exclusively with the Airflow REST API. It provides a secure, auditable, and consistent way to manage Airflow deployments without direct access to the metadata database.

The Task SDK provides Python-native interfaces for defining DAGs, executing tasks in isolated subprocesses and interacting with Airflow resources (e.g., Connections, Variables, XComs, Metrics, Logs, and OpenLineage events) at runtime. The goal of task-sdk is to decouple DAG authoring from Airflow internals (Scheduler, API Server, etc.), provid
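The idea of using loops to generate tasks dynamically can be sketched in plain Python. The snippet below is only an illustration under stated assumptions: it does not use the Airflow API, the table names and the `build_pipeline` helper are hypothetical, and the standard-library `graphlib` module stands in for a scheduler by computing one valid run order.

```python
from graphlib import TopologicalSorter

# Hypothetical table names; a real deployment might read these from config.
TABLES = ["orders", "customers", "invoices"]


def build_pipeline(tables):
    """Return {task_id: set of upstream task_ids}, generated with a loop.

    This is a plain-Python stand-in, not an Airflow DAG object.
    """
    deps = {"start": set()}
    for table in tables:
        extract = f"extract_{table}"
        load = f"load_{table}"
        deps[extract] = {"start"}   # every extract task waits on "start"
        deps[load] = {extract}      # each load task waits on its own extract
    # A final task that waits for every load task.
    deps["report"] = {f"load_{table}" for table in tables}
    return deps


pipeline = build_pipeline(TABLES)

# static_order() yields tasks so that upstream tasks always come first.
run_order = list(TopologicalSorter(pipeline).static_order())
print(run_order)
```

Adding a table to `TABLES` adds two tasks and their dependencies without any other change. In an actual Airflow DAG file, the same loop would typically create operator instances rather than dictionary entries, and Airflow would resolve the dependencies.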
Ragie
Meet Ragie.
Powered by the most advanced RAG pipeline, Ragie uses context engineering to deliver fast, accurate, context-rich retrieval through structured chunking, multi-layered indexing, and LLM-aware optimizations, built for production-grade generative AI. Ragie is built for enterprise-scale workloads with multi-tenant architecture, SOC 2-compliant security, and seamless performance at any scale.

Built to handle any data you throw at it, Ragie's multimodal ingest pipeline processes text, PDFs, images, audio, video, tables, and more. It parses, enriches, and structures diverse content into a unified format ready for chunking, indexing, and retrieval.

Ragie offers out-of-the-box features that accelerate your application development. Built to meet the security, scale, and reliability requirements of production AI.

Seamless data ingest with built-in authentication and authorization
Ragie's fully-managed connectors handle authentication and authorization to securely access data from popular data sources, freeing up precious engineering time and resources.

Automatic syncing keeps data up to date
Automatic syncing keeps your RAG pipeline up to date, ensuring your application delivers accurate and reliable information around the clock.

Growing library of native integrations
Purpose-built for AI applications, Ragie's growing list of native connectors allows seamless integration with the most popular data sources.

Connect your data (or your customers') to your app, no matter where it lives. With Ragie Connect, your customers can securely connect and manage their own data, directly from your application. For a white-label version, chat with sales.

Ragie is a fully managed RAG-as-a-Service designed for developers to streamline the ingestion, chunking, and multimodal indexing of structured and unstructured data. It offers simple APIs and SDKs, seamless integration with sources like Google Drive, Notion, and Confluence, and built-in capabilities like summary indexing, chunk reranking, flexible vector filtering, and hybrid semantic-keyword search. With agentic retrieval for multi-step reasoning and a context-aware MCP Server that enables intelligent tool use, Ragie helps your applications deliver state-of-the-art, agent-ready generative AI.

Building production applications using RAG can be very tedious. Developers must connect and sync multiple data sources, extract meaningful data from various file formats, implement evolving techniques for chunking and retrieval, build a scalable and resilient data processing pipeline, avoid hallucinations, and ensure content accuracy. Using open-source frameworks can be time-consuming and often results in brittle applications. Originally developed for Glue, Ragie solves this by providing a fully managed RAG-as-a-Service platform.

Ragie is ideal for developers who want to build AI applications that leverage their own data for accurate and relevant outputs. Whether you're working on internal chatbots, enterprise SaaS p
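Hybrid semantic-keyword search needs some way to merge two result lists into one. One common technique is reciprocal rank fusion (RRF), sketched below. This is an assumption chosen for illustration, not Ragie's documented method: the document IDs and both input rankings are hypothetical, and `k=60` is simply a commonly used default.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one ranking.

    Each document earns 1 / (k + rank) for every list it appears in,
    so documents ranked well by multiple retrievers rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword retriever and a vector retriever.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
semantic_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([keyword_hits, semantic_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

RRF uses only rank positions, so it avoids having to normalize keyword scores and vector similarities onto a common scale.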
Pricing found: $100 / month, $500 / month, $500 / month, $0.02 / page, $0.02 / page