Docling is praised for its advanced AI capabilities and integration with various platforms, which users find particularly beneficial for enhancing productivity and workflow. However, users express frustration over ongoing challenges with table extraction, especially in complex cases like borderless tables, which remains a significant complaint about its functionality. While there is not much information about the pricing sentiment in the mentions, Docling's overall reputation appears to be positive, with users recognizing its potential for substantial improvements with continued development.
Mentions (30d)
1
Reviews
0
Platforms
2
Sentiment
25%
2 positive
Docling is praised for its advanced AI capabilities and integration with various platforms, which users find particularly beneficial for enhancing productivity and workflow. However, users express frustration over ongoing challenges with table extraction, especially in complex cases like borderless tables, which remains a significant complaint about its functionality. While there is not much information about the pricing sentiment in the mentions, Docling's overall reputation appears to be positive, with users recognizing its potential for substantial improvements with continued development.
Features
Use Cases
2
npm packages
3
HuggingFace models
Why Is Table Extraction with VLM Models Still Challenging? [D]
Hey everyone, I’m struggling to find a good approach for converting PDFs to Markdown (especially for financial data). The main challenge is handling borderless tables and tables with more than 5–6 columns. I’ve tried docling, graphite-docling, marker, etc., but haven’t found a solid open-source solution. The only thing that works well so far is LandingAI (but it’s paid). Does anyone know of a good open-source alternative? TIA! Sample: https://preview.redd.it/tajjcvjt5jyg1.png?width=959&format=png&auto=webp&s=8d04c5e946ab361bfef08021f79d106ab62a07cd https://preview.redd.it/lhpwnbty5jyg1.png?width=630&format=png&auto=webp&s=8dc0475a32b89ce7f8107f3940fd3eb6b0896a3a submitted by /u/No_Stretch_5809 [link] [comments]
View originalOpen Source Alternative to NotebookLM
For those of you who aren't familiar with SurfSense, SurfSense is an open-source alternative to NotebookLM for teams. It connects any LLM to your internal knowledge sources, then lets teams chat, comment, and collaborate in real time. Think of it as a team-first research workspace with citations, connectors, and agentic workflows. I’m looking for contributors. If you’re into AI agents, RAG, search, browser extensions, or open-source research tooling, would love your help. Current features Self-hostable (Docker) 25+ external connectors (search engines, Drive, Slack, Teams, Jira, Notion, GitHub, Discord, and more) Realtime Group Chats Video generation Editable presentation generation Deep agent architecture (planning + subagents + filesystem access) Supports 100+ LLMs and 6000+ embedding models (via OpenAI-compatible APIs + LiteLLM) 50+ file formats (including Docling/local parsing options) Podcast generation (multiple TTS providers) Cross-browser extension to save dynamic/authenticated web pages RBAC roles for teams Upcoming features Desktop & Mobile app submitted by /u/Uiqueblhats [link] [comments]
View originalOpen Source Alternative to NotebookLM
For those of you who aren't familiar with SurfSense, SurfSense is an open-source alternative to NotebookLM for teams. It connects any LLM to your internal knowledge sources, then lets teams chat, comment, and collaborate in real time. Think of it as a team-first research workspace with citations, connectors, and agentic workflows. I’m looking for contributors. If you’re into AI agents, RAG, search, browser extensions, or open-source research tooling, would love your help. Current features Self-hostable (Docker) 25+ external connectors (search engines, Drive, Slack, Teams, Jira, Notion, GitHub, Discord, and more) Realtime Group Chats Hybrid retrieval (semantic + full-text) with cited answers Deep agent architecture (planning + subagents + filesystem access) Supports 100+ LLMs and 6000+ embedding models (via OpenAI-compatible APIs + LiteLLM) 50+ file formats (including Docling/local parsing options) Podcast generation (multiple TTS providers) Cross-browser extension to save dynamic/authenticated web pages RBAC roles for teams Upcoming features Slide creation support Multilingual podcast support Video creation agent Desktop & Mobile app GitHub: https://github.com/MODSetter/SurfSense submitted by /u/Uiqueblhats [link] [comments]
View originalRepository Audit Available
Deep analysis of DS4SD/docling — architecture, costs, security, dependencies & more
Docling uses a tiered pricing model. Visit their website for current pricing details.
Key features include: Automated document parsing, Support for multiple file formats (PDF, DOCX, etc.), Customizable parsing rules, Data extraction and transformation capabilities, Integration with data analysis tools, User-friendly interface for non-technical users, Real-time collaboration features, Version control for documents.
Docling is commonly used for: File not found, GitHub Pages.
Docling integrates with: Google Drive, Microsoft OneDrive, Slack, Zapier, Trello, Asana, Jira, Tableau, Power BI, Notion.