Kling AI, tools for creating imaginative images and videos, based on state-of-art generative AI methods.
Kling AI is appreciated for its integration into creative workflows, offering tools that help automate content creation. While it is noted for its innovative features, there are complaints about the saturation of subscription-based models in the AI video tool market, which some users find financially burdensome. Pricing sentiment reflects general dissatisfaction with the high cost of AI tools, as many are seen as locked behind expensive paywalls. Overall, Kling AI holds a positive reputation for its functionality but faces competition from other models, with users actively comparing its performance against alternatives.
Mentions (30d)
10
2 this week
Reviews
0
Platforms
2
Sentiment
33%
11 positive
Kling AI is appreciated for its integration into creative workflows, offering tools that help automate content creation. While it is noted for its innovative features, there are complaints about the saturation of subscription-based models in the AI video tool market, which some users find financially burdensome. Pricing sentiment reflects general dissatisfaction with the high cost of AI tools, as many are seen as locked behind expensive paywalls. Overall, Kling AI holds a positive reputation for its functionality but faces competition from other models, with users actively comparing its performance against alternatives.
Features
Use Cases
Industry
information technology & services
Employees
120
OpenAI cofounder Andrej karpathy just joined anthropic and the talent war is officially over
this happened literally today ,andrej karpathy one of the most respected ai researchers alive nd the guy whose youtube lectures taught half the developers in this sub how neural networks work, just announced he is joining anthropic's pre training team. He's the 3rd senior openai figure to defect to anthropic in under two years. Jan leike left in may 2024, John schulman (co-founder) left in august 2024 and now karpathy. He is joining the pre training team under nick josef and building a new team focused on using claude to accelerate pre training research which means Anthropic is betting that claude can help make itself smarter, thats recursive self improvement with one of the most capable researchers in the world leading it. The musk trial verdict came in yesterday with the jury ruling in altman's favor, karpathy announces today voilaa . The timing is either coincidental or the most savage talent acquisition move in tech history. I hv been watching this trajectory while building my own workflows on claude ,every month the ecosystem around claude gets stronger. The connectors mean claude orchestrates professional creative tools natively, the api means platforms like magic hour and kling can plug video generation capabilities into claude powered pipelines, the finance templates mean entire industry workflows run through claude and now the guy who built tesla's self driving stack is making the pre training better. Polymarket gives anthropic 67.5% chance of going public before openai and i too think its ipo will be more successfull than openai what's everyone's read on what karpathy specifically brings to claude's pre training? submitted by /u/Healthy-Challenge911 [link] [comments]
View originalHow to Create Viral Stadium Fan Cam Storyboards with GPT Image 2? Prompt Below!
This was one of the most realistic storyboard styles I’ve generated recently with GPT Image 2. The goal was to recreate the feeling of a real televised football broadcast mixed with cinematic commercial production — authentic crowd emotion, live camera imperfections, shallow telephoto depth of field, broadcast overlays, and natural sponsor integration. What makes this style work so well: realistic stadium crowd energy sports TV broadcast aesthetics cinematic advertisement framing emotional candid reactions ultra realistic lighting and skin texture natural product placement that feels like a real sponsorship commercial The storyboard panels can later be animated inside Seedance, Kling, Veo, or similar AI video tools to create a full fan-cam style commercial sequence. Tools used: GPT Image 2 → storyboard generation Seedance / Kling → animation & motion Prompt: "Hyper-realistic cinematic storyboard sheet for a 15-second sports broadcast commercial, beautiful stylish woman with natural blonde wavy hair wearing a cream sleeveless turtleneck knit top and pearl earrings sitting naturally among real football audience inside a packed stadium, yellow and blue fans cheering in background, realistic live sports broadcast camera perspective, authentic stadium lighting, soft cinematic blur, realistic skin texture and facial details, natural candid expressions, she watches the football match intensely while holding a blue Japanese premium beverage can naturally in her hand, realistic crowd interaction, broadcast scoreboard overlays, sports network watermark, smooth TV-commercial camera shots, ultra realistic photography style, documentary sports coverage aesthetic, realistic depth of field, live match atmosphere, product integrated naturally like real sponsorship footage, final shot close-up where she smiles and blows a flying kiss toward the camera, emotional crowd energy, cinematic realism, premium advertisement production storyboard layout, professional shot sequence panels, real broadcast feeling, highly detailed realistic storyboard sheet --ar 16:9" Would love to see more people experimenting with this format. submitted by /u/DataGirlTraining [link] [comments]
View originalAll-in-one AI platforms are quietly taking over end-to-end production. Thoughts?
Posters, trailers, full episode lists, even a Cannes slot lined up this year. Watched on Higgsfield 1-2 of them and was impressed, while some still looked a little bit like slop. The interesting part isn't the AI-Netflix angle though. It's that one platform did the whole thing end to end: character consistency, generation, multi-shot sequencing, audio, distribution. No 5 different tools, no Premiere stitching 47 clips together. Meanwhile Kling, Runway, Veo are all racing to perfect a single model. Higgsfield is quietly building the entire production stack under one roof. Is vertical integration the actual moat in AI video, or are single-model specialists still going to win on quality? Curious where people think this is heading. submitted by /u/BrainTool117 [link] [comments]
View originalAt what point do we stop calling ai generated video slop
I think we passed the line and most people haven't noticed two years ago slop was generous and a year ago sora dropped and quality jumped but everything still had that uncanny wobble where hands melted slop was still accurate. Have you seen what's coming out now though? animated studios are reportedly considering switching to ai generated animation because it drops production costs from $500k to under $100k. Netflix just acquired an ai content company, disney confirmed ai will play a significant role in content production going forward. these aren't creators experimenting, these are the companies that define what quality means for a billion people. On the commercial content side it's already happened quietly. I produce short form video for brands using a mix of ai tools, kling for generation, magic hour for face swaps, capcut for touch ups. sent a client 20 social videos last week and she said "love these" ,they dont care if it ai ,they just want outcome fast. the trick that changed everything is that nobody's using raw text to video as the final output anymore. you layer capabilities and the combined output looks fundamentally different from type a prompt and pray i think "slop" is doing two things right now ,one is legitimate quality criticism for genuinely bad output which still exists. The other is a defense mechanism because admitting the output is commercially viable means admitting something uncomfortable about what human creators are competing against. If a viewer can't tell so the algorithm doesn't care and the commercial results are identical, is it still slop? submitted by /u/Tough_Commercial_103 [link] [comments]
View originalUsed image-gen-2 to generate stylized keyframes for an AI video. Combined with Kling 3 to make this continuous reel of me across different drawing styles.
submitted by /u/phoneixAdi [link] [comments]
View originalI built a hands-free voice AI that sends emails mid-conversation — and that's just one feature. Here's everything AskSary can do.
https://reddit.com/link/1symbsj/video/k2no3zfgq1yg1/player Been building AskSary solo for a while. Just shipped hands-free voice email - you're mid-conversation with an AI and you say "send an email to [john@example.com](mailto:john@example.com) subject X body Y" and it pre-fills the Gmail modal automatically. One tap sends. Powered by OpenAI Realtime API, works in 22 languages. But that's just the latest feature. Here's the full picture: Every major model in one place GPT-5-Nano, GPT-5.2, GPT-5.2 Pro, O1 Reasoning, Claude Sonnet 4.6, Grok 4, Gemini 2.5 Flash, Gemini 3.1 Pro, Gemini Ultra, DeepSeek V3, DeepSeek R1 - with smart auto-routing or manual override. Pro-Active Personalisation On every login the AI reads your previous conversations and sends the first message itself - asking if you want to continue or start fresh. Before you type a single word. Persistent Cross-Model Memory Start a conversation with Claude on your phone, open your laptop, switch to GPT-5.2 - it already knows what you discussed. No copy-pasting, no summaries. Just works. Knowledge Base - RAG Upload docs up to 500MB per file, unlimited uploads, chat with them across any model via OpenAI Vector Store. Your files stay in context forever. Integrations Google Drive, Gmail, Google Calendar, Notion - access files, get email and calendar summaries, use them in chat or push them to your Knowledge Base. Generation Tools Image Gen - GPT-Image-1 and Nano Banana Pro Flux Image Editor - full editing suite with visual history Video Studio - Luma Dream, Veo 3.1, Kling 1.6 / 2.6 / 3, up to 10 second AI videos with audio Music Studio - 30 second tracks with custom or AI lyrics via ElevenLabs, visualizer built into chat 3D Model Studio - Meshy with STL export (deploying soon) Video Analysis - upload up to 500MB or paste a YouTube link Developer and Builder Tools Vision to Code - screenshot any UI, get live editable code Web Architect - build full web apps from a single prompt Game Engine - build and prototype games with AI Code Lab - split screen live coding with SQL Architect, Bug Buster, Git Guru, Regex Generator, Test Genie and more Tavily web search across all models Voice and Audio Real-time 2-way voice chat - 8 voices, near-zero latency WebRTC Podcast Mode - two AI voices, switchable, near-zero latency, downloadable as MP3 Voiceover Studio, Voice Notes, Voice Tuner Productivity and Content Slides, Docs and File Tools Pro Writer and Content Library Social Tools - Hook Generator, Video Script, Hashtag Creator, Idea Spark Business Suite - Pitch Deck Builder, Deep Analytics, Legal Eagle, Maths Solver Daily Briefing and Market Watch CV Creator, Email Polisher, Cover Letter Builder, TL;DR Bot Share conversations or snippets with anyone Platform Extras 30+ live interactive wallpapers and themes Custom Agents and Personas Folder organisation and Smart Search across chat history Media Manager Gallery - all your generated content in one place Fully customisable UI in 26 languages with full RTL support The Stack Frontend: Next.js, Capacitor (iOS + Android), Vanilla JS / React Backend: Vercel serverless, Firebase / Firestore, Firebase Admin SDK AI: OpenAI, Anthropic, Google, xAI, DeepSeek Generation: Luma AI, Kling via Replicate, Veo via Replicate, ElevenLabs, Flux via Replicate, Meshy Integrations: Google Drive, Notion, Tavily, OpenAI Vector Store, Stripe, CloudConvert, Sentry Rendering: Mermaid, MathJax Platforms: Web, iOS, Android, Apple Vision Pro What you get free just for creating an account (1,000 credits/month, rolling): Unlimited chat on GPT-5 Nano, Gemini Flash and DeepSeek V3 - no daily limits, zero credit charge 25 image generations via GPT-Image-1 and Nano Banana Pro - 40 credits each 8 image edits via Flux Studio - 80 credits each 2 song generations via ElevenLabs - 350 credits each 2 video generations via Luma Dream and Kling - 350 credits each ~70 messages on Claude Sonnet 4.6, GPT-5.2, Grok 4, Gemini 3.1 Pro and DeepSeek R1 - 15 credits each No credit card required. Built entirely solo. No CS degree, no team, no funding. Started because I asked an AI to build me a chatbot and it failed - so I built my own. Accepted to LEAP 2026 in Saudi Arabia along the way. Happy to answer anything about the build. asksary.com submitted by /u/Beneficial-Cow-7408 [link] [comments]
View originalI built a solo AI platform from Bahrain with no funding, no team and no ad spend - here's what's inside it after 4 months
https://reddit.com/link/1sxotqx/video/xlaqd9i8guxg1/player I'm a self-taught developer, 39 years old, based in Bahrain. Four months ago I started building AskSary - a multi-model AI platform with a persistent memory layer that sits above all the models. The core idea: the model is not the identity. Most AI tools lose your context the moment you switch models. I built the layer that remembers you across all of them. Here's what's shipped so far: Models & Routing Every major model in one place - GPT-5.2, Claude Sonnet 4.6, Grok 4, Gemini 3.1 Pro, DeepSeek R1, O1 Reasoning, Gemini Ultra and more - with smart auto-routing or manual override. Memory & Context Persistent cross-model memory. Start with Claude on your phone, switch to GPT on your laptop - it already knows what you discussed. Proactive personalisation that messages you first on login before you've typed a word. Integrations Google Drive and Notion - connect once, pull files and pages directly into chat or your RAG Knowledge Base. Unlimited uploads up to 500MB per file via OpenAI Vector Store. Video Analysis - Gemini native video understanding for YouTube URL analysis (no download required, processed natively) and direct file upload up to 500MB. Full breakdown of visuals, audio, dialogue, editing style and key moments. Generation Image generation and editing, video studio across Luma, Veo and Kling, music generation via ElevenLabs, video analysis via upload or YouTube URL. Builder Tools Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect, Bug Buster, Git Guru and more. Tavily web search across all models. Voice & Audio Real-time 2-way voice chat at near-zero latency, AI podcast mode downloadable as MP3, Voiceover, Voice Notes, Voice Tuner. Platform Custom agents, 30+ live interactive themes, smart search, media gallery, folder organisation, full RTL support across 26 languages, iOS and Android apps, Apple Vision Pro. Where it is now 129 countries. Currently at 40 new signups a day. 1080 Signup's so far after 4 weeks or so. MRR just started. Zero ad spend. All of it built solo, one feature at a time, on a balcony in Bahrain. The Stack: Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK AI Models - OpenAI (GPT, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D — coming soon) Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling) Rendering - Mermaid (flow charts) and MathJax Platforms - Web, iOS, Android, Apple Vision Pro (visionOS) Languages - 26 UI languages with full RTL support asksary.com Happy to answer questions on any part of the build - stack, architecture, API cost management, anything. submitted by /u/Beneficial-Cow-7408 [link] [comments]
View originaltwo years ago this sub had 12k members asking "is claude better than chatgpt for writing" and now the company is worth a trillion dollars
I joined this sub when claude 3 opus dropped and it was a completely different world in here, small group of people who'd stumbled onto something that felt genuinely different from chatgpt and couldn't shut up about it. The posts were stuff like "did anyone else notice claude actually admits when it doesn't know something" and "i think anthropic might be onto something here" loll yesterday google committed $40 billion, amazon committed $25 billion the same week and revenue went from $1 billion to 30 billion in fifteen months which is apparently the fastest growth in american tech history. Secondary market says a trillion dollars and eight of the fortune 10 are customers, the tool we were geeking out about in a tiny subreddit is now arguably the most important ai product in the world and i'm still processing that I'm not trying to brag about being early because being early got me exactly nothing except a tool i love using and talk about too much at dinner parties. I'm writing this because i think this community deserves a moment and this sub was one of the first places where people figured out what claude could actually do in practice, people here were sharing creative pipelines, coding workflows and research systems openly before the enterprise market caught on. My own story is tiny compared to some of yours but it means everything to me, i do video content production and when i found this sub someone here posted about using claude to redesign their creative workflow and i tried the same thing and ended up in a conversation where claude basically told me my problem wasn't my tools it was my architecture,it helped me audit everything i was paying for separately across runway, topaz, heygen, kling, a headshot tool i used twiceand consolidate most of it into magichour, then connect the pipeline to remotion for automated editing. That single conversation saved me roughly $120 a month and cut my production time by 40%. I went from billing $3k a month doing everything manually to $14k a month as a one person studio and claude was involved in almost every step of that growth But honestly my story isn't the pointm hundreds of people in this sub have stories like this and collectively those stories are part of why anthropic is where it is today, the use cases now generating $30 billion in revenue started as experiments shared in communities exactly like this one. The part of the news i care about most as a daily user isn't the valuation it's the 10 gigawatts of new compute capacity. Every single person in this sub has hit rate limits midthought and wanted to throw something, if $73 billion in combined investment means i stop seeing "you've reached your limit" during a client deadline then the entire deal is justified and i will personally write dario a thank you letter haha I m trying not to get ahead of myself about what this means long term because historically when startups become megacorps the product changes and not always for the better but right now in this moment i just feel grateful i found this tool and this community when i did what's your claude story, curious when you joined and what changed for you because i think today's a good day to share those submitted by /u/Jealous-Drawer8972 [link] [comments]
View originalBuilt a multi-model AI platform with real-time WebRTC voice, persistent cross-model memory, and a full generation suite - free account gets 1 min voice/month
https://reddit.com/link/1sutga7/video/ktd3pxcam7xg1/player I've been building AskSary for the past few months - a multi-model AI platform - and just shipped real-time 2-way voice chat powered by OpenAI's WebRTC API. The visualization reacts to your voice in real time: 180 radial frequency bars orbit a glowing orb, 280 particles drift across a full-screen canvas, aurora sweeps and ripple waves emit on voice peaks, and the whole thing color-shifts from cool blue (listening) to warm violet (speaking). Near-zero latency, 8 voice options. Anyone with a free account at asksary.com gets 1 minute of real-time voice every month to try it out - no credit card needed. The platform also has a lot more built around it if you're curious: Models - GPT-5-Nano, GPT-5.2, GPT-5.2 Pro, O1 Reasoning, Claude Sonnet 4.6, Gemini 2.5 Flash, Gemini 3.1 Pro, Gemini Ultra, Grok 4, DeepSeek V3, DeepSeek R1 - with smart auto-routing or manual selection Memory and context - Persistent cross-model memory. Start on mobile with Claude, switch to GPT-5.2 on desktop and it already knows the conversation. Plus proactive personalization: on every login the chatbot reads your previous sessions and opens with a message asking if you want to continue - before you type anything. RAG - Upload docs up to 500 MB each, unlimited uploads, chat with them across any model via OpenAI Vector Store Generation - GPT-Image-1, Nano Banana Pro + Flux editor with visual history, Video Studio (Luma, Veo 3.1, Kling), Music Studio with ElevenLabs and in-chat visualizer, 3D Model Studio with STL export (coming soon) Builder tools - Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect / Bug Buster / Git Guru and more Voice and audio - Real-time chat, Podcast Mode (two AI voices, downloadable MP3), Voiceover, Voice Notes, Voice Tuner Productivity - Slides, Docs, Pro Writer, Social tools, Business Suite, CV Creator, Daily Briefing, Market Watch Platform - 30+ live wallpapers, Custom Agents, Folder org, Smart search, Media Gallery, 26 languages + RTL, fully customizable UI Happy to answer questions about the WebRTC implementation or anything else. Would love to hear what you think of the voice visualization. submitted by /u/Beneficial-Cow-7408 [link] [comments]
View originalI built real-time 2-way voice chat into my AI platform using OpenAI WebRTC - free to try (1 min/month)
https://reddit.com/link/1sut0jp/video/f7wqfo9zi7xg1/player I've been building AskSary for the past few months - a multi-model AI platform - and just shipped real-time 2-way voice chat powered by OpenAI's WebRTC API. The visualization reacts to your voice in real time: 180 radial frequency bars orbit a glowing orb, 280 particles drift across a full-screen canvas, aurora sweeps and ripple waves emit on voice peaks, and the whole thing color-shifts from cool blue (listening) to warm violet (speaking). Near-zero latency, 8 voice options. Anyone with a free account at asksary.com gets 1 minute of real-time voice every month to try it out - no credit card needed. The platform also has a lot more built around it if you're curious: Models - GPT-5-Nano, GPT-5.2, GPT-5.2 Pro, O1 Reasoning, Claude Sonnet 4.6, Gemini 2.5 Flash, Gemini 3.1 Pro, Gemini Ultra, Grok 4, DeepSeek V3, DeepSeek R1 - with smart auto-routing or manual selection Memory and context - Persistent cross-model memory. Start on mobile with Claude, switch to GPT-5.2 on desktop and it already knows the conversation. Plus proactive personalization: on every login the chatbot reads your previous sessions and opens with a message asking if you want to continue - before you type anything. RAG - Upload docs up to 500 MB each, unlimited uploads, chat with them across any model via OpenAI Vector Store Generation - GPT-Image-1, Nano Banana Pro + Flux editor with visual history, Video Studio (Luma, Veo 3.1, Kling), Music Studio with ElevenLabs and in-chat visualizer, 3D Model Studio with STL export (coming soon) Builder tools - Vision to Code, Web Architect, Game Engine, Code Lab with SQL Architect / Bug Buster / Git Guru and more Voice and audio - Real-time chat, Podcast Mode (two AI voices, downloadable MP3), Voiceover, Voice Notes, Voice Tuner Productivity - Slides, Docs, Pro Writer, Social tools, Business Suite, CV Creator, Daily Briefing, Market Watch Platform - 30+ live wallpapers, Custom Agents, Folder org, Smart search, Media Gallery, 26 languages + RTL, fully customizable UI Happy to answer questions about the WebRTC implementation or anything else. Would love to hear what you think of the voice visualization. Free to try at asksary.com submitted by /u/Beneficial-Cow-7408 [link] [comments]
View original3 months ago I couldn't write Hello World. Today I built a world-first native visionOS AI platform - GPT-5 & GPT-Image-1 living inside a full 360° spatial environment with 30 live wallpapers. Video inside.
https://reddit.com/link/1srzytr/video/8b8pfobgtlwg1/player I want to show you something nobody has ever seen before. Three months ago I had zero coding knowledge. I couldn't write a single line of code. In the time since, I taught myself GitHub, Visual Studio, Xcode, Android Studio, Firebase, Firestore, Vercel, Sentry - and built a fully functional AI platform live across web, iOS, Android, Mac desktop, and Apple Vision Pro. Today I converted it into something completely new. AskSary is now a world-first fully spatial AI experience — built natively for visionOS. Not an iPad app running in compatibility mode. A ground-up, native spatial build where the entire interface is a live immersive 360° wallpaper. You don't open the app. You step inside it. In the video you'll see GPT-5 greeting you from inside the spatial environment, then a live switch to GPT-Image-1 for real-time image generation — all happening inside a 360° world with floating UI, particle effects, and a starfield you're literally standing in. 30 live interactive wallpapers and themes. Each one is a different world to inhabit while you work. Beyond the spatial shell, the platform includes: Image generation via GPT-Image-1 and Nano Banana Pro Flux Image Editor with visual history Video Studio - Luma Dream, Veo 3.1, Kling 1.6, 2.6 and 3, up to 10 second AI videos with audio Music Studio - 30 second tracks via ElevenLabs 3D Model Studio with STL export (coming soon) Vision to Code - screenshot any UI, get live editable code Web Architect, Game Engine, Code Lab Real-time 2-way voice chat, Podcast Mode, Voiceover Full productivity suite, business tools, social tools, 26 languages 18 API integrations total Persistent cross-model memory, custom agents and personas I'm a self-taught developer. No bootcamp. No CS degree. No prior knowledge. Just three months of figuring it out one problem at a time. I wanted to build something that made people say wow. Something nobody had done. I think this might be it. Would love to hear what you think. asksary.com This version of the Apple Vision Pro variant is not currently available on the App Store but if people are genuinely interested I'll release it today. submitted by /u/Beneficial-Cow-7408 [link] [comments]
View originalClaude Design just launched and Figma dropped 4.26% in a single day, we are witnessing history in real time
I genuinely cannot believe what I'm watching unfold today Anthropic dropped Claude Design this morning , a tool that lets anyone describe what they want and get back a full website, landing page, or presentation. No design skills needed and No Figma subscription. Just... talk to it And the market reacted instantly. Figma stock is down $0.86 (4.26%) today alone. Adobe, Wix, GoDaddy all bled too. Anthropic's own CPO literally resigned from Figma's board three days ago. The writing was on the wall and now it's on the landing page Claude just generated for you. What's making my brain short circuit is the full pipeline this unlocks right now, today. You describe your UI in Claude Design, animate it in Magic Hour, turn it into a motion video with Kling, and voice it over in any language with ElevenLabs. That's an entire creative agency workflow built from prompts by one person in an afternoon. I'm trying to stay grounded here because Figma isn't going anywhere overnight , they own something like 80-90% of the UI/UX market and have years of professional tooling that pros genuinely love but the entry point to design just got demolished. The question clients are going to start asking is "wait, why can't we just describe this to Claude?" and that question is going to be really hard to answer. I've been following AI closely for a while now and this is the first announcement where I felt something shift. Slightly terrified and extremely excited, completely unable to go back to sleep. How is everyone else feeling right now? submitted by /u/Future_Language76833 [link] [comments]
View originalI built an open-source 6-agent pipeline that generates ready-to-post TikToks from a single command
Got tired of the $30/mo faceless video tools that produce the same generic slop everyone else is posting. So I built my own. Claude Auto-Tok is a fully automated TikTok content factory that runs 6 specialized AI agents in sequence: Research agent — scrapes trending content via ScrapeCreators, scores hooks, checks trend saturation Creative agent — generates multiple hook variations using proven formulas (contradictions, knowledge gaps, bold claims), writes the full script with overlay text Audio agent — ElevenLabs TTS with word-level timing for synced subtitles Visual agent — plans scenes, pulls B-roll from Pexels or generates clips via Kling AI, builds thumbnails Render agent — compiles final 9:16 video in Remotion with 6 different templates (split reveal, terminal, cinematic text, card stacks, zoom focus, rapid cuts) QA agent — scores the video on a 20-point rubric across hook effectiveness, completion rate, thumbnail, and SEO. Triggers up to 2 revision cycles if it doesn't pass One command. ~8 minutes. Ready-to-post video with caption, hashtags, and thumbnail. Cost per video is around $0.05 without AI-generated clips. Supports cron scheduling for 2 videos/day and has TikTok Direct Post API integration for hands-free publishing. Built with TypeScript, Claude via OpenRouter for creative, Gemini 2.5 for research/review, Remotion for rendering. MIT licensed: https://github.com/nullxnothing/claude-auto-tok Would appreciate feedback from anyone running faceless content or automating short-form video. submitted by /u/Pretty_Spell_9967 [link] [comments]
View originalis AI making us better thinkers or just faster workers
I've been using claude daily for about 8 months now and something has been nagging at me that I want to talk about. when I first started using it I was genuinely thinking more, I'd use claude to challenge my assumptions and explore angles I hadn't considered and stress test ideas before committing to them, it felt like having a thinking partner that made my actual reasoning sharper. lately though I've noticed a shift in myself that I don't love, I've started going to claude brfore even I think instead of after, like I'll get a new project at work and instead of sitting with it for a while and forming my own perspective first I'll immediately open claude and say "here's the situation what should I consider" and whatever it gives me becomes the starting framework I work within. The difference is subtle but it matters, in the first version I'm using AI to refine thinking I've already done, in the second version I'm outsourcing the initial thinking entirely and just editing what comes back and those are very different cognitive processes even though the output might look similar. I noticed it most clearly last week when I was doing research for a client project, I had claude pull together an analysis and I was about to send it and then I stopped and asked myself do I actually agree with this or am I just sending it because it sounds smart and I didn't have to think hard to produce it and I genuinely couldn't tell which one it was and that scared me a little. I think there's a version of using claude that makes you sharper and a version that makes you lazier and the line between them is just whether you're thinking first and using AI to go further or skipping the thinking entirely because the AI can produce something passable without it. I do a lot of creative work too, video stuff for clients where I use midjourney for concepts and kling, magic hour and runway for motion references, and I see the same pattern there, when I have a clear creative vision and use the tools to execute it faster the work is great, when I open the tools with no vision and just see what comes out the work is mediocre even though it looks polished. curious if anyone else has caught themselves making this shift and whether you've found a way to stay on the "better thinker" side instead of sliding into the "faster worker" side because I think it's one of the most important questions about how we use these tools and nobody's really talking about it submitted by /u/Major_Cable_8079 [link] [comments]
View originalSora is dead. What's everyone actually using now?
So OpenAI finally pulled the plug on Sora. Can't say I'm shocked honestly. The writing was on the wall for a while with how they handled access and the whole vibe around it felt off. Anyway, doesn't really matter now. Point is a lot of people (myself included) were holding out hoping Sora would be "the one" and now we gotta figure out what actually works. I've been testing pretty much everything over the past few days so figured I'd share what I've landed on(Actually hoping if you guys could guide me better ) For text-to-video (cinematic/realistic stuff): Kling 2.0 looks genuinely impressive for the price Motion quality is wild. Runway Gen-3 still has the edge on pure quality but you'll burn through credits insanely fast. Veo 2 from Google is worth watching but access is still weird For image-to-video / animating stills: Luma Dream Machine works well for quick generations. Magic Hour has been solid for me too, especially for product shots and turning AI images into clips. Not as flashy as Runway but the credits stretch way further which matters if you're actually producing volume. For face swap / lip sync: Honestly here i need your help .For me HeyGen looks fine but i think there might be some better alternative out there For stylized / video-to-video: Kaiber still works. Pika is fun for experimental things(not a fan of their ui) and Kling handles this decent too. Stuff I gave up on: Pika for anything serious (too inconsistent), waiting for any OpenAI video product at this point Curious what everyone else has migrated to. Feels like the landscape just shifted again and I'm probably missing some newer tools. submitted by /u/Healthy-Challenge911 [link] [comments]
View originalKling AI uses a tiered pricing model. Visit their website for current pricing details.
Key features include: When model_name is kling-v1-6 and mode is std: 2 units (equivalent to $0.28), When model_name is kling-v1-6 and mode is pro: 3.5 units (equivalent to $0.49).
Kling AI is commonly used for: Creating promotional videos for businesses, Generating animated content for social media, Developing educational videos with engaging visuals, Producing artistic video interpretations of music tracks, Crafting personalized video messages for marketing campaigns, Designing visual content for virtual events.
Kling AI integrates with: Adobe Creative Cloud, Slack, Trello, Zapier, Notion, Figma, Canva, Google Drive.
Based on 33 social mentions analyzed, 33% of sentiment is positive, 64% neutral, and 3% negative.
Brett Adcock
CEO at Figure AI
1 mention