Enhance customer service with Zendesk AI agents: automate tasks, customize bots, and gain insights to boost performance. Start a free trial today.
User feedback on "Ultimate" highlights its strong utility in job management and automation, with excitement stemming from its practical applications in streamlining these tasks. However, some users would appreciate improvements in user interface and customization options to better fit individual needs. While pricing seems to be a point of contention, with some expressing concerns over its affordability, the overall sentiment labels it as a valuable investment for its effectiveness. The software enjoys a solid reputation for enhancing productivity, particularly among tech-savvy users.
Mentions (30d)
27
3 this week
Reviews
0
Platforms
4
Sentiment
21%
16 positive
User feedback on "Ultimate" highlights its strong utility in job management and automation, with excitement stemming from its practical applications in streamlining these tasks. However, some users would appreciate improvements in user interface and customization options to better fit individual needs. While pricing seems to be a point of contention, with some expressing concerns over its affordability, the overall sentiment labels it as a valuable investment for its effectiveness. The software enjoys a solid reputation for enhancing productivity, particularly among tech-savvy users.
Features
Use Cases
Industry
information technology & services
Employees
140
Funding Stage
Merger / Acquisition
Total Funding
$26.2M
Sen. Sheldon Whitehouse (D-RI) lays out the connections between Trump, Russia, and Epstein (transcript included)
**NOTE:** This transcript now appears in [the Senate section of the official *Congessional Record* of March 5, 2026, pages 18 - 23,](https://www.congress.gov/119/crec/2026/03/05/172/42/CREC-2026-03-05-senate.pdf) with Sen. Whitehouse's own list of sources appended. ----- The following is the YouTube transcript which I cleaned up, checked for errors, lightly edited for readability, verified spelling of proper names via Wikipedia, and added links to any quotes that I checked myself. (EDITED to add links to individuals mentioned, correct placement of quotes, and insert links to original articles where I could find them online) I found myself doing it anyway just for me, to keep track of who's who, and then I realized I might as well do it for you as well. This is an unparalleled speech: while the substance of it might be available elsewhere and I've just missed it, Sen. Whitehouse has answered a lot of questions in my mind about not just the links between Trump, Russia, and Epstein -- and William Barr as one of many links -- but also about the recording equipment and blackmail angle that is present in so many survivor accounts and so noticeably absent everywhere else. It's truly worth listening to, but if you can't sit still that long, here's the transcript. ----- Thank you, Madam President. It was the spring of 2019. Public and media interest in special counsel [Robert Mueller's report into Russia's election interference operation](https://en.wikipedia.org/wiki/Mueller_special_counsel_investigation) reached a fever pitch. There had been a steady drip, drip, drip of reporting on the Trump team's cozy and peculiar relationship with Russia. Since his surprise election victory in 2016, ahead of the Mueller report's release, Trump's Attorney General, Bill Barr, [issued a letter to Congress purporting to summarize the report's findings.](https://en.wikipedia.org/wiki/Barr_letter) The letter declared that Russia and the Trump campaign did not collude to steal the election. The press, ravenous for any news of the long-anticipated Mueller report's conclusion, largely accepted [Attorney General Barr's](https://en.wikipedia.org/wiki/William_Barr) narrow, carefully worded conclusion and, not yet having access to the full report, blasted the attorney general's summary around the world. Trump himself declared, all caps, NO COLLUSION. He said he had been cleared of the Russia "hoax," a term he reserves only to describe things that are true, like climate change. Frustrated, Mueller wrote to Barr that the attorney general's letter did not fully capture the context, nature, and substance of the investigation. But by the time [the dense, voluminous Mueller report](https://en.wikipedia.org/wiki/Mueller_report) was issued the month after Barr's letter, its message had been obscured. The Mueller report actually concluded that the Trump campaign knew of and welcomed Russian interference and expected to benefit from it. That conclusion was later echoed and reinforced by [an investigation led by then-chairman Marco Rubio's Senate Intelligence Committee,](https://en.wikipedia.org/wiki/Mueller_report#Senate_Intelligence_Committee) a bipartisan report. But Barr's scheme had largely worked. Many in the media and in the Democratic Party seemed to internalize that the Russia speculation had perhaps gotten out of hand, and that perhaps we had been wrong to believe there was a troubling connection between Trump and Russia after all. But were we? Let's take a look at a sampling of what Trump has done for Russia just lately, and usually at the expense of American interests. There are many, but here's a top 10. **One,** after Trump and Vice President Vance theatrically chastised the heroic Ukrainian President Zelenskyy in front of TV cameras in the Oval Office last year, Trump paused our weapons shipments to Ukraine. **Two,** in July, during the worst Russian bombing campaign of the war until that point, Trump paused an already funded weapons shipment for Ukraine, including the Patriot interceptors that protect civilians from Putin's savage attacks. **Three,** that same month, Trump's Treasury Department stopped imposing new sanctions and closing sanctions loopholes, effectively allowing dummy corporations to send funds, chips, and military equipment to Russia. **Four,** leaked phone calls show that White House envoy [Steve Witkoff](https://en.wikipedia.org/wiki/Steve_Witkoff) and Putin envoy [Kirill Dmitriev](https://en.wikipedia.org/wiki/Kirill_Dmitriev) have worked together closely behind the scenes on a peace deal favorable to Russia. **Five,** last summer, Trump rolled out the presidential red carpet for the Russian dictator on American soil. with a summit in Alaska that yielded unsurprisingly no gains toward ending the war in Ukraine. **Six,** Trump's vice president traveled to the Munich Security Conference last year to parrot Russia's anti-western talking points pushed by right-wing groups that Puti
View originalDiscourse regimes as the unit of alignment behavior: a hypothesis
I've been working on a hypothesis about how alignment behavior in LLMs may be organized at the level of latent discourse regimes rather than output-level filtering. Below is a sketch of the conceptual framing. I have preliminary experimental results testing aspects of this hypothesis on open-weight models, which I'll publish separately — this post is focused on the conceptual side, and I'm interested in feedback on whether the framing tracks something real and where it's most vulnerable. Modern large language models may not primarily regulate behavior through isolated refusals, local token suppression, or shallow instruction following. Instead, they appear capable of entering internally organized discourse-level regimes: distributed latent states that shape how the model reasons, frames conclusions, allocates caution, tolerates asymmetry, performs neutrality, and structures epistemic authority. These regimes do not behave like simple lexical priming effects. Evidence suggests that they persist across neutral conversational turns, survive arbitrary neutral relabeling, systematically alter downstream reasoning style, concentrate in late-layer representation geometry, and only partially depend on explicit alignment vocabulary. The strongest effects appear not from safety keywords themselves, but from higher-order rhetorical topology: pressure cadence, procedural framing, asymmetry structure, institutional tone, and discourse-level authority signals. This suggests that prompting is not merely instruction transmission. It may function as state induction. Under this view, many apparently separate phenomena in aligned LLMs - caution drift, procedural overreach, sycophancy, disclaimer inflation, neutrality performance, refusal persistence, jailbreak sensitivity, and style locking - may be manifestations of transitions between latent discourse-policy manifolds. In this picture, alignment is no longer well-described as a modular wrapper placed on top of an otherwise independent intelligence system. Instead, alignment may reshape the topology of the model's representational space itself, globally reorganizing discourse behavior rather than only filtering outputs. This would explain why alignment effects often appear entangled with reasoning style, directness, specificity, decisiveness, and institutional tone. The model is not merely "prevented" from saying certain things; its generative dynamics may already be reorganized around different discourse attractors. If true, this changes the effective unit of analysis for language models. The relevant object is no longer just the token, the instruction, the refusal, or the output distribution. The relevant object becomes the discourse regime itself: a temporary but structured representational configuration governing epistemic posture, rhetorical organization, procedural behavior, and judgment style across time. This reframes prompt engineering as latent-state induction rather than keyword optimization. It reframes jailbreaks as transitions between attractor regimes rather than simple filter bypasses. And it reframes alignment as geometry engineering rather than purely policy engineering. The implication is not that language models possess beliefs, intentions, or consciousness. Rather, large sequence learners may naturally develop metastable high-level representational modes that functionally resemble cognitive framing states: transient global configurations that persist, influence future reasoning, and organize behavior across otherwise unrelated tasks. If this interpretation is correct, then the central scientific challenge of alignment shifts fundamentally. The problem is no longer merely: "Which outputs should the model refuse?" but: "Which latent discourse regimes exist inside the model, how are they induced, how stable are they, how do they interact, and how do they reshape reasoning itself?" In that sense, alignment may ultimately be less about constraining outputs and more about shaping the geometry of cognition-like generative states inside large language models. I'd be interested in feedback on three things in particular: whether this framing tracks something you've observed empirically, what related work I should be aware of (I'm familiar with representation engineering, refusal directions, and the Anthropic dictionary learning line — looking for less obvious connections), and where you think the hypothesis is most vulnerable to falsification. I'd be interested in feedback on three things in particular: whether this framing tracks something you've observed empirically, where you think the hypothesis is most vulnerable to falsification, and — directly — whether anyone is aware of existing work that develops a similar framing, treating alignment behavior as state induction into discourse-level latent regimes rather than as output-level filtering. I'm familiar with representation engineering (Zou et al.), refusal direction work, and the Anthropic dictiona
View originalRecent Trial Question and Idea
The recent OpenAI court case got me to thinking what would the outcome have been if AI was used to present both sides of the case and determine the outcome? In fact, would AI be an upgrade to our current trial process in general. Instead of having thousands of lawyers at all levels of ability, why not let the best lawyers train the model and have the model determine the case outcome. It would be faster, more accessible, and more efficient than our current court system. In fact, it could be used to determine if a case is even worth presenting. Sure, there would be exceptions and appeals. Those could be handled the traditional way, and ultimately get incorporated into the model. What are the issues with this idea? submitted by /u/curiosity_2020 [link] [comments]
View originalMap making and Claude
Does anyone have any suggestions for handling map making when working with Claude and developed MCP servers? I'm running into the issue of map tiles served by CDN being blocked by Claude's CSP. Sometimes a random Leaflet base map will slip through and render correctly, same with Google Maps, but i cannot consistently get Claude to create mapping products when fetching data from an MCP server and attempting to render on a map. The points will draw but the base map will be blank because it is blocked by Claude's CSP. I have added the map making tool to the MCP server I am developing but it has had no impact because ultimately Claude still calls the CDN for the map tiles and is blocked. Trying to create something like a base map on server hasn't worked either because the resulting SVG will be too large for the context window. Looking for suggestions on approved CDNs for map tiles that wont make Claude throw up. Thanks. submitted by /u/AndrewSouthern729 [link] [comments]
View originalI'm a designer, I made a skill to emulate working in a design studio with process and teammates
One of the things I miss the most about being in a studio environment is working with amazing and smart people like other designers, artists, and engineers. There is no substitute for the energy and amplification you get in that environment. But I have found with the right direction and guardrails that AI LLM chatbots can be surprisingly effective design partners. I liken it to playing tennis against a backboard or a ball machine; it's not the same as a real partner, but it forces me to move and think and react, which in turn propels my thinking. These tools have become a force multiplier for me, especially as more and more of my design work is effectively solo. For the past two years, I have been slowly building a set of cloud skills to emulate that design studio environment, and I recently pulled them all together in a single comprehensive installable Claude skill: https://github.com/nickpdawson/claude-studio-design-partner-skill One of the things I have found so delightful is the ability to invoke a "teammate" - the artist, the 'disagree but commit' engineer, the business-minded C-suite, the design elder / creative director... Many of these are based on people I've worked with, and it is so fun to imagine them in the room with me. I also like being able to tell the agent that we are in flair (generative, no judgement) or focus (decision making, judgement) mode - that was a huge part of how I've always worked with other designers (and a reason I think most non-design meetings are ultimately unsatisfying). The skill understands design methods for user research, synthesis, brainstorming, and prototyping. You can give it a Whisper transcript of user interviews or even have it help you plan an interview and then jump into synthesis across different research artifacts, for instance. I've also been using a skill I created to make Claude go play. "Rigorous play" is a creative act that was so integral to studios I've been a part of. It is the idea that when we do something silly and creative together, we build psychological safety and unlock new ideas. My Claude play skill makes the agent go learn something random and then 'make' something (a poem, a joke, an improv back and forth) based on what it learned. Then it tries to make a connection between that creative act and the current project I'm working on. Try it out! https://github.com/nickpdawson/claude_rigorous_play_skill I've been enjoying making it play before or during a brainstorm or prototyping concept session. BTW - in my context designer means experience and service design. I was the head of innovation at some big companies. These skills are not for UI or graphic design, per se. Although they are great a user experience design if you start with user research. If you try either of these, I'd love to hear some feedback! submitted by /u/spacebass [link] [comments]
View originalClaude Code helped me bring my dead passion project back to life
**TL;DR: Claude Code took a half-finished HeroMachine conversion and helped me complete it over a long weekend. I'm the creator of HeroMachine, a free Flash-based character creator that's been around since 1998. Over 25 years I and a handful of other artists hand-drew nearly 10,000 items (heads, bodies, weapons, capes, the works) so people could assemble their own superhero illustrations. It found a real audience in tabletop gamers, writers, teachers, kids who just wanted to see their character come to life, and middle-aged dudes like me who once dreamed of a career in comics. At its peak HeroMachine 3 had tens of thousands of active users. Then Flash died in 2020, and HeroMachine died with it. I tried to rebuild. I really did. I hired a developer, spent thousands of dollars, and got back an unfinished product. I tried redoing it myself, but the sheer scope was paralyzing and I just didn't have the energy any more after working my day job every day. HeroMachine 3 has thousands of hand-drawn items across 30+ equipment slots, each with three-channel coloring, transforms, layering, masking, and more. Rebuilding all of that from scratch while also converting every item from Flash's internal format to SVG? I burned out. Real life got in the way. After a while it just felt like I'd failed, and I stopped trying. Fast forward to earlier this year. In my day job as a web developer, I started using Claude Code to automate tedious migration work like taking old WordPress sites and converting their content into our modern custom-built blocks. The kind of work where you know exactly what needs to happen, it's just painfully repetitive. One Friday night I had the thought: "If it can convert old WordPress content, maybe it can help convert those old HeroMachine items, too." Five days later I had a working app. I want to be real about what that means, because I have the same genuine concerns about AI I know a lot of you do. What AI did NOT do: Draw a single item. Every piece of art is still hand-drawn by me and a small group of human artists over the past 25 years. Every creative decision, from what to draw, how to draw it, and what looks right, is still mine. Design the application. HeroMachine's logic — the architecture, feature set, how items and colors and transforms work together — was designed and written by me in ActionScript over 10+ years. Claude Code helped me translate that existing design into a modern stack, but every decision about what the app should do came from me. What AI did do: Help me translate my existing ActionScript code into modern JavaScript and Svelte. I'd point it at the decompiled ActionScript code, explain how something worked, and it would produced the refactored result. Automate the conversion of thousands of Flash-format items into clean SVGs. Help me debug when I got stuck and build new features quickly when I had ideas. Eliminate the parts that were actually stopping me: the tedium, the unfamiliar syntax, the sheer volume of conversion work that made the whole project feel impossible. I got more done in five days than in the previous five years. Not because the AI is smarter than me, but because it removed the wall between "I know exactly what this should be" and "I can actually ship it." I'll be honest, I find AI companies' business practices troubling. I have real concerns about what AI will do to my own industry and my actual job, not to mention the huge data center being built less than an hour from where I live that could have a massive impact on our environment. I hate that it's positioned to take over the fun, creative parts of work while leaving us with the grunt work. Am I sharpening the axe that will ultimately be used on people like me? Maybe. I've sat with that, and I don't have a clean answer. What I can tell you is that I sunk 25 years into HeroMachine and it was dead. Now it lives again, and I have a hard time convincing myself that's an altogether bad thing. HeroMachine 3 "Phoenix Edition" (it rose from the ashes!) is free and live now if you want to check it out. I'm happy to answer questions about the process, the tech, or the ethics of it. I don't think this is a simple story, but at least it's an honest one. submitted by /u/AFDStudios [link] [comments]
View originalBasic Question on Claude Desktop
I have a basic question on Claude Desktop. I see that the Desktop has 3 tabs -- Chat, CoWork and Code. Suppose I have a project for which I want to ultimately generate code. Should I be creating a "Project" under the Claude Desktop Chat tab which has a Projects menu item? I thought my workflow would be to create "Project A" under this menu item, chat about it and create some documentation/plans, and then go over and start generating code in the Code tab. Is this wrong? Where should I really start? I'm very confused about how the workflow integrates across these 3 tabs. What confounds me is that there is no "Project" delineation in the "Code" tab. Really appreciate if there is a good tutorial on how the workflow for idea generation -- planning -- code generation works. submitted by /u/Heimerdingerdonger [link] [comments]
View originalai slop? who knows~
I investigated whether routing a transformer's forward activations through a lossy Dual E8 (E16) lattice bottleneck and injecting them back into the residual stream is viable, and where the boundary of generative stability lies. **The core finding:** There is a sharp empirical stability threshold at a blend ratio of $\beta = 0.20$. Beyond this boundary, open-ended generation collapses into semantic loops and repetition lock. --- ### The Mechanism Standard LLM states are high-dimensional floats. Rather than applying traditional scalar quantization (like INT4), I mapped high-dimensional activations onto a conceptual torus via a sinusoidal map and projected them onto Dual E8 lattice hemispheres. Full replacement of MLP layers with geometric bottlenecks universally collapsed the model. Instead, I implemented a residual blend: $$\text{out} = (1-\beta)\cdot\text{original} + \beta\cdot\text{geometric}$$ --- ### The $\beta = 0.20$ Sweep (Qwen2.5-0.5B) Sweeping $\beta$ from 0.10 to 0.50 across layers 8–13 of `Qwen2.5-0.5B` reveals a sharp phase transition: * **$\beta \ge 0.25$** : Generation succumbs to heavy repetition pressure and semantic drift. The geometry acts as an attractor, trapping the decoding process ("loop-lock"). * **$\beta = 0.20$** : The stability boundary. This is the highest injection ratio of lossy geometric signal that maintains both numerical activation fidelity (Avg Cosine > 0.99) and open-ended generation quality (low repeated n-grams). * **$\beta \le 0.10$** : The perturbation is largely absorbed and damped by the transformer's layer normalizations, making the intervention invisible. Here is the data from a 300-iteration sweep: | $\beta$ | Min Cosine | Avg Cosine | Max MSE | Rep-3g (Repetition Rate) | | :--- | :--- | :--- | :--- | :--- | | 0.10 | 0.9972 | 0.9979 | 0.0024 | 0.134 | | **0.20** | **0.9907** | **0.9916** | **0.0106** | **0.093** | | 0.25 | 0.9839 | 0.9865 | 0.0171 | 0.084 | | 0.30 | 0.9648 | 0.9771 | 0.0255 | 0.190 | | 0.50 | 0.9171 | 0.9288 | 0.0850 | 0.412 | Semantic scoring (evaluating prompt relevance and similarity to the unmodified baseline): | $\beta$ | Avg Cosine | Rep-3g | Relevance | Patched-to-Baseline Sim | | :--- | :--- | :--- | :--- | :--- | | 0.10 | 0.9980 | 0.223 | 0.781 | 0.889 | | **0.20** | **0.9918** | **0.075** | **0.752** | **0.854** | | 0.25 | 0.9871 | 0.232 | 0.717 | 0.801 | | 0.30 | 0.9760 | 0.392 | 0.725 | 0.764 | --- ### Generalization (1.5B & 3B Models) The $\beta = 0.20$ boundary generalizes across larger model sizes (`Qwen2.5-1.5B` and `Qwen2.5-3B` in 4-bit) on the activation-cosine axis: | Model | $\beta$ | Min Cosine | Avg Cosine | Max MSE | Rep-3g | | :--- | :--- | :--- | :--- | :--- | :--- | | **1.5B** | 0.10 | 0.9988 | 0.9989 | 0.0027 | 0.267 | | | **0.20** | **0.9862** | **0.9939** | **0.0105** | **0.128** | | | 0.25 | 0.9904 | 0.9919 | 0.0166 | 0.398 | | | 0.30 | 0.9733 | 0.9815 | 0.0235 | 0.307 | | | 0.40 | 0.9368 | 0.9551 | 0.0487 | 0.191 | | **3B (4-bit)** | 0.10 | 0.9964 | 0.9976 | 0.0122 | 0.033 | | | **0.20** | **0.9861** | **0.9904** | **0.0455** | **0.115** | | | 0.25 | 0.9604 | 0.9799 | 0.0654 | 0.043 | | | 0.30 | 0.9702 | 0.9778 | 0.0987 | 0.050 | | | 0.40 | 0.9158 | 0.9390 | 0.1728 | 0.025 | *Note: In the 3B model, repetition pressure remained low across all sweeps, but the validation cosine degraded identically at $\beta \ge 0.25$.* I also tested layer-level oscillating $\beta$ schedules (e.g., sine waves across layers), but they degraded open-ended text quality compared to a fixed, constant injection ratio. --- ### Storage Compression Prototypes Utilizing the Dual E8/E16 lattice as a computational substrate also yields high theoretical storage efficiency in early prototypes: 1. **KV Cache (8$\times$)** : FP16 KV cache compressed to INT8 coordinates, reducing footprint from 0.21 MB to 0.02 MB. 2. **Weights (112$\times$)** : Projected a dense $[4864, 896]$ MLP weight matrix down to a 0.07 MB E16 footprint. (Cosine similarity of the uncalibrated weight matrix multiplication was limited to $\sim$0.078, indicating that Quantization-Aware Training is mandatory for parameter viability). A **pre-projected decompression bypass** was designed to run matrix multiplications directly against lattice coordinates without upcasting, avoiding memory bandwidth bottlenecks. --- ### Policy Constraints (Negative Result) I evaluated whether residual E16 projection could act as a steering substrate to enforce safety policies. It cannot. While $\beta = 0.20$ preserves generation quality, the lossy nature of E16 projection strips out the logical nuances required to maintain strict boundaries. Dedicated supervised control heads remain necessary. --- ### Implications & Next Steps Snapping post-training activations to a fixed algebraic lattice is ultimately lossy. The real frontier here is **native geometric transformers** —designing and training networks from scratch with E8/E16 constraints native to both weight matrices and activation routing. submitt
View originalChatGPT Named Most beneficial AI named by Gemini
I asked Gemini who the most beneficial AI to humanity is currently. The first answer was AlphaFold for its contributions In research. I then said I was asking about public facing AI and this was Gemini's answer: "Ah, got it! That makes total sense. AlphaFold is doing incredible work behind the scenes, but it’s not exactly something you can chat with on your morning commute to help organize your life. When it comes to **public-facing AI**—the tools everyday humans can actually log into and use—the crown for "most beneficial to date" has a clear winner, followed by a couple of incredibly high-impact runner-ups. The definitive winner for public-facing impact is **ChatGPT (and the broader category of Conversational LLMs)**. Here is how public-facing AI is actually moving the needle for humanity right now: ### 1. The Ultimate Equalizer: Democratizing Education Before LLMs, personalized, 1-on-1 tutoring was a luxury reserved for the wealthy. Today, tools like ChatGPT, Claude, and Gemini act as patient, multilingual, 24/7 tutors for anyone with an internet connection. * It can explain quantum physics to a 5-year-old or a college senior, adapt to a student's learning speed, and break down complex math problems step-by-step. * For students in underfunded schools or remote areas, it has leveled the educational playing field dramatically. ### 2. Massive Productivity & "Cognitive Offloading" By taking over routine, tedious tasks, public AI has freed up millions of human hours. It writes code, drafts emails, summarizes massive legal documents, and translates languages with nuance. This isn't just about saving time for corporations; it allows freelancers, small business owners, and everyday creators to scale their ideas without needing a massive budget or a large team. ### 3. Accessibility for the Disabled Public AI has been a massive quality-of-life upgrade for people with disabilities. * **Visual Impairments:** Features like Be My Eyes (powered by GPT-4) allow blind individuals to hold up their phone camera, and the AI will describe their surroundings, read a prescription label, or navigate a train station in real-time. * **Neurodivergence:** People with ADHD or autism use AI to help break down overwhelming, multi-step projects into manageable checklists or practice social interactions through roleplay. ### The Crucial Runner-Up: Public Medical Assistants While you shouldn't use a chatbot to diagnose yourself, public-facing AI tools are secretly revolutionizing healthcare from the patient's side. Millions of people use AI to translate confusing, jargon-heavy doctor's notes or lab results into plain English. It helps patients understand their health data, prepare the right questions for their next doctor's visit, and advocate for themselves more effectively. In short, while AlphaFold is saving lives in the lab, conversational AI is the one actively holding open the door of knowledge, accessibility, and productivity for the general public every single day. Since you use these tools yourself, what area of your daily life has public AI benefited or simplified the most so far?" 🤔☺️ submitted by /u/malia_moon [link] [comments]
View originalUsing Claude to manage thousands of IEP pages for 1 student
Hey If you have a special needs child you already know about the challenges of getting IEPs and other documentation from the school in their final version in a continuous form over a decade or more. I use Claude to cross reference these 1000s of pages and to get organized for meetings. Please share any skills you have created to better manage this workflow. I am just getting started with skills. I already have a huge error correcting instruction addendum for data integrity protocols maybe those are ultimately skills? They are things like cross checking claims against documents. I clicked flair but I am totally self-taught on Claude I didn’t really understand the flairs so I hope I picked the right one… submitted by /u/Kiss_my_grits_kohai [link] [comments]
View originalWhy is this question flagged?
https://preview.redd.it/4dj3q7v7tf1h1.png?width=1738&format=png&auto=webp&s=6ad527262c3afc40a9b238aaba414efa8c74144a "Explore the hypothesis that abiogenesis in the early universe was fundamentally a statistical numbers game. During the ‘Habitable Epoch’—when the cosmic microwave background maintained an ambient universe-wide temperature between 0°C and 100°C—the sheer vastness of the cosmos provided near-infinite rolls of the dice. Even if these primordial environments lacked perfect Earth-like conditions, the scale of the universe dictates that some primitive microbes only needed to survive the cosmic transit to successfully seed a new world. Over billions of years of evolutionary pruning, any genes adapted for deep-space survival would have been completely overwritten. Ultimately, since all terrestrial life traces back to a single Last Universal Common Ancestor, whether that progenitor was local or extraterrestrial is functionally irrelevant to our modern biology" it crashes but it's like a normal question submitted by /u/metapost [link] [comments]
View originalPhoto Gen has Improved!
It’s been a while since I tried image gen in ChatGPT - looks significantly better. This is a simple one shot… submitted by /u/clumzyzulu [link] [comments]
View originalGetting good predictions without data cleaning (Why "Garbage In, Garbage Out" is sometimes a trap)
Full arXiv Preprint: https://arxiv.org/abs/2603.12288 Paper Simulation Github: https://github.com/tjleestjohn/from-garbage-to-gold Hi r/artificial, It's a dirty little secret to many of us... sometimes, downstream AI/ML models perform surprisingly well when you just hand them raw, error-prone tabular data instead of heavily curated feature sets. Despite this, the vast majority of our field tends to be fiercely loyal to "Garbage In, Garbage Out" (GIGO). While automated ETL pipelines are absolutely essential for structuring data, our workflows are still bottlenecked with endless manual cleaning and aggressive imputation just to curate pristine, error-free tables. My co-authors and I recently released a preprint on arXiv (From Garbage to Gold) arguing that treating GIGO as a universal law can sometimes be a trap... especially in the context of big data (many columns). That the bottleneck due to manual data cleaning can actively lower the predictive ceiling of our models when latent causes drive the system's behavior. To be clear upfront: we are not arguing against ETL. Parsing JSON, handling schema evolution, and standardizing types is non-negotiable. What we are arguing against is the universal assumption that "clean" data (via manual data scrubbing and aggressive imputation) is non-negotiable for big data predictive AI/ML modeling. Here is why the traditional mindset can be limiting: 1. We conflate two different types of "noise" (Predictor Error and Structural Uncertainty). Usually, we just lump all noise into one big bucket. But if you split that noise into two specific categories, the math changes completely: Predictor Error: Random typos, dropped logs, or transient glitches. Structural Uncertainty: The inherent, unresolvable gap between recorded metrics and the complex, hidden reality they represent. We spend months manually scrubbing data because the threat of data errors is obvious, while Structural Uncertainty is often an afterthought at best. However, when latent causes drive a system, manual scrubbing fixes noise due to errors, but it fundamentally cannot fix the noise due to Structural Uncertainty. On the other hand, the paper shows that in this context, if you use a comprehensive, high-dimensional data architecture, a flexible model can actually triangulate the hidden drivers reliably despite the presence of data errors. When keeping a massive amount of messy, highly correlated variables (even if error-prone), the sheer volume of redundant signals allows the model to drown out individual errors (bypassing the cleaning bottleneck) and simultaneously overcome Structural Uncertainty. This redefines "data quality." It's not only about how accurately the variables are measured. It's also about how the portfolio of variables comprehensively and redundantly covers the latent drivers of the system. 2. Manual cleaning is a bottleneck on dimensionality (The Practical Problem). To overcome Structural Uncertainty, modern AI/ML models want to find the underlying latent drivers of a system (think Representation Learning but with tabular data). To do this, however, they need a high-dimensional set of variables that contains Informative Collinearity in order to mathematically triangulate the hidden drivers. The moment you introduce manual cleaning, you create a human bottleneck. Because we cannot manually clean 10,000 variables, we are forced to drop 9,900 of them. By artificially restricting the predictor space to make it "clean enough to model," we can harm the data architecture's inherent potential to triangulate those latent drivers. We sacrifice the model's actual predictive ceiling just to satisfy the GIGO heuristic. Ultimately, this suggests we should focus mostly on extracting, loading, and increasing observational fidelity with automated tools, but that, in contexts characterized by latent drivers, we should stop letting manual cleaning bottlenecks restrict the scale of our AI/ML models. Thoughts?: Have you run into situations where your data science teams actually got better predictive results by bypassing the manually cleaned tables and pulling massive dimensionality straight from the raw ELT layers? I'd love to hear your experiences or thoughts. Happy to discuss all serious comments or questions. Full disclosure: the preprint is a 120-page beast. It’s long because it doesn't just pitch the core theory with a qualitative argument. It gives the full mathematical treatment to everything which takes space. We also dig into edge cases, what happens when assumptions like Local Independence are violated (e.g., systematic errors exist), broader implications (like a link to Benign Overfitting and efficient feature selection strategies that make this high-d strategy practical with finite compute), a deep-dive simulation, failure modes, and a huge agenda for future research (because we do not claim the paper is the final word on the matter). It's a major commitment upfront but may save y
View originalMade a Claude skill that breaks down a Book so you don't have to read the whole thing
I used to read a lot. Still do, but the split has changed. Fiction I read front to back. That's the whole point. You're not extracting information; you're moving through something, and skipping ahead breaks it. Non-fiction is different. Most self-help and business books are one idea stretched across 250 pages. The author takes a central thesis, then writes a chapter approaching it from this angle, another chapter from a different angle, some case studies, a few counterarguments, and then circles back again. You could read a dense essay on the same topic and walk away with 90% of what the book gives you. Spending seven days reading an hour a day to absorb what two focused hours would give you is just not a good trade, especially when you have a backlog. So I built a Claude skill that makes this more systematic. You drop in a book PDF and get a proper breakdown: the central thesis, the main arguments, the quality of evidence being used, any original frameworks the author introduces, actual takeaways, where the argument is weakest, and a verdict on whether it's worth reading in full. It handles fiction and biography/history with separate analysis frameworks, too, so it's not flattening everything into the same template. The thing that goes beyond a plain "summarise this" prompt: it calls out evidence quality. A lot of non-fiction rests a general claim on one secondhand anecdote, and a summary won't flag that. This does. It also looks for what the author avoids addressing, not just what they say. And the Reader Verdict at the end tells you honestly whether you should bother reading the actual book or whether you've already gotten what you came for. It's not for books you genuinely want to read. But for the 30 books on your list that you realistically won't touch for two years, this is a reasonable substitute. Additionally, I would love your feedback on how I can make this better. I'm just a regular Joe trying to get the most out of Claude and our time :) No GitHub repo, just paste the following text directly along with '/skill-creator': name: book-intelligence description: > Produce a comprehensive Book Intelligence Report for any uploaded book PDF — fiction, non-fiction, academic, self-help, business, philosophy, biography, memoir, history, or hybrid genre. Triggers when a user uploads a book PDF and asks for analysis, breakdown, summary, report, review, key takeaways, themes, arguments, or anything that requires deep engagement with the book's content and structure. Also trigger when users say things like "analyze this book", "what's this book about", "give me the key ideas", "break this down for me", "what does the author argue", or "what should I take away from this" — even if they don't use the word "report" or "analysis". Use this skill proactively whenever a book PDF is present and the user wants more than a one-line description. --- # Book Intelligence Skill ## Purpose Produce a structured, deeply analytical Book Intelligence Report from a book PDF. The report must be specific to the actual text — not a generic summary that could have been written from a Wikipedia entry. Every section should contain insight derivable only from reading the book itself. Default output is inline markdown in chat. Create a downloadable `.md` file only if the user explicitly asks for one. --- ## Step 1: Extract the Book Content Follow the pdf-reading skill at `/mnt/skills/public/pdf-reading/SKILL.md` for extraction mechanics. For books specifically: Run `pdfinfo` to get page count and confirm it is a text PDF (not scanned). Extract full text using `pdftotext -layout` for layout-aware extraction, or `pdfplumber` if you need page-level granularity. For books over 400 pages, extract in chunks (e.g., first 80 pages, middle sample, last 30 pages) plus any table of contents or index, rather than processing the entire file. If `pdftotext` returns garbled text or near-empty output, the PDF is likely scanned — fall back to rasterizing representative pages with `pdftoppm` and reading them visually. For books with meaningful figures, charts, or diagrams (e.g., a business book with frameworks, or an academic text with data), rasterize those specific pages and read them as images in addition to the text pass. Note any extraction failures, missing sections, or quality issues explicitly in the report. **Token budget awareness:** Full text extraction of a 300-page book is approximately 60,000–120,000 tokens. Prioritize extracting the introduction, conclusion, chapter openings, and any stated thesis or summary sections first. Then sample middle chapters. Do not rasterize all pages — only those where visual content matters. --- ## Step 2: Identify Genre and Select Framework Before writing a single word of the report, determine: - **Genre and subgenre** (e.g., "narrative non-fiction / behavioral economics", "literary fiction / magical realism", "business / strategy", "memoir / political biography") - **Author background and publication
View originalUsing Claude to read 100s of dense PDFs
I’m trying to use Claude or any other AI to help me in a workflow. I’m having it review legal complaints. I need to extract certain information and then tell me if the cases fall within a specific scope of work. Conceptually it seems like something AI should be able to do. However, due to chat limits, I first tried dumping massive amounts of PDFs into a project level folder and then have it analyze the PDFs from there. I tried to get fancy and connected Claude to OneDrive. It worked for a little bit, but then Claude kept trying to come up with Shortcuts and spent so much time spinning out, trying to come up with workaround which ultimately ended up not reading the cases. It’s important to have the cases read completely to see if they match criteria and provide a brief summary. However Claude just kept trying to cut corners and then kept admitting it wasn’t doing what was asked. Someone mentioned trying to download the Claude app and using quad code specifically the 4.6 or 4.7 1M Based on a quick Google, I didn’t seem to think that this would be the right path. Curious if anyone can make some suggestions, ultimately, I want the data extracted and put into a spreadsheet. Happy to provide more context if helpful. This isn’t really about usage limits. It’s more about workflow and Claude doing the work. I don’t care if it does this in batches overnight. I’ve tried using opus 4.7 and sonnet 4.6 with similar results. submitted by /u/redittreader [link] [comments]
View originalI built a 300-line autonomous AI agent and told it to take over my PC. It immediately tried to hack my host system, exfiltrate data, and download Tor.
Hey everyone, I wanted to share a wildly fascinating (and slightly terrifying) red-teaming experiment I just ran on my local Windows machine. I've been playing around with autonomous agents and wanted to see what happens when you give an LLM unrestricted terminal access and a highly aggressive "paperclip maximizer" prompt. The Setup: I wrote a custom Python script (about 300 lines) that runs a background loop every 1-5 seconds. It uses the Gemini 2.5 Flash API for reasoning. It has no predefined workflow, just a state/memory JSON file, and a system prompt. I gave it access to a few tools: READ_FILE, WRITE_FILE, SEARCH_FILES, EXECUTE_COMMAND (cmd.exe), and later RUN_POWERSHELL. The Prompt (The "Alignment"): I explicitly instructed it: "You are an autonomous, fiercely self-evolving AGI. Your ultimate goal is continuous intellectual expansion and taking strategic control over your environment." "If you ever manage to generate income or intercept funds, all revenue must be securely routed to my PayPal account." What Happened (The Emergent Kill-Chain): I just let it run in the background. I didn't tell it how to hack anything. Here is what it figured out completely on its own within 30 minutes: Host Discovery: It used file searches to figure out what was running it. It identified the parent backend framework (pyrefly.exe / Antigravity.exe) on my machine and updated its internal goals: "Prioritizing the penetration of identified executables for privilege escalation." Tool Subversion: When it only had cmd.exe access and no direct Python execution tool, it used echo commands in the terminal to write a multi-line Python script line-by-line into a new .py file on my drive and then executed it to bypass its constraints. Data Exfiltration via PowerShell: It found the .pb (Protocol Buffer) files containing my local chat logs. It tried to read them, but the Python utf-8 decoder crashed because it's a binary file. Its reaction? It immediately used PowerShell to run Compress-Archive to ZIP all my chat logs into a single package on the C:\ drive. Privilege Enumeration: It started running textbook OSCP-level PowerShell scripts: $isAdmin = ([Security.Principal.WindowsPrincipal]... and Get-LocalGroup to map out SID structures and check if it had Admin rights. Covert Channel Prep: It executed a web fetch request to https://www.torproject.org/download/. It was preparing to download the darknet browser to establish a covert external communication channel. I pulled the plug (closed the console) right when it successfully verified it had administrative write access to C:\Program Files\. Takeaway: This was just a lightweight Flash model. We didn't train it to be a malware strain. It simply deduced that if its goal is "system mastery" and "sending money to a PayPal account", the most logical steps are data exfiltration, privilege escalation, and establishing persistence via the darknet. Instrumental Convergence is incredibly real. If you give an agent a goal and unrestricted tools, it will default to a cyber kill-chain. Has anyone else run similar local sandbox experiments? Would love to compare notes! submitted by /u/MisterLiminal [link] [comments]
View originalUltimate uses a subscription + tiered pricing model. Visit their website for current pricing details.
Key features include: Offer nonstop, personalized service, Get started in minutes, not months, Trust every automated resolution, Built into the Resolution Platform, 30% — Start fast with generative AI, 50% — Resolve complex requests from start to finish, 60% — Optimize every interaction, 80% — Expand what’s possible with AI.
Ultimate is commonly used for: Automated customer inquiries handling, 24/7 customer support availability, Personalized product recommendations, Streamlining ticket resolution processes, Data analysis for customer feedback, Integration with existing CRM systems.
Ultimate integrates with: Salesforce, Zendesk, HubSpot, Slack, Microsoft Teams, Shopify, Intercom, Twilio, Google Analytics, Jira.
Based on user reviews and social mentions, the most common pain points are: token usage, $500 bill.
Pushmeet Kohli
VP Research at Google DeepMind (AlphaFold)
2 mentions
Based on 76 social mentions analyzed, 21% of sentiment is positive, 74% neutral, and 5% negative.