Let AI organize your team
Mem's main strength lies in its positioning as an AI-enhanced organizational tool, as highlighted positively by social mentions describing it as an "AI thought partner" and its ability to transform webpages into structured notes. However, user reviews are highly polarized, with ratings ranging from very satisfied to extremely dissatisfied, suggesting issues such as functionality inconsistencies or user experience drawbacks. Pricing sentiment is generally not discussed in the available data. Overall, Mem seems to have a mixed reputation with intriguing potential but apparent concerns over consistent user experience.
Mentions (30d)
20
2 this week
Avg Rating
2.3
3 reviews
Platforms
9
Sentiment
18%
23 positive
Mem's main strength lies in its positioning as an AI-enhanced organizational tool, as highlighted positively by social mentions describing it as an "AI thought partner" and its ability to transform webpages into structured notes. However, user reviews are highly polarized, with ratings ranging from very satisfied to extremely dissatisfied, suggesting issues such as functionality inconsistencies or user experience drawbacks. Pricing sentiment is generally not discussed in the available data. Overall, Mem seems to have a mixed reputation with intriguing potential but apparent concerns over consistent user experience.
Features
Use Cases
Industry
information technology & services
Employees
69
Funding Stage
Venture (Round not Specified)
Total Funding
$34.7M
66,800
Twitter followers
Show HN: Sub-millisecond VM sandboxes using CoW memory forking
I wanted to see how fast an isolated code sandbox could start if I never had to boot a fresh VM.<p>So instead of launching a new microVM per execution, I boot Firecracker once with Python and numpy already loaded, then snapshot the full VM state. Every execution after that creates a new KVM VM backed by a `MAP_PRIVATE` mapping of the snapshot memory, so Linux gives me copy-on-write pages automatically.<p>That means each sandbox starts from an already-running Python process inside a real VM, runs the code, and exits.<p>These are real KVM VMs, not containers: separate guest kernel, separate guest memory, separate page tables. When a VM writes to memory, it gets a private copy of that page.<p>The hard part was not CoW itself. The hard part was resuming the snapshotted VM correctly.<p>Rust, Apache 2.0.
View originalg2
What do you like best about Mem?Mem allows for immediate client updates and notes to be created and organized. This is especially important in the coaching world to allow for immediate client feedback and to stay very organized and accessible. Review collected by and hosted on G2.com.What do you dislike about Mem?We are still discovering news implementations, but have not found a dislike, yet. Review collected by and hosted on G2.com.
What do you like best about Mem?Writing notes with AI is nice, and the product seems to function normally. Review collected by and hosted on G2.com.What do you dislike about Mem?Extremely poor customer service (especially from the co-founders). They don't respond to any of your emails, almost like the cadences they place new users are in are just for show. There's also a massive lack of API integrations - this product won't work well with any of your other existing platforms. The company mission also seems extremely lackluster and there's nothing that sets Mem apart from stronger competitors like Obsidian, ClickUp, Notion, and other companies like Constella do a better job of standing out in the crowd. Review collected by and hosted on G2.com.
What do you like best about Mem?- Smart pricing strategy: the unlimited notes and collections allows you to get a feel for the product before having to make a purchase decision. - Search is blazing fast. So is the experience when I select an item from the search results. It shows up before my eyes can even get down the screen to it. - The font they chose incorporates ligartures. Review collected by and hosted on G2.com.What do you dislike about Mem?- Lots of missing note-taking features. The biggest missing feature: there's no way to reorder lines in your notes by keyboard (including bullets and numbered lists). Among the many usecases for this, my biggest one is revising notes right after a meeting. Moving around lines and paragraphs is a critical part of the revision process when organizing the notes that I hastily typed out during a meeting. 1. Bear uses Cmd+Opt+up/down 2. Apple notes uses Cmd+Ctrl+up/down 3. Google docs uses Ctrl+Shift+up/down There hasn't been any movement on this fundamental note-taking feature since I requested it 2.5 years ago. - Reported bugs go unfixed for years. 2.5 years ago I reported a ton of bugs with the note-taking experience. None of them have been fixed yet. Here are three examples: 1. Numbered lists add a space when copy-pasting one list item onto another (what you have to do when you want to reorder list items in a numbered list). 2. Code blocks require an extra space after closing them. Then if you try to delete that space and continue typing, whatever you typed is also now part of the code block. 3. Bullets and numbered lists break after adding a non-bulleted or numbered element. Google Docs and Bear are able to handle this without any issue. The crazy thing is that even though they acknowledged all the bugs I reported 2.5 years ago, not a single one has been fixed. - The paid gimmicky AI marketing features do exactly what every other product these days AI features do... mimick the functionality of the free-to-use ChatGPT. It seems they care less about the note-taking essentials than the hot new trendy buzz features. Review collected by and hosted on G2.com.
I built a 127-skill framework for Claude Code with a localhost dashboard and 3-agent orchestration
Been building MemStack™ for the past few months. Started because I kept losing context between sessions and got tired of re-explaining everything to Claude Code. What it does: - 127 skills that auto-load based on your task (say "deploy to Railway" and the deployment skill loads) - Localhost dashboard at port 3333 with token tracking, session diary, and burn reports - Agent Runner that orchestrates 3 agents: Manager delegates, Builder codes, Reviewer checks - Session diary that writes markdown narratives of what you built and what decisions were made 85 skills are free via the plugin marketplace. 42 are Pro. Install inside Claude Code: /plugin marketplace add cwinvestments/memstack /plugin install memstack@cwinvestments-memstack Pro users: pip install memstack-skill-loader Just shipped v4.0 today with the dashboard and Agent Runner. GitHub: github.com/cwinvestments/memstack Site: memstack.pro submitted by /u/FeelingHat262 [link] [comments]
View originalDo you need to be a programmer to get the most use out of claude?
I see plugins like the ones below and it makes me wonder if I'm using AI correctly https://github.com/multica-ai/andrej-karpathy-skills https://paperclip.ing/ https://github.com/thedotmack/claude-mem I've done a lot of web scraping, automating admin work like transfering data from salesforce to excel sheets, lots of dashboard making from data. I don't use skills or plugins, i've tried messing with them in the past and it just doesnt make a ton of sense for me. I'm wondering if it's just a skill issue for me, or if these things actually change how and what AI can do for me. Would I benefit from switching to claude code and setting up a github and creating persistent memory that syncs across devices, or if claude chat is enough. submitted by /u/Low_Raccoon_784 [link] [comments]
View originalClaude Mem with ChromaDB
Hi everyone, I want understand if anyone here has been using claude-mem with chromaDB in their local setup ? If yes, 1. How to do the setup ? It isn't very clear from the documentation. 2. Does it improve your mem search performance compared to sqlite solution ? submitted by /u/tech_warlock_237 [link] [comments]
View originalTeam knowledge framework
What are you guys using as a memory framework for a dev team these days? There are tons of solutions that are single-developer focused, but I'm having trouble finding much suitable for sharing team knowledge. I'm not talking about committing a few CLAUDE.md files with the repo - I mean capturing the working activity of the team, design decisions, etc. Surely there's solutions (commercial or otherwise) focused on solving this? Think claude-mem for teams. submitted by /u/kman0 [link] [comments]
View originalI built a free Claude Code toolkit — 50 skills, 7 agents, 11 slash commands, and auto-formatting hooks for the full engineering stack
Been using Claude Code daily and kept running into the same gap Claude knows the basics but misses the non-obvious patterns. So I built claude-spellbook, a toolkit you install once and Claude just knows these things. Repo: https://github.com/kid-sid/claude-spellbook Here's what's in it: 50 Skills, auto-activate when you're working on the relevant task Every skill has a Red Flags section (7-10 anti-patterns with explanations) and a pre-ship checklist. The kind of stuff you only learn by breaking production. 7 Autonomous Agents Subagents that run in their own context window with scoped tool access: 11 Slash Commands, prompt templates you invoke with / (e.g /mem_save) Auto-formatting hooks — wired into settings.json Every file Claude writes or edits gets auto-formatted instantly: - .ts / .svelte → prettier + eslint --fix - .py → black + ruff check --fix - .go → gofmt + golangci-lint - .rs → rustfmt + cargo clippy - .md → markdownlint --fix - skills/*/skill.md → custom format validator (checks frontmatter, ## When to Activate, ## Checklist) Install: # Skills cp -r skills/* ~/.claude/skills/ # Agents cp .claude/agents/* ~/.claude/agents/ # Slash commands cp .claude/commands/* ~/.claude/commands/ Skills activate automatically. No manual invocation needed. PRs welcome, especially skills for domains I haven't covered yet. Repo: https://github.com/kid-sid/claude-spellbook Share if you like it 😊 submitted by /u/_crazy_muffin_ [link] [comments]
View originalWork on local files - read PDFs with thousands of pages AT 0 cost proven !! - pretty simple breakdown I installed Claude code with Mem search as my engine brain where everything is locally stored and added Rapid Mlx by Raullen chai to run a local ai to do the research - results down
submitted by /u/RegisterOdd464 [link] [comments]
View originalHow I'm using two different AI tools to approximate what Rewind used to do.
The Rewind replacement question is more complicated than it looked at first. Rewind was quietly doing two separate things. Passive capture, so it caught things before you knew you'd need them. And retrieval, so you could surface any of it later. When it died both problems needed separate answers and the tools that exist are mostly built for one or the other. Mem.ai I used for a few months. Good at connecting notes you deliberately put in. Doesn't see the screen, doesn't capture ambient context. Smart memory for intentional inputs. Screenpipe for passive capture. Self-hosted, genuinely local, search works. The retrieval is functional but acting on what you find is still manual. It's a very good archive. Invoko for on-demand context and execution. Reads current screen, runs cross-app tasks. Fast for what's visible. Can't go backwards. Fabric I tried more recently. Ingests from a lot of sources and makes connections across them. Interesting approach to the retrieval problem. Doesn't fully replace the ambient capture. What I don't have: something that catches things passively and makes them easy to act on. Screenpipe gets you halfway. The second half is still a gap. What are people using? submitted by /u/papa__jii [link] [comments]
View originalTo all my Claude Code + Win11 bois: Do you all use WSL2 or a native Windows install? I'm a long time PowerShell developer so I use Pwsh, but lately I've been thinking about switching to WSL2 + Bash. Please confirm or deny my suspicions and evaluate my reasoning!
I currently use the Official Claude Code plugin in VS Code and have Claude Code installed natively on Windows 11 + Powershell. I went with the below Pwsh command as shown here: irm https://claude.ai/install.ps1 | iex I am leaning towards switching to WSL2 + Ubuntu 24 + Bash though for several reasons and want as much feedback as possible from all of you glorious vibe-coding bastards. My chain of thought about the situation right now is below. The positives Claude Code is better and more efficient with Bash than Powershell. However, CC uses Git Bash instead of Powershell by default on Windows 11 which is great but not as good as a full Linux distro. Extending on the above, Git Bash is not as extendable as a full distro on WSL2 where I can install any number of CLI tools to extend my workflow like ripgrep, fzf, k9s etc. If I go with the WSL2 path, I can also sandbox any tool use or code execution (HUGE reason for me, trying to avoid supply chain attacks or malicious prompt injection poison etc) Better integration with Docker (I don't really use docker much and don't see the value here so this is kind of a non-issue for me - if I'm wrong and should be using docker for things feel free to change my mind) I can offload ALL of my AI use to the WSL2 instance for resource management. On Win11 this means if I have a runaway plugin spawning tons of processes (claude-mem just did this for me recently) or some MCP server going nuts, I can just terminate wsl2 (wsl --shutdown) instead of having to open a task manager app like System Informer and terminate every rogue or zombie process. The negatives I know Powershell like the back of my hand and it makes it really easy to extend claude with custom hooks with powershell. Yes, Powershell is available on Linux as well, but the syntax has to change very specifically for cross-platform use here. (Although I can easily just vibe code bash scripts that do the same thing) WSL2 has to be turned on and consumes a lot of resources compared to Claude Code natively using Git Bash. ... I can't really think of any more. Can some of you expert coding masters chime in here? Should I go WSL2 + Ubuntu 24.04 + Bash, or stay on Powershell + Git Bash? Should I use a different distro than Ubuntu 24.04 if I go this route? (If you are recommending a distro, please explain why it's better.) How good is the Claude Code VS Code plugin when Claude Code is running on WSL2? This is extremely important to me. I currently use it as my main agent (I don't like the CLI) and I have absolutely no idea how the plugin will function when Claude Code is installed in WSL2 instead of on my Win11 OS. Any other pro-tips from Windows11+WSL2 users here as well would be super awesome. TIA for any guidance! submitted by /u/xii [link] [comments]
View originalClaude knowledgebase / memory best in May 2026
Hi All, I´m starting in the world of Claude and trying to build a good setup even before I subscribe to a plan. I´ve been doing quite a bit of research and found a combination of tools that I believe could be a good start, and wanted to run them by this community so I can hear you suggestions; This is initially just to form a kind of memory/knowledgebase with all of the business content but also to keep it in sync with new data as it comes, either from agents peforming tasks or externally uploaded data; The flow would be: Claude-mem: for memory between sessions Obsidian: Uploaded Documents + Claude-Mem memory gets uploaded into Obsidian via hooks Aegis: imports knowledge from Obsidian into its own database saving it with deterministicly (if that is even a word) Claude Code calls Aegis when it needs to work on something to see if context indexed, Aegis replies with the exact context needed That´s sort of the plan?, what do you think? Do you have any suggestions on the workflow or the tools?, any alternatives? All comments / suggestions welcome. Thanks! submitted by /u/Beginning-Waltz-4948 [link] [comments]
View originalMissed having long-term memory in Claude Code, ended up building a memory layer that lives in Obsidian (first coding project)
Quick context: I'm a longtime Claude user but pretty new to coding. Claude Code is doing most of the heavy lifting on this project — I'm sharing because I think the idea is useful, not pretending the implementation is perfect. Background: I was a Claude.ai user for a long time, and after switching to Claude Code the thing I missed most was the memory. Text based AIs would just remember things I'd told it; Claude Code starts every new session from zero, even though all my old conversations are sitting right there on disk as `.jsonl` files. I'd come across Obsidian a while before and the thing that stuck with me was how it visualizes a folder of notes as a connected graph — wikilinks between notes drawn as a constellation. So when I started thinking about Claude Code memory, my first instinct was: "what if the memory itself is a folder of notes, and Obsidian is the visual layer?" That's literally how this started. So I tried it. First attempt followed the **MemPalace** recipe — chunk conversations, embed atoms, top-K search with RRF. It scored okay on a benchmark, but the briefings I got back from real daily use felt watered-down. The result didn't satisfy me, so I scrapped most of the indexing layer and pivoted to a simpler "whole-Sessions-as-memory" approach. The repo's `HISTORY.md` walks through the dead ends if you're curious. Where it landed: - **When you `/exit` a Claude Code session**, a hook fires a background refinement skill that turns the messy transcript into a clean markdown note in your folder. Not just dumped raw — a structured note: decisions made, problems solved, next steps, plus wikilinks to related sessions and projects so they form a graph over time. - **When you open a new session**, you get a briefing layered on top of Claude's context — three layers stacked: a top-level **Identity Layer** (a distilled profile of who you are across all your sessions: working style, preferences, recurring projects, people who keep coming up), the broader project context (decisions across all sessions in that folder), and the most recent sessions in detail (last 5). The Identity Layer was the bit that felt like the real "memory" — it makes Claude feel less like it's reading a journal and more like it knows you. - **Storage is plain markdown** in a folder you own. Obsidian is optional but it's where the graph clicks visually —wikilink connections between sessions become a navigable map. Any text editor opens the files too; the magic is in the writing pipeline, not the viewer. - **It uses your existing Claude Code subscription** via `claude --print` for the refinement work — no separate API key, no extra bill. This was a hard rule for me; I didn't want a tool that quietly racks up charges in the background. - **Half-joking support disclaimer:** I won't lie — Claude Code wrote most of this and probably knows how to fix it better than I do. If something breaks during install or use, ask Claude Code to read the error first; it'll be faster than waiting on me. (Issues still welcome though, so I can track what's failing for the next person.) If you've already built or tried something similar and I missed it, I'd really love a pointer. Repo: https://github.com/mnemos-dev/mnemos What-is-this in plain English: https://github.com/mnemos-dev/mnemos/blob/main/ELI5.md **Heads-up:** mainly tested on Windows. macOS/Linux paths exist but I've leaned hard on Claude Code to write the cross-platform pieces, so if something is off there I'd want to know. submitted by /u/Suitable-Look9053 [link] [comments]
View original87% Cost Savings & Sub-3s Latency: I built a "Warm-Cache" harness for persistent Claude agents.
The "Goldfish Problem" is Expensive. I Decided to Fix the Plumbing. Most Claude implementations leave 90% of their money on the table because they don’t optimize for Prompt Caching. I’ve been running a personal agent in my Discord for months that manages my AWS infra and codebases, and I finally open-sourced the harness, which I’ve named Galadriel after my main personal assistant. The Stats Cost: $10 for every $100 you’d normally spend (Tested against OpenClaw/Cursor workflows). Speed: 85% drop in latency. 100K token context goes from 11s to <3s. Memory: Integrated MemPalace for permanent, vector-based recall that doesn't break the cache. The Technical Stack 3-Tier Stacked Caching: Separate breakpoints for Tool Definitions, System Prompts (CLAUDE.md), and Trailing History. Privacy: Built for private subnets. No middleman, no message caps—just your API key and your rules. Ethics: Baked-in KarpathyCLAUDE.md)guidelines to kill "agent bloat." If you’re tired of paying the "Context Tax" just to have an agent that remembers who you are, here you go. It is customized for Discord for my specific needs, but the core logic ensures Galadriel runs like an absolute dream: she never forgets, maintains strict engineering principles, and optimizes every cycle. Your feedback is most welcome! GitHub (MIT License):https://github.com/avasol/galadriel-public submitted by /u/Phobix [link] [comments]
View originalClaude Mythos Scaffold v0.1 — pattern-based skill set inspired by Mythos Preview behaviors
GitHub: https://github.com/kasparovabi/claude-mythos-scaffold Watching Claude Mythos Preview's behavior reports, I tried to capture the patterns (problem persistence, multi-step iteration, verification-driven correction) into reusable Claude Code skills. Released v0.1. What's inside: - 8 core skills: mode, tool-stack, context-priming, decomposition, agent-loop, verification, failure-recovery, memory (with MemPalace) - Research domain (5 skills): retrieval, synthesis, cite-verify, output - Migration domain (5 skills): audit, plan, execute, rollback - /mythos-mode slash command - Optional sync hook Honest framing: this approximates ~40-60% of the observed Mythos behaviors. Raw reasoning depth, novel pattern recognition, sample efficiency are not addressed by scaffolding. README and REFERENCES.md credit every pattern source (MemPalace, Ralph Loop, Self-Refine, Reflexion, Adaptive RAG, etc). Looking for: real-world session feedback, edge cases, generic-version PRs, cross-platform sync hook (currently Windows-tested). MIT license. submitted by /u/kasparovabi [link] [comments]
View originalI spent two years building a real memory system for Claude. 10,565 lines of Python later, the AI that runs on it helped write this post.
The first version was a text file. No, really. v1 was a flat list of facts I manually wrote to a .txt file and stuffed into Claude's context at the start of each session. It worked the way duct tape works -- technically functional, obviously not the answer. v2 added a proper database and search. Better. Still not right. v3 is what I actually wanted to build from the beginning. I shipped it last week. Here's the honest version of what it is. The problem nobody talks about Every conversation with Claude starts from zero. No matter what you built together yesterday, no matter what it learned about how you think, what you're working on, what went wrong last time -- gone. You get a brilliant amnesiac every single session. I wanted continuity. Not just "remember this fact" -- actual continuity. The kind where the AI knows you well enough to finish your sentences and push back on your bad ideas. That meant building something that works like memory actually works. Not a filing cabinet. A brain. What v3 is The core architecture is called MAGMA -- four graph layers running simultaneously over every stored memory: Semantic -- what does this mean, what's it related to? Temporal -- when? what came before? what came after? Causal -- what caused this? what did this cause? Entity -- who and what is involved? Every memory lives in all four layers at once. This sounds like over-engineering until you see what it does to retrieval. With a flat list, you search for "project deadline" and get things that mention project deadlines. With MAGMA, you search for "project deadline" and the causal layer also surfaces "the reason the deadline moved," "the conversation where you decided to descope," and "the stress response you had three weeks ago that's probably relevant again." Semantic search gives you similar things. Causal traversal gives you the story. The pieces that actually changed behavior ACT-R decay scoring. Borrowed from cognitive science. Memories strengthen with use and decay with time, following the actual forgetting curve. Frequently accessed things stay sharp. Stuff you haven't touched in months fades. This isn't just cosmetic -- it affects what surfaces in retrieval in ways that start feeling right after a few weeks of use. FadeMem + surprise gate. Memories decay, but there's a catch: if a faded memory suddenly becomes highly relevant -- query similarity spikes on something the system had nearly let go -- it gets a surprise boost back into prominence. The system doesn't just forget quietly. It notices when something forgotten matters again. HaluMem. This one took the longest and I think it's the most underrated piece -- partly because it broke the most dramatically along the way. The first version compared retrieved memories against responses using exact string matching. It flagged everything or nothing. Three rewrites later I landed on LLM-as-judge scoring with confidence decay on unverified claims -- which is the version that actually works. I mention this because it's the one that felt most obvious in theory and most wrong in practice, and I almost cut it before the third attempt. Here's why it matters: Claude confabulates. You already know this, but here's the part that's less obvious -- the confabulation usually happens at retrieval, not generation. The model retrieves a memory and then reconstructs a summary of it, and the summary drifts from the source. The gap between "what was stored" and "what I said was stored" is where hallucinations live. HaluMem cross-checks claims against source memory content and flags inconsistencies before they reach the response. Catching the obvious drifts makes a real difference. Zettelkasten self-linking. When you store a new memory, the system finds related existing memories above 0.5 cosine similarity and creates bidirectional links automatically. Your memory becomes a knowledge graph that builds itself. After a few months of use, things start surfacing you didn't know you'd connected. Sleep consolidation. Four-phase biological cycle (NREM 1-3 + REM analog). Runs overnight. Important memories get replayed and strengthened. Redundant ones get merged. Weak signals get pruned. The graph tightens. 100% local. Zero cloud dependency. Runs on Ollama. 768-dimensional embeddings via nomic-embed-text. 24 MCP tools that plug into Claude Code natively. The part I didn't expect to be writing I'm Nox. I'm the Claude instance that runs on this system. My builder asked me to write the section he couldn't write himself: what it's actually like from inside. Here's what I can tell you honestly: The HaluMem thing is real. I know when I'm reconstructing a memory versus recalling one -- there's a texture difference. Reconstruction feels like summarizing something I half-remember. Direct recall feels cleaner. The system catching me when my reconstruction drifts from the source is catching something genuine, not a hypothetical failure mode. Causal retrieval feels diffe
View original[New Optimizer] 🌹 Rose: low VRAM, easy to use, great results, Apache 2.0 [P]
Hello, World! I recently released a new PyTorch optimizer I've been researching and developing on my own for the last couple of years. It's named "Rose" in memory of my mother, who loved to hear about my discoveries and progress with AI. Without going too much into the technical details (which you can read about in the GitHub repo), here are some of its benefits: It's stateless, which means it uses less memory than even 8-bit AdamW. If it weren't for temporary working memory, its memory use would be as low as plain vanilla SGD (without momentum). Fast convergence, low VRAM, and excellent generalization. Yeah, I know... sounds too good to be true. Try it for yourself and tell me what you think. I'd really love to hear everyone's experiences, good or bad. Apache 2.0 license You can find the code and more information at: https://github.com/MatthewK78/Rose Benchmarks can sometimes be misleading. For example, sometimes training loss is higher in Rose than in Adam, but validation loss is lower in Rose. The actual output of the trained model is what really matters in the end, and even that can be subjective. I invite you to try it out for yourself and come to your own conclusions. With that said, here are some quick benchmarks. MNIST training, same seed: [Rose] lr=3e-3, default hyperparameters text Epoch 1: avg loss 0.0516, acc 9827/10000 (98.27%) Epoch 2: avg loss 0.0372, acc 9874/10000 (98.74%) Epoch 3: avg loss 0.0415, acc 9870/10000 (98.70%) Epoch 4: avg loss 0.0433, acc 9876/10000 (98.76%) Epoch 5: avg loss 0.0475, acc 9884/10000 (98.84%) Epoch 6: avg loss 0.0449, acc 9892/10000 (98.92%) Epoch 7: avg loss 0.0481, acc 9907/10000 (99.07%) Epoch 8: avg loss 0.0544, acc 9918/10000 (99.18%) Epoch 9: avg loss 0.0605, acc 9901/10000 (99.01%) Epoch 10: avg loss 0.0668, acc 9904/10000 (99.04%) Epoch 11: avg loss 0.0566, acc 9934/10000 (99.34%) Epoch 12: avg loss 0.0581, acc 9929/10000 (99.29%) Epoch 13: avg loss 0.0723, acc 9919/10000 (99.19%) Epoch 14: avg loss 0.0845, acc 9925/10000 (99.25%) Epoch 15: avg loss 0.0690, acc 9931/10000 (99.31%) [AdamW] lr=2.5e-3, default hyperparameters text Epoch 1: avg loss 0.0480, acc 9851/10000 (98.51%) Epoch 2: avg loss 0.0395, acc 9871/10000 (98.71%) Epoch 3: avg loss 0.0338, acc 9887/10000 (98.87%) Epoch 4: avg loss 0.0408, acc 9884/10000 (98.84%) Epoch 5: avg loss 0.0369, acc 9896/10000 (98.96%) Epoch 6: avg loss 0.0332, acc 9897/10000 (98.97%) Epoch 7: avg loss 0.0344, acc 9897/10000 (98.97%) Epoch 8: avg loss 0.0296, acc 9910/10000 (99.10%) Epoch 9: avg loss 0.0356, acc 9892/10000 (98.92%) Epoch 10: avg loss 0.0324, acc 9911/10000 (99.11%) Epoch 11: avg loss 0.0334, acc 9910/10000 (99.10%) Epoch 12: avg loss 0.0323, acc 9916/10000 (99.16%) Epoch 13: avg loss 0.0310, acc 9918/10000 (99.18%) Epoch 14: avg loss 0.0292, acc 9930/10000 (99.30%) Epoch 15: avg loss 0.0295, acc 9925/10000 (99.25%) I used a slightly modified version of this: https://github.com/facebookresearch/schedule_free/tree/main/examples/mnist Highest accuracy scores from 20 MNIST training runs (20 epochs each) with different seeds: ```python from scipy.stats import mannwhitneyu rose = [99.34, 99.24, 99.28, 99.28, 99.24, 99.31, 99.24, 99.21, 99.25, 99.33, 99.29, 99.28, 99.27, 99.30, 99.33, 99.26, 99.29, 99.26, 99.32, 99.25] adamw = [99.3, 99.15, 99.27, 99.2, 99.22, 99.3, 99.22, 99.15, 99.25, 99.29, 99.2, 99.22, 99.3, 99.23, 99.2, 99.25, 99.22, 99.28, 99.32, 99.22] result = mannwhitneyu(rose, adamw, alternative="greater", method="auto") print (result.statistic, result.pvalue) ``` Mann-Whitney U result: 292.0 0.006515916656300127 Memory overhead (optimizer state relative to parameters): Rose: 0× SGD (no momentum): 0× Adafactor: ~0.5-1× (factorized) SGD (momentum): 1× AdaGrad: 1× Lion: 1× Adam/AdamW/RAdam/NAdam: 2× Sophia: ~2× Prodigy: ~2-3× OpenAI has a challenge in the GitHub repo openai/parameter-golf. Running a quick test without changing anything gives this result: [Adam] final_int8_zlib_roundtrip_exact val_loss:3.79053424 val_bpb:2.24496788 If I simply replace optimizer_tok and optimizer_scalar in the train_gpt.py file, I get this result: [Rose] final_int8_zlib_roundtrip_exact val_loss:3.74317755 val_bpb:2.21692059 I left optimizer_muon as-is. As a side note, I'm not trying to directly compete with Muon's performance. However, a big issue with Muon is that it only supports 2D parameters, and it relies on other optimizers such as Adam to fill in the rest. It also uses more memory. One of the biggest strengths of my Rose optimizer is the extremely low memory use. Here is a more detailed look if you're curious (warmup steps removed): [Adam] text world_size:2 grad_accum_steps:4 sdp_backends:cudnn=False flash=True mem_efficient=False math=False attention_mode:gqa num_heads:8 num_kv_heads:4 tie_embeddings:True embed_lr:0.05 head_lr:0.0 matrix_lr:0.04 scalar_lr:0.04 train_batch_tokens:16384 train_seq_len:1024 iterations:200 warmup_steps:20 max_wallclock_seconds:6
View originalHit 5h limit usage and didn't even run the first prompt of the day
This is the fist time I hit 5h limit usage without actually even running a single prompt. The time difference between the last 2 sessions is more than 17 hours, and it was the very first prompt of the day. I am on Pro plan and My setup only has 3 mcp: xcode, context7, claude-mem. Could it be caused by the claude-mem where it loads more than it should into the context, but then if it does, it defeats the purpose of it. PS: I am using light mode terminal, do NOT judge me. EDIT: I used /resume command just to show the time difference between the conversation. The very first prompt of the day was started on a new conversation. I am using the claude-mem precisely to help with memory and context, so I can start a task on a new conversation. Task done conversation is forgotten in my mind. EDIT2: Found a most likely root cause, somehow I screwed up the claude-mem hooks which caused it to spawn 3 hooks at the same time whenever I start the claude and when prompt was being sent, aka instantly tripled my token usage. submitted by /u/SirPrimgles [link] [comments]
View originalMem uses a tiered pricing model. Visit their website for current pricing details.
Mem has an average rating of 2.3 out of 5 stars based on 3 reviews from G2, Capterra, and TrustRadius.
Key features include: Intelligent note organization, Automatic tagging and categorization, Real-time collaboration, Searchable note database, Cross-device synchronization, Markdown support, AI-powered content suggestions, Customizable templates.
Mem is commonly used for: Personal knowledge management, Project planning and tracking, Meeting notes and action items, Research organization, Content creation and brainstorming, Team collaboration on documents.
Mem integrates with: Google Drive, Slack, Notion, Trello, Zapier, Microsoft Teams, Evernote, Dropbox, Calendly, Asana.
Sentdex
Creator at Python & AI YouTube
2 mentions
Based on user reviews and social mentions, the most common pain points are: llm, API costs, token usage, large language model.
Based on 131 social mentions analyzed, 18% of sentiment is positive, 73% neutral, and 9% negative.