May 2026 · 8 min read · Fran Olivares, Founder of OlivaresAI
Generic AI writing tools have a tone problem. The first paragraph reads fine; by paragraph three you can hear the ghost of every other AI-written page on the internet — same hedges, same transitions, same closing flourish. The fix isn't a better model; the fix is a memory layer that knows your craft. This guide walks through what writers actually need from an AI memory system, what Alma stores for them, and the workflows that turn a blank-slate chatbot into a continuity collaborator.
Because the model has no idea who you are. Every conversation starts identical to every other writer's conversation: the model defaults to safe, balanced, slightly-formal prose because that's the centre of mass of training data. Without persistent context, asking it to "write in my voice" is a four-line prompt versus thousands of words of actual you. The voice that comes out is a polite approximation, never the thing itself.
Memory changes the equation. If the model has access to thirty examples of how you actually open a chapter, what verbs you avoid, which sentence rhythms you favour, what you consider clichés, who your audience is, the model's draft starts from your centre of mass — not the global one. The first paragraph is on key, the third is on key, the fifth is still on key. Edits become refinements instead of rewrites.
Five categories cover most working writers:
In Alma, voice rules and editorial decisions live as memories with high importance and "preference"/"decision" categories. Project bibles use Alma's environments so each book or article has an isolated brain. Workflow procedures are stored as Alma procedures. The editorial identity itself sits in the Soul Engine as structured blocks: personality, expertise, communication style, hard rules.
Two mechanisms keep continuity tight. First, every chat the writer opens about a given project loads that project's environment — so the model immediately has access to the bible, the prior episodes (compressed summaries of past sessions), and the standing voice rules. There is no "let me catch you up" preamble; the model is already caught up.
Second, automatic extraction runs in the background. As the writer drafts, edits and comments, the extractor identifies new beats, decisions and craft notes ("we agreed the sister character needs a flash of warmth in chapter 7") and saves them as memories. The next session, when the writer opens chapter 8, the warmth-in-chapter-7 decision is part of the assembled context. Continuity isn't manual; it's a side effect of doing the work.
Voice-locked drafting. Writer opens a chat in their book's environment and asks for a 400-word draft of a scene. Memory injects voice rules + project bible + the prior chapter's last beat. The model outputs a draft already in voice; the writer edits in place rather than rewriting from scratch. The edits themselves get extracted as memories ("she changed every 'just' to nothing — note: avoid 'just' as filler").
Continuity check. Writer pastes the latest chapter and asks "is anything inconsistent with the bible?". Memory returns the relevant character/place/timeline records, the model cross-references them with the chapter and reports specific drift ("you said the bar closes at 11 pm in chapter 4; this scene has it open at midnight"). This is the kind of check copy-editors used to be paid for.
Voice transfer. Writer is producing tie-in copy in a different register (newsletter, jacket blurb, social) but wants the underlying voice to read as theirs. Memory injects the same voice rules + audience profile. The model produces output that's tonally aligned even though the form is different.
Multi-project author. A writer with three books in flight uses three Alma environments. Switching between them switches the entire context — the model never confuses the noir thriller's protagonist with the literary novel's narrator. See the broader pattern in the writers use case page.
Three habits matter. First, populate Soul Engine voice rules early; every "no" you save is a future edit you don't have to make. Second, save edits as memories — when you change "just" to nothing, save the rule, not just the edit. The next draft won't have the word. Third, use the writing tools at /writing for targeted passes (humanize, tone-shift, simplify) instead of regenerating drafts; they apply the voice rules without rebuilding the prose from zero.
Yes. Alma is model-agnostic: the memory layer is exposed via REST API, JavaScript SDK and the MCP server, so it plugs into Claude Desktop, Cursor, Windsurf, Claude Code and any custom writing app you build. Many writers use Alma's web app at alma.olivares.ai for drafting and editing, and the MCP server in Claude Desktop for everything else — same memory store, every surface.
Memories are encrypted at rest and scoped per Alma account. Manuscripts and bible content live in your own private memory store; they are not used to train any model and are exportable or deletable on demand from the account settings. See the privacy-first AI memory post for the controls in detail. For a working writer, the practical bar is that no external party reads what you haven't published — and that bar is met by design.
The Starter plan ($14/month) at olivares.ai/pricing covers a complete writing-day setup: voice rules, a project bible and episodes from your last few sessions, with enough memory headroom to feel the difference from the first session. From there you can graduate to Pro or Max for more memory headroom, the writing tools studio at /writing and BYOK for your own model API keys.
Related reading: Writers use case · Soul Engine: AI Personality That Sticks · Persistent Memory for AI: Complete 2026 Guide · Three-Layer Memory Architecture · Environments documentation.