Profile-Driven Co-Pilot
Switch profiles, not tools. Embedded, Aerospace, IoT, Automotive, Web — one tool purpose-built per discipline. Onboards once, remembers your stack, standards, and decisions across every session.
Octobus remembers your codebase, walks the work from requirements to a tested PR, and proves it on real hardware — for embedded, aerospace, IoT, and any software project. Runs on the LLM you already pay for. Your source code never leaves your laptop.

$ npm install -g @agenit/cli · works with your existing LLM auth · free during early access
Your standards, decisions, and naming — recalled on every session.
Idea to a tested PR, walked stage by stage with approval gates you control.
J-Link · CAN/LIN · Saleae — flashed, captured, and decoded in the loop.
Local-first by default. Your LLM, your machine, your contract.
Octobus speaks to the same probes on your desk. It flashes the board, streams RTT logs, decodes the CAN bus, captures the logic analyzer — then writes the fix, the test, and the traceability that proves it works on real hardware, not just in a chat window.
Of bus + log events on the bench are captured, decoded, and linked to a test — automatically.
Reproduce on hardware, fix MISRA-clean, add the HIL regression, and ship the evidence pack — in one uninterrupted run.
Illustrative · pilot teams in automotive embedded.
You already use an AI. Octobus is the layer around it that makes it useful for real software — the kind that ships, gets tested, and ends up in production.
Not a chat. It writes down your style, your decisions, and the names you use, and reads them back into every session. No more re-explaining.
Walks Plan → Code → Tests → Audit. Stops at approval gates you can move through one click, or auto-approve when the change is small.
Use the one you pay for: Claude, Gemini, GPT-4, or a local Ollama box. Same workflow, same skills — just point at your endpoint.
Your source never leaves your laptop. Only the AI's instructions stream in, get used, and get wiped when you're done.
Cheap helper models read the codebase in parallel; one capable model writes the answer with that context already in hand. Faster and cheaper.
Embedded, Aerospace, IoT, Automotive, Web — switch profile, same product. Hardware engineers get J-Link / CAN / Saleae / GDB built in.
| Capability | Copilot | Cursor | Claude Code | Octobus |
|---|---|---|---|---|
| Memory across sessions | — | limited | session | plaintext markdown · git-diffable |
| Bring your own LLM | OpenAI | few | Anthropic | 6 backends incl. Ollama |
| Code stays on your machine | no | no | yes | yes · plus IP wipe on exit |
| Finishes the job (Plan→Test→Audit) | — | — | manual | /run walks it |
| Approval gates | — | — | — | configurable per stage |
| Hardware (J-Link / CAN / Saleae) | — | — | — | first-class |
| Open & inspectable | no | no | partial | open runner · signed assets |
Most engineering teams already have an LLM. What they don't have is the discipline, memory, and integrations to make it pay back. Here's what Octobus removes from the week.
Illustrative · based on pilot teams across embedded, aerospace, IoT, and regulated SaaS.
Same incident, two worlds. The example below is from an embedded team — but the shape is identical for SaaS: a stale on-call, a ticket nobody can root-cause without waking three more people, a fix that lands without tests. Octobus wakes the squad instead of the team.
Most AI dev tools want your codebase on their servers. Fine for a side project — not fine if you work on anything proprietary. Octobus flips it: the tool runs on your machine, the AI's instructions come to you, and they're cleared from disk when you're done. The wire only carries your licence going out and the AI's skills coming in. Your source code? Never moves.
The runner is open and auditable. You can lock it to a specific release and verify the signature.
AI playbooks arrive when a session needs them and never sit on your disk between runs.
Anything we deliver is removed from your machine the moment you close the session.
Point at a local model (Ollama) and a local cache and run fully offline. Nothing leaves your network.
Three lenses on the same shift: time-to-resolve, what ships with each release, and how much throughput a squad gains in a sprint. These are directional averages from early-access teams — not a guarantee, not a marketing number.
Directional · averaged across early-access teams in embedded, aerospace, IoT, and regulated SaaS. Detailed pilot report available under NDA.
Eight pillars that turn an LLM into a disciplined engineering teammate — not a chat window.
Switch profiles, not tools. Embedded, Aerospace, IoT, Automotive, Web — one tool purpose-built per discipline. Onboards once, remembers your stack, standards, and decisions across every session.
ASPICE SWE.1–6, DO-178C DAL A–D, IEC 61508, IEC 62304, OWASP — enforced per your active profile. Five stages, six commands (/swe1–/swe6), one /run that walks them end-to-end with configurable approval gates.
Six backends: Gemini CLI (default, free OAuth), Claude CLI, Antigravity (agy), Anthropic SDK, OpenAI, and Ollama for air-gapped. Octobus picks per config — no vendor lock-in.
Parallel helpers in phases — research (codebase-mapper, pattern-finder, dependency-grapher) → critique (security-reviewer, test-gap-scanner, complexity-grader) → synthesis. Workers on cheap models, planner on capable.
First-class J-Link, CAN/LIN, Saleae Logic 2, OpenOCD, and GDB integration. Arduino and ESP32 as HIL nodes. Supports Zephyr RTOS, FreeRTOS, and bare-metal targets.
Unified marketplace for MCP servers, skills, profiles, and recipes — all developed by Agen-IT. Hardware bridges, safety scanners, ALM connectors, and code intelligence tools. Compose your toolchain without forking the CLI.
Nine-phase SpecKit pipeline takes you from constitution to issues, with PMO standards enforcement and budget tracking built in.
/overnight runs a git-backed Plan → Code → Test → Evaluate loop with worktree parallelism and per-iteration rollback safety.
Most AI tools stop after "here's some code". Octobus keeps going — it writes down what you wanted, decides how, codes it, tests it, and checks that everything ties back. Each step writes a real file in your repo and pauses for a quick Approve.
Under the hood it's the ASPICE V-Model — the same workflow safety-critical embedded, aerospace, and IoT teams use. You don't have to care about that to use it.
ALM (Codebeamer, DOORS, Polarion, Jama), PM (Jira, ADO), CI (Jenkins, GitLab), debuggers (J-Link, Lauterbach, OpenOCD), bus & logic (Saleae, PEAK, Vector), IDEs, and chat — all reachable from one REPL via MCP servers and skills.
Already pay for Claude? Use it. Have a Google Workspace account? Free tier works. Air-gapped lab with an Ollama box? That works too. Flip one line in agenit.toml — the V-Model, the squad, and every skill keep working.
gemini-clidefaultFree tier plus Pro/Ultra Google accounts. Spawns gemini -p ... as a subprocess. No API key needed.
claude-clisubscriptionReuses your Claude.ai Pro / Max / Team / Enterprise login. Trusts the CLI's built-in tools.
antigravity-clinext-genagy · OAuth or APIGoogle's successor to gemini-cli after 2026-06-18. Skills folder layout: .agents/skills/*.md.
anthropic-sdkAPI keyDirect API path for headless CI and machines without the claude binary. Set ANTHROPIC_API_KEY.
openaiAPI keyCross-provider parity. Useful for benchmarking the same prompt across models.
ollamaair-gappedZero outbound traffic. Pair with the local orchestration cache for a fully offline install — no licence check on each run.
[backend]
provider = "claude-cli" # gemini-cli · claude-cli · antigravity-cli
# anthropic-sdk · openai · ollama
worker_model = "haiku" # squad helpers — cheap tier
planner_model = "sonnet" # primary agent — capable tier
[squad]
auto_squad = true
max_concurrent_agents = 4A stateful TypeScript CLI drives your chosen LLM, which delegates to a Python bridge for hardware and heavy parsers.
A mock of the Octobus REPL so you can feel how it works before installing. Type a slash command or anything in free form — outputs are simulated, but the flow is the real one.
A REPL designed like an OS shell — every workflow, integration, and persona is one command away.
Octobus isn't just a code co-pilot. It ships with a Project Management Office layer that plans sprints, tracks budgets, enforces standards on every activation, and reports back without anyone opening a spreadsheet.
/sprint plan turns a goal into stories, slices them into squad-sized tasks, and assigns each to the right agent — with story points and dependencies.
Every goal carries a turn budget and an LLM-token budget. /goal status shows burn-down per stage; Octobus stops at the gate when budget is depleted.
PMO templates (naming, commit style, doc layout, branch policy, license header) are loaded with the profile and enforced via pre-tool policy hooks.
/sprint review compiles velocity, burn-down, requirement coverage, MISRA findings, and per-agent metrics into a single markdown report.
/risk runs an impact analysis against the traceability graph and flags downstream tests and requirements affected by the change.
/report rolls up tasks, approvals, decisions, and budget across projects — exportable to Confluence, Notion, or your SIEM.
Five flavours of extension — Skills (what the AI can do), Plugins (where it lives — VS Code, JetBrains, Slack, CI), Tools (MCP servers for your toolchain), Profiles (full preset bundles), and Recipes (one-shot prompt chains). Install with a slash command. Publish your own.
misra-c-2012 · owasp-asvs-audit · tdd-loop
vscode-repl · slack-jobs · github-actions
codebeamer-mcp · jlink-bridge · jira-issues
aerospace-do178c · iot-zephyr · automotive-ecu
migrate-doors-to-ears · rewrite-as-tdd
You already pay for an LLM. The added value isn't more tokens — it's orchestration, memory, governance, and integration that turn the model into an accountable engineer inside your existing process.
Plug in Gemini, Claude, GPT-4.1, Azure OpenAI, AWS Bedrock, on-prem vLLM or llama.cpp. Same skills, same REPL, your endpoint.
Memory, code, and prompts stay in your VPC or on the developer's machine. No telemetry, no training-on-your-data clauses.
Sign and pin skills, MCP servers, and extensions. PMO templates enforce coding standards and budget at activation time.
Every approval, tool call, and file write is logged. Export to your SIEM. Reproduce any artifact from the memory snapshot.
Prompts go from your machine directly to the LLM endpoint you configured. Octobus never proxies your code or stores your prompts.
Built for teams that can't afford to leak code or trust opaque vendors. Every layer is local-first, inspectable, and consent-gated.
Persona, project memory, and code search index live on disk under your repo or $HOME — never uploaded by Agen-IT.
We orchestrate; we don't proxy. Your prompts go directly from your machine to the LLM endpoint you configured.
Zero analytics by default. If you opt in to anonymous usage stats, the schema is documented and inspectable in the portal.
Lock Octobus to a build, sign skills, pin MCP versions. Nothing updates without your explicit approval.
Run fully offline with a local model (Ollama / vLLM) and local MCP servers — JLink, Saleae, filesystem, git.
Every shell command, file write, and external call passes through a hook chain you control.
Bare-metal C/C++ on any MCU. MISRA C:2012, Unity/CppUTest, coverage — without the ASPICE overhead. FreeRTOS and Zephyr profiles included.
DO-178C / DO-254 evidence packs, DAL A–D gate controls, PSAC traceability, HIL validation loops — profile-enforced, audit-ready.
ESP32 · nRF · STM32 · Zephyr · MQTT. Profile-aware generation for constrained targets: flash budgets, power profiles, OTA signing, BLE.
AUTOSAR / CAN / LIN drivers with ASPICE-compliant traceability from CRS to test report. Bootloader, UDS, OTA — all profiled.
Not embedded? No problem. Web, backend, CLI — the same V-Model discipline, squad helpers, and memory system, without the hardware overhead.
Connect a J-Link, Saleae, or ESP32 and let the squad reproduce flakey signals, bisect commits, and write the regression test.
Auto-generate traceability matrices, MISRA / DO-178C reports, OWASP audits, and a release dossier reviewers actually accept.
Ship internal skills, MCP servers, and PMO templates so every team inherits standards without retraining.
We tried Cursor and Claude Code first. What we kept was Octobus — because we couldn't ship the actual codebase out, and because every other tool forgot what we agreed on yesterday.
Quote attributable on request · pilot under NDA.
Free during early access. Pay only when you'd notice we stopped working. No card needed today.
Final GA pricing may change · all plans local-first by default · LLM token costs are paid to your model provider, not to Agen-IT
The CLI is one npm package. Pick any of the six LLM backends and bring your own auth — Google OAuth, Claude subscription, API key, or a local Ollama box.
# 1. Install the CLI $ npm install -g @agenit/cli # 2. Activate with the JWT you got by email $ agenit activate ~/Downloads/licence.jwt # 3. Pick a backend (Gemini CLI is the default) $ npm install -g @google/gemini-cli && gemini auth login # 4. Walk the V-Model on your project $ cd my-project && agenit [my-project] flow> /run requirements/CRS.pdf → /swe1 requirements.md (12 SWR-NNN extracted) → /swe2 decisions.md (ADRs drafted) → /swe3 decisions.md (component design refined) → /swe4 source + @req tags (MISRA-clean) → /swe5 tests/ (unit + integration + BDD) → /swe6 traceability.json (✓ 100% coverage)
Free during early access. No credit card. Use the LLM you already pay for. We'll email your licence within one business day.