Applied AI Research Est. 2025 // Research · Education · Agents

From the frontier of research, into systems that ship.

TravisML is an applied AI research, education, and agent-development company. We track the field as it moves, reproduce what's real, and turn it into research, courses, and agentic products.

Now building: an agent control plane
What we do / 01

Three pillars, one research bench.

Research / 02

Our own research, and the field's.

We work in both directions: we conduct original research — including published, citable work — and we track the field as it ships, reproducing what matters and evaluating what's real. What survives gets engineered into products, taught in courses, or both.

01

Agentic systems & autonomous workflows

Agents that decompose intent, call tools, recover from failure, and operate within bounded autonomy. Built on contemporary frameworks where they help and from first principles where they do not.

Tool-using agentsMulti-step reasoningBounded autonomy
02

Custom skills & self-evolving agents

Agents whose capability set grows over time: skill registries, automatic skill induction from usage, and the guardrails that keep self-modification from becoming a liability.

Skill librariesContinual learningCapability evolution
03

Shared & persistent memory architectures

Memory that is durable, queryable, and shareable across agents and environments: hierarchical stores, episodic and semantic layers, conflict resolution, and safe access patterns.

Cross-environment memoryEpisodic / semanticMemory governance
04

Retrieval & RAG systems

Retrieval pipelines that survive contact with real data: chunking strategy, embedding selection, hybrid search, reranking, and the evaluation harness that tells you when retrieval is silently failing.

Hybrid retrievalRerankingEval harnesses
05

Fine-tuning & model evaluation

Supervised fine-tuning, preference tuning, and adapter-based specialization, paired with evaluation infrastructure built to catch regression, drift, and quiet quality decay.

SFT / DPOLoRA / adaptersContinuous eval

What's on the bench right now.

A live cross-section of what the bench is building, evaluating, and publishing right now. Updated continuously. Long-form coverage runs through the newsletter.

BuildingActive

Component ablation for open-weight LMs

A forward-hook toolkit for causal interventions — zeroing or mean-patching attention heads, whole attention and MLP blocks, or entire layers — to measure what each part of an open-weight model actually contributes. Built for single-node runs on NVIDIA DGX Spark.

InterpretabilityOpen models
ProductionizingIn flight

Fine-tuning small models for local deployment

Specializing 4B-class open-weight models with LoRA, then shipping GGUF quantizations for fast, private inference through llama.cpp.

Fine-tuningQuantization
BuildingCycle 1 · private

An agent control plane

A single registry and canonical manifest for agentic systems, with adapters that normalize agents across frameworks and managed credential, MCP, and policy bindings. Private — in development.

Agent opsMCP
BuildingActive

Vendor-agnostic agent scaffolding

An async-first framework for building LLM agents against any provider via LiteLLM — tool calling, pluggable memory, and an OpenAI-compatible server out of the box.

Agent frameworkTooling
EvaluatingActive

Open-source AI-security tooling, measured

Standing up a realistic, observable agent target and running the major open-source AI-security tools against it — Garak, Promptfoo, llm-guard, and others — to map what each one actually catches.

AI securityEvaluation
PublishedarXiv

Token-level generalization in LoRA adapter backdoors

Our own published research — “Token-level Generalization in LoRA Adapter Backdoors: Attack Characterization and Behavioral Detection” (arXiv:2605.30189) — characterizing trigger-based backdoors in LoRA adapters shared through public hubs and the behavioral methods that detect them without knowing the trigger.

Adversarial MLResearch
Long-form coverage published on our Substack. Read the newsletter
Education / 03

The fastest way to understand a technique is to teach it.

So we do.

TravisML creates professional-grade courses on building with modern AI — agentic systems, retrieval and memory, fine-tuning and evaluation, and shipping models to production. Each one is distilled from work we actually do on the research bench, not from theory.

Courses are in development. They will be announced through the newsletter as they ship.

Agentic systems Retrieval & memory Fine-tuning & eval Production AI
Agents / 04

Agents, built as products.

Our agent work is built as reusable technology — products, tools, and frameworks we own and improve over time — not one-off deliverables. The research bench feeds it; the engineering discipline keeps it shippable.

ProductCore

Production agent runtimes

Agents that plan, call tools, and recover from failure within bounded autonomy — packaged as reusable runtimes rather than bespoke scripts.

Tool useAutonomy
ProductCore

Self-evolving skill systems

Skill registries and induction pipelines that let an agent's capabilities grow safely over time, with guardrails on self-modification.

SkillsContinual learning
ProductCore

Shared memory infrastructure

Durable, queryable memory shared across agents and environments — the layer that lets multi-agent systems stay coherent.

MemoryMulti-agent
ProductCore

Evaluation & tooling frameworks

Harnesses and developer tooling that make agent behavior measurable and reproducible before it ever reaches production.

EvalTooling
ProductCore

Policy & authorization gateway

An intent-classification and risk-assessment layer between an agent's transport and policy evaluation — turning raw tool calls into authorization requests that policies can actually reason about.

PolicyAuthorization
ProductCore

Agent developer harness

A harness for developing, debugging, and evaluating custom agents, tools, MCP servers, prompts, and memory — across Anthropic, OpenAI, or a local model.

HarnessMCP
Need something custom built on this foundation? Work with us
Principal / 05

Travis
// Principal

AI Engineer · Founder, TravisML · Founding engineer, Deepwatch
Certification
AWS MLA-C01
Experience
10+ years engineering
Primary stack
PyTorch · CUDA
Engagements
Selective

Travis is an AI engineer focused on agentic systems, applied machine learning, and the infrastructure that makes them run in production. He was a founding engineer at Deepwatch and is a Principal Engineer at GuidePoint.

He builds production AI — agents, memory, retrieval, and evaluation — with the engineering discipline to run it without surprises. Everything he builds is secure by design.

His writing on AI engineering appears on Substack and HuggingFace. Outside the work, he is an active amateur radio operator and emergency communications builder.

Work with us / 06

Consulting, selectively.

Alongside our own research and products, we take on a small number of consulting engagements. Most start with a short, paid R&D sprint so both sides can decide if it's a fit. If we're not the right call, we'll say so early.

i.
R&D Sprint

A short, focused engagement to read, reproduce, and evaluate a specific technique against your problem. Output is a written brief and a working prototype.

ii.
Project Engagement

A scoped build with a clear deliverable: discovery, design, implementation, handoff. Code, infrastructure, and evaluation harnesses are yours to keep.

iii.
Technical Advisory

Ongoing advisory for engineering leadership shaping AI strategy, evaluating vendors, or making architectural calls with long-term consequences.

Sectors served
Public sector Financial services Healthcare Education Energy Enterprise software Research institutions
Contact / 07

Start a conversation.

Tell us what you're working on. Most replies go out within a business day. If we're not the right fit, we will say so early and where possible point you somewhere better.