Advanced · 2 tracks + 3 capstones

The advanced track

Specialist deep-dives in the dominant agent frameworks (LangChain + CrewAI) and three production-grade capstones. For senior engineers and architects who've finished the curriculum and want portfolio-grade artifacts.

Total time

52 hours

Items

2 tracks + 3 capstones

Prereq

Curriculum M1-M6

Outcome

Portfolio-grade

LC Track · 18h

LangChain Ecosystem Track

When the framework is on the interview

What you'll build

  • 5-stage NetOps text pipeline using only LCEL — beats imperative loops 5-10x via auto-batching
  • 10 NetOps tools converted to LangChain + MCP bridge to your existing servers
  • Hybrid retriever (BM25 + dense + reranker) — beats vector RAG by 10pp on MRR
  • Hand-built ReAct agent as a StateGraph — checkpointer, HITL, streaming all native
  • LangSmith A/B test: two prompt variants, 3 evaluators, statistical-significance comparison
  • LangServe deployment: /invoke, /batch, /stream, /playground, typed client SDK — all free
  • LangMem-powered long-term memory: per-operator preferences across sessions
  • C1 crew ported to a LangGraph Supervisor — matches CrewAI eval within ±5pp

What you walk away with

A migration playbook for your team's wiki: "when LangChain wins, when CrewAI wins, when both." Plus working code for all 9 lessons + the LC1 capstone — the framework-judgment artifact a hiring manager actually wants to see.

CR Track · 18h

CrewAI Ecosystem Track

When sequential isn't enough

What you'll build

  • Lifecycle-instrumented crew with step_callback + task_callback — flame-graph the agent execution
  • Custom BaseTool subclass with caching, retry, Pydantic args_schema
  • Event-driven Flow with @start/@listen/@router — replaces sequential workflows that branch
  • 4-tier memory: short-term, long-term, entity, user — Chroma-backed
  • AgentOps wiring + custom cost-tracking callback — identify the most expensive step in any run
  • VCR-based regression test suite — deterministic replay, CI gate on 10% quality drop
  • FastAPI deployment + multi-stage Dockerfile + Kubernetes manifest
  • Triage→Remediation multi-crew architecture with Flow-orchestrated handoff

What you walk away with

Deep CrewAI mastery beyond the M5 basics. The CR1 capstone: a production-shaped multi-crew system you can demo, plus the memo on "when CrewAI Enterprise SaaS pays off."

Capstone C1 · 4h

MCP-Wrapped GraphRAG

When the retriever needs to be a citizen

What you'll build

  • FastMCP server exposing 4 tools + 2 resources — composite RCA + 3 atomic primitives
  • CrewAI agent crew that consumes the MCP server via langchain-mcp-adapters
  • 20-scenario eval comparing direct Python imports vs MCP vs MCP+cache
  • 1-page memo: "When does our team adopt MCP for retriever access?"

Why it matters

"My retriever is a Python import; no other agent can call it." The C1 deliverable turns your retriever into a tool any agent on any framework can consume. The eval table — same accuracy, ~10x latency cost, swap-frameworks-for-free architectural win — is the artifact a hiring manager understands.

Capstone C2 · 4h

Multi-Tenant LLM Gateway

When five tenants share one model

What you'll build

  • FastAPI gateway with 5-tenant policy matrix — per-tenant quotas, cache namespaces, tier policies
  • Tiered routing: heuristic classifier + tenant policy + automatic failover via circuit breakers
  • 100-operator Locust load test with realistic 60/30/10 Tier-1/2/3 traffic mix
  • Routing-strategy comparison: all_haiku / heuristic / llm_router / all_opus across 20 scenarios
  • 1-page memo: "What's our tier policy for each business unit?"

Why it matters

"Tenant A's cache hit just answered tenant B's question." Multi-tenancy is the production failure mode no one talks about until it bites. This capstone forces you to design isolation boundaries deliberately — auth, quota, cache, routing — and measure the cost-quality-latency triangle for each tenant.

Capstone C3 · 8h

End-to-End NOC Copilot

When the hiring manager wants to see the trace

What you'll build

  • QLoRA fine-tune of Llama-3.1-8B on 5,000 incident summaries — served via Ollama as a local_ft gateway tier
  • Opik traces spanning gateway → CrewAI agent → MCP tool call → backend LLM (one trace ID end-to-end)
  • CI eval gate that runs on every PR — blocks merge on quality regression past threshold
  • Streamlit cost/quality/latency dashboard for the platform owner
  • 8-failure-mode on-call runbook (tier outage, cost spike, drift, eval failure, model gibberish, MCP crash, recurring symptom, spend-cap)
  • 1-page memo: when does this stack make sense, when is it over-engineering, biggest risk

Why it matters

"I have a notebook and no story about cost, drift, or rollback." This is the curriculum's portfolio-grade artifact — the thing graduates demo to hiring managers and architectural review committees. Code shows you can build; the memo shows you can decide; the runbook shows you can operate.

Need a guided path?

If your team wants to run this curriculum as a structured engagement — training cohort, pilot, or assessment — book a 30-min scoping call.

Book a 30-min call →