Welcome. You're here because you've been watching the AI space explode and you're wondering: "Is this for me? Can I make this transition?" The answer is yes — and you're closer than you think.
This is Day 1 of a 100-day journey designed specifically for software engineers who want to move into AI engineering. Not ML research. Not data science. AI engineering — the discipline of building real, production-grade AI systems that solve real problems.
Let's start by mapping the territory.
The Roles: Clearing Up the Confusion
When you look at job boards, you'll see a mess of titles: ML Engineer, AI Engineer, Data Scientist, LLM Engineer, Applied AI Engineer. These overlap but they're not the same thing. Here's the clearest breakdown:
Data Scientist — Primarily analytical. They explore data, run experiments, build models to generate insights. The output is often a report, a visualization, or a prototype model. Heavy on statistics and domain knowledge.
ML Engineer — Closer to engineering, but focused on the full training pipeline. They optimize model architectures, manage distributed training, and handle the complexity of getting a model from research to production. Deep knowledge of PyTorch, CUDA, and ML theory is required.
MLOps Engineer — The DevOps of ML. They build the infrastructure for training, versioning, serving, and monitoring models. Think Kubernetes, MLflow, Seldon, and SageMaker pipelines.
AI Engineer — This is where you're headed. AI Engineers build applications on top of foundation models. You're not training GPT-4 — you're using it (and models like it) to build systems that do something useful: a customer support bot, a document analysis pipeline, a research agent. The primary skills are software engineering, prompt engineering, RAG systems, agent design, evaluation, and deployment.
The key insight: AI Engineering is software engineering applied to LLMs. Your existing skills are a massive head start.
What Does an AI Engineer Actually Do Day-to-Day?
This is the question that separates reality from hype. Here's what a typical week looks like for an AI Engineer at a product company:
Monday morning: You're debugging why the document extraction pipeline is returning malformed JSON for certain edge-case inputs. You add better output parsing and retry logic.
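That Monday-morning fix (tolerant parsing plus a retry that feeds the error back to the model) can be sketched like this. The function names and the retry policy are illustrative, not from any particular SDK:

```python
import json
import re

def extract_json(raw: str) -> dict:
    """Parse JSON from an LLM reply, tolerating markdown code fences."""
    # Strip the ```json ... ``` fences models sometimes wrap output in
    match = re.search(r"```(?:json)?\s*(.*?)\s*```", raw, re.DOTALL)
    text = match.group(1) if match else raw
    return json.loads(text)

def extract_with_retry(call_model, max_attempts: int = 3) -> dict:
    """Re-ask the model (feeding the parse error back) until output parses."""
    feedback = None
    for _ in range(max_attempts):
        raw = call_model(feedback)  # call_model is whatever wraps your LLM API
        try:
            return extract_json(raw)
        except json.JSONDecodeError as err:
            feedback = f"Your last reply was not valid JSON ({err}). Return only JSON."
    raise ValueError(f"No valid JSON after {max_attempts} attempts")
```

In practice you'd also log each failed attempt so you can inspect which inputs trigger malformed output.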
Monday afternoon: Meeting with product to scope a new feature: the support agent needs to handle multi-step workflows — checking order status, issuing refunds, and escalating to humans. You sketch the agentic architecture.
Tuesday: Implementing the workflow agent. You define tool schemas, wire up the ReAct loop, and add guardrails so the agent can't take destructive actions without confirmation.
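A tool schema plus a confirmation guardrail might look like the sketch below. The dict follows the common OpenAI-style function-calling shape, and `issue_refund` / `execute_tool` are hypothetical names for this example:

```python
# An OpenAI-style tool schema: a plain dict describing the function's
# name, purpose, and parameters so the model knows how to call it.
ISSUE_REFUND_TOOL = {
    "name": "issue_refund",
    "description": "Refund an order. Destructive: requires human confirmation.",
    "parameters": {
        "type": "object",
        "properties": {
            "order_id": {"type": "string"},
            "amount_usd": {"type": "number"},
        },
        "required": ["order_id", "amount_usd"],
    },
}

DESTRUCTIVE_TOOLS = {"issue_refund"}  # actions the agent must never take alone

def execute_tool(name: str, args: dict, confirmed: bool = False) -> str:
    """Run a tool call, blocking destructive tools until a human confirms."""
    if name in DESTRUCTIVE_TOOLS and not confirmed:
        return f"BLOCKED: '{name}' needs human confirmation before running."
    return f"OK: ran {name} with {args}"
```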
Wednesday: Running evals. You have a test set of 50 questions with expected answers. You're measuring whether your latest prompt change improved accuracy or hurt it. You write a script to automate this.
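A minimal version of that eval script, assuming a `model_fn` you supply and a simple substring check as the grader (real evals usually use stricter matching or an LLM judge):

```python
def run_eval(model_fn, test_set):
    """Score model_fn against (question, expected_answer) pairs.

    Returns (accuracy, failures) so you can diff runs before and after
    a prompt change and inspect exactly which cases regressed.
    """
    failures = []
    for question, expected in test_set:
        answer = model_fn(question)
        if expected.lower() not in answer.lower():  # crude containment grader
            failures.append((question, expected, answer))
    accuracy = 1 - len(failures) / len(test_set)
    return accuracy, failures
```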
Thursday: The agent is making too many API calls and costs are exploding. You implement caching, reduce context window usage, and add cost tracking. You write a postmortem doc.
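The Thursday fixes (caching plus cost tracking) can be combined in one small wrapper. The flat per-call cost here is a simplification for the sketch; real billing is per token:

```python
import hashlib

class CachedLLM:
    """Wrap a model call with an exact-match cache and a running cost tally."""

    def __init__(self, call_model, cost_per_call_usd: float):
        self.call_model = call_model
        self.cost_per_call_usd = cost_per_call_usd  # assumed flat rate for the sketch
        self.cache = {}
        self.spent_usd = 0.0

    def ask(self, prompt: str) -> str:
        key = hashlib.sha256(prompt.encode()).hexdigest()
        if key not in self.cache:
            self.cache[key] = self.call_model(prompt)
            self.spent_usd += self.cost_per_call_usd  # only uncached calls cost money
        return self.cache[key]
```

Exact-match caching only helps for repeated identical prompts; semantic caching (matching on embedding similarity) is the next step up.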
Friday: Code review, documentation, and a deep-dive on a new paper about using small language models (SLMs) for classification tasks that don't need a full frontier model. You prototype a new approach.
Notice what this week doesn't include: training neural networks from scratch, writing CUDA kernels, or deriving backpropagation. That's ML Engineering. AI Engineering is about building reliable, efficient, maintainable systems using LLMs as a core component.
The Skills Landscape: What's Actually In Demand
Based on job postings from 2025-2026, here's what companies are actually hiring for:
1. AI Agents & Agentic Workflows — Extremely High Demand
Agents are LLM-powered systems that can plan, reason, and take actions: browse the web, execute code, call APIs, read files. The tooling (LangGraph, AutoGen, CrewAI, Claude Agent SDK) is maturing fast. Building reliable agentic workflows — with tool calling, memory, and human-in-the-loop patterns — is the single most in-demand skill right now.
2. RAG (Retrieval-Augmented Generation) — Very High Demand
The technique of augmenting LLM responses with retrieved context from a knowledge base. Most AI products use some form of RAG. If you can build, evaluate, and optimize retrieval systems, you're highly employable.
3. Evaluation & Testing — High Demand, Often Overlooked
Companies are realizing that "it works in my demo" isn't good enough. Building robust eval pipelines, measuring model quality, and detecting regressions is a critical and underserved skill.
4. SLMs & Model Selection — Growing Fast
Not every task needs GPT-4 or Claude Opus. Small Language Models (SLMs) like Phi, Gemma, and quantized Llama variants can handle classification, extraction, and routing tasks at a fraction of the cost and latency. Knowing when to use a frontier model vs. an SLM is a key engineering skill.
5. LLM API Integration — Table Stakes
OpenAI, Anthropic, Google Gemini, open-source models via Ollama or Together.ai. You need to know how to call these APIs efficiently, handle errors, manage rate limits, and understand token economics.
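Rate-limit handling in particular follows one standard pattern: exponential backoff with jitter. A provider-agnostic sketch, where `RateLimitError` stands in for whatever 429 exception your SDK raises:

```python
import random
import time

class RateLimitError(Exception):
    """Stand-in for the 429 error your provider's SDK raises."""

def call_with_backoff(api_call, max_retries: int = 5):
    """Retry api_call on rate limits, waiting longer after each failure."""
    for attempt in range(max_retries):
        try:
            return api_call()
        except RateLimitError:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the error to the caller
            # 1s, 2s, 4s, 8s... plus jitter so parallel clients don't sync up
            time.sleep(2 ** attempt + random.random())
```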
6. Prompt Engineering — Foundational
Not just writing prompts — understanding why certain prompts work, how to structure few-shot examples, how to use chain-of-thought, and how to make prompts robust to adversarial inputs.
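Structuring few-shot examples usually means interleaving user/assistant turns before the real query. A small builder for the common chat-message format (the function name is illustrative):

```python
def build_few_shot_messages(system: str,
                            examples: list[tuple[str, str]],
                            query: str) -> list[dict]:
    """Build a chat message list: system rule, worked examples, then the real query."""
    messages = [{"role": "system", "content": system}]
    for user_text, ideal_reply in examples:
        # Each example is a fake exchange showing the model the expected format
        messages.append({"role": "user", "content": user_text})
        messages.append({"role": "assistant", "content": ideal_reply})
    messages.append({"role": "user", "content": query})
    return messages

msgs = build_few_shot_messages(
    "Classify sentiment as positive or negative. Reply with one word.",
    [("I love it", "positive"), ("Terrible.", "negative")],
    "Pretty good overall",
)
```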
Your SWE Skills Transfer More Than You Think
Here's the thing that surprises most engineers making this transition: you already know about 60% of what you need.
What transfers directly:
- API integration — Calling LLM APIs is just HTTP with JSON. You've done this a thousand times.
- Python — The entire AI engineering ecosystem runs on Python. If you've been writing backend code, you're already there.
- System design — Designing an agentic pipeline requires the same thinking as designing any data pipeline. You understand queues, caches, databases, services.
- Testing discipline — Writing evals for LLM systems is just a different flavor of writing tests. The habit of "how do I know this works?" is the same.
- Debugging — Tracking down why an agent is hallucinating is debugging. The tools are different but the mindset is identical.
- Code quality — AI systems go to production too. They need error handling, logging, retry logic, graceful degradation. All skills you have.
- Database knowledge — Vector databases are a new flavor of a concept you already understand.
- Deployment — Docker, environment variables, health checks, logging — you know this. AI services are services.
What you need to learn:
- How LLMs actually work (enough to use them well, not to build them)
- Prompt engineering patterns
- Retrieval systems and embeddings
- Agent architectures (ReAct, tool calling, memory)
- Small language models and when to use them
- LLM-specific evaluation techniques
- Cost management and token optimization
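To make "embeddings" concrete before you get there: retrieval mostly reduces to comparing vectors, typically with cosine similarity. A dependency-free sketch using toy vectors (real embeddings have hundreds or thousands of dimensions):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)
```

A RAG system embeds the query, computes this score against every stored chunk (a vector database does it at scale), and feeds the top matches to the LLM.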
This is learnable. It's not a decade of PhD-level math. It's a focused set of practical skills, and 100 days is enough to get genuinely good at it.
What This 100-Day Journey Covers
Here's the full map of what you'll build and learn:
Phase 0 (Day 1): Getting Started
Career map, understanding the ecosystem. This is where you are now.
Phase 1 (Days 2-26): LLM Foundations
How LLMs work, calling APIs, prompt engineering, structured output with Pydantic. You'll build your first data extraction pipeline.
Phase 2 (Days 27-40): External Knowledge
Embeddings, vector databases, RAG systems. You'll build a chatbot that answers questions from documents.
Phase 3 (Days 41-54): Single Agents
Tool calling, ReAct loops, LangGraph state machines, memory and persistence. You'll build an autonomous research agent.
Phase 4 (Days 55-81): Multi-Agent, Eval, and Security
Multi-agent architectures, evaluation pipelines, prompt injection defense, human-in-the-loop. You'll build a multi-agent content pipeline.
Phase 5 (Days 82-97): Production
FastAPI, streaming, Docker, deployment, monitoring, cost tracking. You'll deploy your capstone to the real internet.
Phase 6 (Days 98-100): Career Launch
Portfolio review, resume updating, job search strategy, community engagement.
By the end, you'll have 5 capstone projects in your portfolio and the practical skills to back them up in interviews.
The Skills Audit: Where Are You Right Now?
Before we start, let's get honest about where you are. Rate yourself 1-5 on each area (1 = never heard of it, 5 = could teach a workshop on it). Be honest — this isn't a test, it's a baseline.
Software Engineering Foundation
- Python proficiency: ___/5
- REST API integration: ___/5
- Async programming: ___/5
- Docker & containerization: ___/5
- SQL / database work: ___/5
- Testing & debugging: ___/5
LLM & Prompting
- Using ChatGPT/Claude in browser: ___/5
- Calling LLM APIs programmatically: ___/5
- Prompt engineering techniques: ___/5
- Structured output / JSON mode: ___/5
- Token economics (costs, limits): ___/5
Retrieval & Knowledge
- Embeddings / vector similarity: ___/5
- Vector databases (Chroma, Pinecone, etc.): ___/5
- RAG pipeline design: ___/5
- Document chunking strategies: ___/5
Agents & Orchestration
- Tool/function calling: ___/5
- ReAct agent pattern: ___/5
- LangChain / LangGraph: ___/5
- Multi-agent coordination: ___/5
Evaluation & Production
- LLM evaluation methods: ___/5
- Prompt injection / security: ___/5
- Deploying ML/AI services: ___/5
- Monitoring & observability: ___/5
Scoring (23 skills, max 115):
- 0-45: You're starting fresh on the AI side. That's fine — your SWE skills will accelerate everything.
- 46-80: You've dabbled. This journey will fill in the gaps and add depth.
- 81-115: You have real experience. Use this journey to fill blind spots and build portfolio projects.
Write down your scores somewhere. You'll revisit this on Day 54 (the career checkpoint) and Day 100. The progress will surprise you.
The Mindset Shift
One last thing before we dive in. There's a mindset shift that separates engineers who struggle with this transition from those who thrive.
Old mindset: "I need to understand everything before I build anything."
New mindset: "I'll build something imperfect, understand it through use, and improve it."
LLM systems are probabilistic: the same prompt can produce different outputs. You can't read the source code of GPT-4, and you can't fully predict what a prompt will do in every case. This is uncomfortable for engineers trained on deterministic systems.
The solution is not to wait until you understand everything — it's to build evaluation systems that tell you when things go wrong, and to treat prompt engineering like an empirical science: hypothesis, experiment, measure, iterate.
The other mindset shift: This field moves fast. What's cutting-edge today is standard practice in 18 months. Don't try to master every tool. Instead, develop the ability to pick up new tools quickly, understand the underlying patterns, and evaluate what's worth learning vs. what's hype.
You already have this skill. It's the same skill that lets you learn a new programming language or framework. You're not starting from zero.
Today's Action Items
- Complete the skills audit above. Save it somewhere you can find it later.
- Set up your environment (we'll cover this in detail in Day 2, but if you're eager: Python 3.11+, an OpenAI API key, and a code editor).
- Read the OpenAI pricing page. Understand what tokens are and roughly what API calls cost. This will inform every decision you make.
- Optional but recommended: Browse LinkedIn and look at 10-15 "AI Engineer" job postings. Read the requirements. Notice what comes up repeatedly. Notice what you already know.
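For the pricing action item, the arithmetic is simple enough to sketch now. The prices below are placeholders, not real rates; always check the provider's live pricing page:

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      price_in_per_m: float, price_out_per_m: float) -> float:
    """Rough cost of one call; prices are quoted per million tokens."""
    return (input_tokens * price_in_per_m + output_tokens * price_out_per_m) / 1_000_000

# Illustrative numbers only -- real prices change; read the pricing page.
per_call = estimate_cost_usd(2_000, 500, price_in_per_m=2.50, price_out_per_m=10.00)
daily = per_call * 10_000  # $0.01/call becomes $100/day at 10k calls
```

This kind of back-of-envelope math is exactly what drives decisions like "route this task to an SLM instead."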
See you on Day 2.
Next up: Understanding Transformers — How LLMs Actually Work (API keys, SDKs, local models, and your first API call)