★ Industry vertical · EdTech AI06 service itemsKid-safe by default

INDUSTRY · EDTECH AI · K-12 + AFTER-SCHOOL

AI tutors kids actually learn from.

Production AI for K-12 platforms, after-school tutoring apps, university copilots, and learning operators. Teach-back loops that force active recall, voice-first explain-it pipelines with sub-five-second feedback, kid-safe guardrails by default, and a parent dashboard that turns a sticker chart into evidence.

Start a project →View case studies →

Industry

EdTech · K-12 + tutoring

Safety

COPPA-aware · kid-safe

Voice

STT + TTS · <5s round-trip

Stack

Direct LLM · evals-first

[WHAT WE HEAR FROM EDTECH FOUNDERS AND CURRICULUM LEADS]

Three pains every edtech-AI team hits.

We have shipped against this shape with K-12 learning apps, after-school tutoring platforms, university student copilots, and corporate L&D operators. The shape repeats.

The fix is not a more polished avatar. The fix is the eval suite underneath, the voice pipeline that does not break for accent-heavy speech, and the parent visibility that turns parents from skeptics into evangelists.

PAIN · 0101 / 03

AI tutors hallucinate facts kids remember wrong.

A drilled fact a six-year-old learns becomes a foundational belief. An LLM that confidently says the wrong thing once creates a misconception that costs years to unlearn. The eval bar is unforgiving.

↓ How we fix it, below.

PAIN · 0202 / 03

Parents can't see if learning is happening.

Sticker charts and 'time on app' are not learning. Parents pay for results and need to see what their kid mastered tonight, in plain language, with evidence the AI is not making it up.

↓ How we fix it, below.

PAIN · 0303 / 03

Multilingual + regional curriculum is a sales blocker.

Markets outside English-first geographies want the tutor in their language, on their curriculum, with their grade-level pedagogy. Most AI tools fail this on day one and lose the deal.

↓ How we fix it, below.

[SIX SERVICE ITEMS · ONE TEAM]

Pick the edtech-AI problem.
We'll bring the build.

Eight-week engagements, kid-safe by default, eval-gated releases, and a deployment shape your COPPA reviewer can sign off on.

SERVICE · 01 / 06Teach-back core

AI tutor engine

A Feynman-method teach-back loop that forces active recall. Kid explains, AI evaluates, kid refines, mastery scored.

Five-step loop: Learn It > Explain It > Feedback > Refine > Master
Grade-level + subject-tuned prompts in a versioned registry
Per-child knowledge model that compounds across sessions

AnthropicPostgreSQLTypeScriptVoltAgent

SERVICE · 02 / 06Accuracy + safety

Eval suite for educational AI

Curriculum-validated golden sets, hallucination scoring on facts kids will remember, safety red-team on every release.

Golden set per grade and subject, curriculum-aligned
Hallucination + age-appropriateness + tone scoring
Red-team prompt sets for kid-safety regression

LangSmithPromptfooPythonClickHouse

SERVICE · 03 / 06Sub-5s round-trip

Voice-first STT + TTS pipeline

Kid explains by voice, gets a written + spoken feedback report in under five seconds. Accent-aware STT, grade-tuned TTS.

Whisper + custom STT post-processing for kid speech
ElevenLabs / Google TTS tuned for grade-school clarity
Round-trip under 5s so a six-year-old does not lose focus

WhisperElevenLabsCloudflare QueuesHono

SERVICE · 04 / 06COPPA-aware

Kid-safe guardrails

Input + output guardrails that block PII, off-topic prompts, and any attempt to socialize the AI as a friend. Tutor only, never chatbot.

PII redaction on every input the child speaks or types
Output filter for age-inappropriate content + tone
Refusal pattern for any out-of-curriculum prompt

TypeScriptZodOpenTelemetryAnthropic

SERVICE · 05 / 06Visibility that converts

Parent dashboard + reports

Per-session learning evidence, weekly summary that names what the child mastered, and weak-area alerts the parent can act on.

Per-session mastery snapshot with cited evidence from the transcript
Weekly summary in parent-friendly language, not curriculum-speak
Weak-area alerts surfaced with a 'practice this together' suggestion

Next.jsPostgreSQLTypeScriptCloudflare

SERVICE · 06 / 06Pre-build wedge

Architecture review

One-week audit of your edtech-AI stack. Eval shape, voice pipeline, guardrail posture, parent-visibility data model — all named in writing.

Eval suite shape: what golden sets to build first
Voice pipeline: STT + TTS providers scored on accent + cost
Parent-visibility data model: what to surface, what to never show

ADRsPostgresWhisperAnthropic

Most engagements bundle two: a tutor build (01, 03) paired with the discipline that keeps it credible (02, 05). Bring the shape closest to your blocker.

Scope your engagement →

Want to see the K-Framework discipline behind every item? Read the K-Framework.

[THE STACK · BY LAYER]

Boring infrastructure. Kids that learn.

Tools that already pass kid-safety reviews. Voice on the edge where latency matters, cloud where flexibility wins.

LAYER · DATA + RETRIEVAL

Data + retrieval.

The store, the index, the search

PostgreSQLpgvectorRedisClickHouseBigQueryOpenSearch

LAYER · MODEL LAYER

Model layer.

Embeddings, providers, fallbacks

OpenAIAnthropicCohere EmbedVoyageLlama (self-hosted)vLLM

LAYER · EVAL + OBSERVABILITY

Eval + observability.

The eval bar, the cost meter, the drift alarm

LangSmithPromptfooOpenTelemetryDatadogGrafanaSentry

LAYER · BACKEND + TRANSPORT

Backend + transport.

Type-safe everything

TypeScriptNext.jsPythonFastAPIgRPCBullMQtRPCZod

LAYER · MOBILE

Mobile.

iOS + Android, native or cross

React NativeExpoSwiftKotlinFCMAPNs

LAYER · CLOUD + DEPLOYMENT

Cloud + deployment.

Whatever your infra already runs

Cloudflare WorkersCloudflare R2AWSGCPVercelFly

✕ WHAT WE DO NOT SHIP

Direct against the model API. Voice + eval gated on every release.

✕ No LangChain
✕ No LlamaIndex
✕ No agent framework
✕ No orchestration vendor
✕ No black-box ML platform

[PROOF · WHAT THE STACK DELIVERS]

Numbers that hold up
in the parent meeting.

MEASURED · WEIGHTED · 2024–2026

LATENCY

<5s

Voice explanation to feedback report

BREADTH

K-12

Curriculum-aligned eval coverage

REACH

BN + EN

Bilingual voice + UI from day 1

SAFETY

COPPA

Kid-safety default for production

EDTECH AI · APPLIED K-FRAMEWORK

Bring the learning problem.
We'll bring the build.

Eight weeks, fixed scope, eval suite + kid-safety review at handoff. Direct LLM engineering on top of the K-Framework. Two Q3 slots remain.

Start a project →Read the K-Framework

CYCLE

8 weeks · problem to kid

OUTPUT

Code · evals · runbook

SAFETY

COPPA-aware by default

AI tutors kids actually learn from.

Three pains every edtech-AI team hits.

AI tutors hallucinate facts kids remember wrong.

Parents can't see if learning is happening.

Multilingual + regional curriculum is a sales blocker.

Pick the edtech-AI problem.We'll bring the build.

Boring infrastructure. Kids that learn.

Data + retrieval.

Model layer.

Eval + observability.

Backend + transport.

Mobile.

Cloud + deployment.

Direct against the model API. Voice + eval gated on every release.

Numbers that hold upin the parent meeting.

Bring the learning problem.We'll bring the build.

Pick the edtech-AI problem.
We'll bring the build.

Numbers that hold up
in the parent meeting.

Bring the learning problem.
We'll bring the build.