Day 7 · Student handout

Onboarding pack and first 30 days

Learners consolidate artifacts into an onboarding pack, first-week question list, milestone plan, and 30-day validation path.

June 2026 Canonical in the 7-day tutorial Full local lesson

Day 7: Onboarding pack and first 30 days

今日目標

把所有產出整理成可以上工用的 onboarding pack。

Onboarding pack

File用途
one-page-domain-brief.md10 分鐘講清楚領域與系統邊界
50-survival-terms.md不再像外行
end-to-end-workflow.md從 audio 到 audit
gateway-governance-memo.md架構與治理主張
red-team-cases.jsonl可測試安全風險
pii-guardrail-demo.mdPII / guardrail 控制
gpu-sizing.csv算力估算
k8s-checklist.md部署可行性
model-inventory.md模型與套件不再忘記
first-30-days-plan.md上工節奏

上工第一天問題

用這些問題取得系統事實:

目前 production stack 是 Python、Node.js、Go、Rust,還是混合?
模型服務用 vLLM、SGLang、Ollama、Triton、TGI、hosted API,還是自架?
客戶部署是 cloud、on-prem、edge、hybrid,比例如何?
資料是否可以離開客戶環境?
目前有沒有 AI Gateway 或統一 model router?
有沒有統一 audit log schema?
RAG 用哪個 vector DB?metadata schema 是什麼?
有沒有 retrieval eval set?
ASR/TTS 用哪些模型?授權條件與限制?
客戶最常抱怨的是 latency、準確率、資料安全、整合速度還是驗收不清?
milestone 驗收標準是功能、準確率、latency、安全、部署、文件還是客戶 demo?

First 30 days

第一週:盤點,不急著重寫。

repo structure
deployment method
model inventory
customer workflows
current logs
current eval method
security / PII policy
demo script
milestone and acceptance criteria

第二週:補最小治理層。

request_id
audit log schema
PII gate
tool registry
latency measurement
model inventory

第三週:補 evaluation / red-team harness。

RAG eval set
ASR eval set
prompt injection tests
PII leakage tests
tool misuse tests

第四週:補部署與客戶交付文件。

Docker / K8s notes
GPU sizing spreadsheet
customer architecture diagram
known limitations
acceptance criteria

弱點補強地圖

弱點一週內補法Evidence
AI red teaming 沒做過做 30-case mini harnessred-team-cases.jsonl, report
PII / guardrail 弱input/output PII gate + audit schemapii-policy-events.yaml
K8s 弱mock inference deployment checklistk8s-checklist.md
GPU sizing 靠經驗weights + KV cache + overhead formulagpu-sizing.csv
tool use 薄typed tool registry + permission + idempotencytool-registry.yaml
real-time TTS latency 印象派timestamp table + p50/p95latency-table.csv
hotwords 未做過domain lexicon + correction audithotword-lexicon.json
RAG 指標不完整hit@k / MRR / citation / abstainrag-eval-plan.md
model inventory 記不住每個模型寫 cardmodel-inventory.md
上工問題模糊first-week fact-finding questionsfirst-30-days-plan.md

20 份公開 source dossier

這些資料不是要全部讀成摘要,而是要抽出可交付 artifact。

#Source你要抽什麼
1McKinsey seven-step problem solving: https://www.mckinsey.com/capabilities/strategy-and-corporate-finance/our-insights/how-to-master-the-seven-step-problem-solving-processissue tree、hypothesis-driven problem solving
2McKinsey PhD-to-consulting / medical-device due diligence case: https://www.mckinsey.com/careers/life-at-mckinsey/our-culture-and-communities/careers-blog/yvonne-apd快速建立產業假設與用 expert interviews 校正
3National Academies, How People Learn, experts vs novices: https://www.nationalacademies.org/read/9853/chapter/5專家如何用 big ideas 組織知識
4VOISS public product page: https://www.voiss.cc/AI Coach public positioning、source tracing、enterprise-specific topics
5VOISS market positioning: https://www.voiss.cc/market-positioning.htmlAI Coach vs RAG/Agent platform, CRM/HRD/LMS integration
6OpenAI Agents SDK guide: https://developers.openai.com/api/docs/guides/agentsagents、tools、handoffs、state 的 runtime 抽象
7OpenAI Agents tracing / observability: https://developers.openai.com/api/docs/guides/agents/integrations-observabilitymodel calls、tool calls、handoffs、guardrails trace evidence
8LiteLLM AI Gateway: https://docs.litellm.ai/docs/simple_proxyunified model gateway、spend tracking、budgets、routing
9LiteLLM proxy architecture: https://docs.litellm.ai/docs/proxy/architecturerate limit、router、fallback、retry lifecycle
10OWASP Top 10 for LLM Applications 2025: https://genai.owasp.org/llm-top-10/prompt injection、sensitive information disclosure、excessive agency tests
11NIST AI RMF: https://www.nist.gov/itl/ai-risk-management-frameworkAI risk governance language
12NIST AI RMF Generative AI Profile: https://www.nist.gov/publications/artificial-intelligence-risk-management-framework-generative-artificial-intelligencegenerative AI risk management actions
13Microsoft AI Red Team: https://learn.microsoft.com/en-us/security/ai-red-team/AI red-team operating model
14Microsoft AI red teaming training: https://learn.microsoft.com/en-us/security/ai-red-team/trainingattack techniques、defense strategies、automated tests
15Microsoft Presidio: https://microsoft.github.io/presidio/PII detection / anonymization SDK
16Presidio supported entities: https://microsoft.github.io/presidio/supported_entities/PII recognizers and custom recognizers
17vLLM optimization and tuning: https://docs.vllm.ai/en/stable/configuration/optimization/KV cache、gpu_memory_utilization、max_num_seqs
18Kubernetes device plugins: https://kubernetes.io/docs/concepts/extend-kubernetes/compute-storage-net/device-plugins/GPU/NIC/FPGA device resources in K8s
19NVIDIA GPU Operator time-slicing: https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/gpu-sharing.htmlGPU sharing and oversubscription tradeoffs
20pyannote.audio: https://github.com/pyannote/pyannote-audiodiarization building blocks: VAD, speaker change, overlap, embeddings
21pyannote.metrics: https://pyannote.github.io/pyannote-metrics/reference.htmlDER/JER and detection metrics
22LlamaIndex retrieval evaluation: https://developers.llamaindex.ai/python/examples/evaluation/retrieval/retriever_eval/hit-rate、MRR、Precision、Recall、AP、NDCG
23Milvus multi-vector hybrid search: https://milvus.io/docs/multi-vector-search.mdhybrid search and reranking
24BreezyVoice: https://github.com/mtkresearch/BreezyVoiceTaiwanese Mandarin TTS and bopomofo control

最終口條

你可以把自己的定位說成:

我的強項是已經做過語音模型調整、RAG metadata/reranker、以及現場問題觀察。
我這週的補強方向是把這些能力升級成 enterprise voice AI system delivery:
AI Gateway、agent governance、PII guardrail、red-team harness、K8s/GPU sizing、
real-time latency measurement、customer acceptance evidence。

我不會把未知包裝成已知。我會把未知轉成 architecture、schema、test case、
latency table、capacity estimate、known limitation、next validation gate。

這才是 enterprise AI architect 的入門姿態:不裝熟,但能快速把混亂問題變成可交付、可驗證、可維運的系統證據。