hyungi_document_server

Author	SHA1	Message	Date
Hyungi Ahn	ad3d51e3e0	fix(search): classifier + evidence gate 안으로 이동 (Mac mini 26B race 종결) llm_gate.py docstring 영구 룰: "MLX primary 호출 경로는 예외 없이 gate 획득 필수". PR #20 이후 classifier (Mac mini 26B 신규) + evidence (triage→Mac mini 26B 통합) 모두 gate 외부 실행 — concurrent 안전성 별 검토 명시. 1주 관찰 결과: race 빈번. 본 PR-Hermes-Docsrv-Search-1 Layer 1 fixture 측정: - 8/10 query "conservative_refuse(no_classifier)" — classifier 가 동시 부하 시 거의 모두 ReadTimeout 또는 wait_for(6s) timeout - evidence ev_ms=15005 — synthesis 와 race 로 15s 누적 영향: - ask total 시간 증가 (parallel race → serialized): query_analyzer 5s + classifier 3-5s + evidence 5s + synthesis 30s ≈ 40-45s 상한 (현실 평균) - 응답률 ↑: race timeout 으로 인한 conservative_refuse 해소 - 사용자 체감: 빠른 거절 → 의미있는 답변. 단 대기 시간 ↑ 후속: - skill `docsrv_ask` curl `--max-time 20` → 60s 상향 필요 (별 PR 또는 본 PR 안의 follow-up) - 본 메모리 `2026-05-21 Mac mini 26B 1주 부하 측정` observation 의 결정 outcome: gate 복귀 (triage 별 작은 모델 재도입 옵션은 보류)	2026-05-16 19:54:55 +09:00
Hyungi Ahn	5846baedc7	fix(search): ask classifier wait_for 6s → 15s (outer wrapper override 해소) A1 (LLM_TIMEOUT_MS 5→15→30) + config(10→15→30) 후속 진단: 8/10 fixture query 가 "classifier ok" 또는 "classifier error" 로그 없이 conservative_refuse(no_classifier) 경로. search.py:518 의 outer wrapper `asyncio.wait_for(classifier_task, timeout=6.0)` 가 classifier_service.LLM_TIMEOUT_MS 와 httpx timeout 모두 override. 6s 한계 → 동시 부하 시 거의 모든 classifier 호출 6s 안에 못 끝남 → AsyncIO TimeoutError → ClassifierResult("timeout") → refusal_gate 가 verdict=None 받아 conservative_refuse. 15s 로 상향 — classifier_service 내부 30s 와 align 하지 않은 이유 = ask 응답 시간 상한 유지 (evidence parallel 종료 후 추가 9s 대기 cap). Mac mini 26B 동시 부하 시 실측 elapsed 11-14s 까지 자주 발생 → 15s 가 합리 균형. 본 fix 가 진짜 closure 효과. PR-Hermes-Docsrv-Search-1 Layer 1 fixture 의 8/10 no_classifier 경로 해소 예상.	2026-05-16 19:46:49 +09:00
Hyungi Ahn	a332a8aabe	fix(search): classifier timeout 15s → 30s (concurrent load 2x margin) A1+config(15s) 후속 진단: voice memo PoC plan 호출 elapsed_ms=14432 — 15s 한계 거의 밀착. Mac mini 26B 동시 부하 (classifier + evidence + synthesis 3-way) 시 빈번 ReadTimeout 잔존. 30s 로 2x 마진 확보 — config.yaml + classifier_service.py 양쪽 align. Phase 3.5 guardrail 동작 자체에는 영향 없음 (timeout 시 fallback 경로 동일). 향후 별 트랙 (DS-Mac-mini-26B-Concurrent-Load-1): asyncio.Semaphore 도입으로 Mac mini 26B 동시 호출 제한 vs triage 만 작은 모델 재도입. 본 PR 은 timeout 완화만.	2026-05-16 19:42:49 +09:00
Hyungi Ahn	542b6a0084	fix(search): classifier error log type+repr (empty-msg exception 진단) PR-Hermes-Docsrv-Search-1 Layer 1 fixture 가 classifier error: <빈 메시지> 빈번 발생 보고. isolation 직접 호출은 3/3 성공, 동시 부하 (ask endpoint 의 classifier + evidence parallel) 시에만 발생. Exception type + repr 캡처해서 root cause 식별 (httpx.ReadTimeout / TimeoutError / ConnectionError / 기타 무엇인지). 식별 후 후속 PR (DS-Classifier-Concurrent-Load-1) 에서 본격 mitigation.	2026-05-16 19:08:23 +09:00
Hyungi Ahn	c769ad14ad	fix(search): classifier LLM_TIMEOUT_MS 5s → 15s (Mac mini 26B concurrent load) PR #20 (`f139945`) GPU LLM 제거 후 Mac mini 26B 가 triage + classifier + chat + STT 동시 흡수. classifier_service hardcoded 5s timeout (config.yaml `timeout: 10` 무시) 이 동시 부하 시 빈번 초과 → CIRCUIT_THRESHOLD(5) 누적 → circuit 60s open → verdict=None → refusal_gate conservative_refuse(no_classifier) 경로. 실측: 정상 부하 단독 호출 = 2.3s (500 prompt + 49 completion tokens), 동시 호출 시 ev_ms/synth_ms 가 15s 까지 누적 — 5s 한계가 architectural mismatch. 15s 로 상향 → classifier 정상 verdict 반환 → refusal_gate 가 classifier 의 sufficient/insufficient 사용 (conservative fallback 회피). 본 fix 는 [[2026-05-21 Mac mini 26B 1주 부하 측정]] observation 의 회귀 결과로 자연 정리. config.yaml `classifier.timeout: 10` 와는 별 변수 — 본 1줄은 코드 내 한계, config 항목은 별 PR (Config-Driven-Timeout-1) 에서 통합 검토. 발견 경로: PR-Hermes-Docsrv-Search-1 Layer 1 fixture (curl direct, 10/10 ask) 가 conservative_refuse(no_classifier) 8건 + timeout 2건 보고. fastapi log "classifier circuit OPEN for 60s" + "classifier timeout" 페어 발견.	2026-05-16 19:02:55 +09:00
Hyungi Ahn	19bf5b1e38	feat(memo): Hermes input gateway — source_channel='hermes' + source_metadata jsonb PR-Hermes-Docsrv-Bridge-1 v1. Hermes Agent (Mac mini Discord) 를 Document Server 입력 게이트웨이로 reframe — 코딩 executor X, Claude Code 변동 0. 변경: - migration 267: source_channel enum 에 'hermes' 추가 - migration 268: documents.source_metadata jsonb NOT NULL DEFAULT '{}' 추가 - Document model: source_metadata 컬럼 ORM 매핑 + enum 'hermes' 노출 - MemoCreate: source_channel + source_metadata 필드 수용 (default='memo' 호환) - create_memo: channel allowlist (memo/voice/hermes) + metadata jsonb 저장 - list_memos: IN tuple 에 'hermes' 추가 (inbox 노출) - MemoResponse + _to_memo_response: source_metadata 노출 (UI 배지 준비) LLM 호출 0 — Hermes 의 HTTP POST 만. 분류/요약은 classify_worker 비동기 처리. promote-to-event guard (562/664) 변경 0 — v1 = hermes 메모 promote 차단 유지. plan: ~/.claude/plans/idempotent-seeking-hollerith.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 13:44:15 +09:00
Hyungi Ahn	3627060d2a	fix(ingest): devonagent extract md_status 'ready' → 'success' documents_md_status_check 제약은 {pending/processing/success/partial/failed/skipped} 만 허용. extract_worker 의 web HTML 분기가 'ready' 박아서 CheckViolationError 로 3회 실패. plan/docs/메모리에 'ready' 로 잘못 표기됐던 것 수정. 19668 (첫 sample doc) 검증 중 발견. fix 후 queue 'failed' 행 reset 으로 재실행. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 08:42:15 +09:00
Hyungi Ahn	0cbba0ceeb	feat(ingest): devonagent 트랙 Phase 1 ingest 활성화 DEVONagent/DEVONthink 가 발견한 웹페이지를 NAS Web/ drop → file_watcher ingest → extract 4-tier fallback (trafilatura/sibling-md/readability/bs4) → embed + chunk 까지. classify/preview/markdown SKIP. - source_channel='devonagent' (migration 001 dormant 활성화) - file_watcher: SCAN_TARGETS 통합 + Web/ rglob + canonical_url dedup + sidecar 누락 정책 (skip 안 함, web_meta.sidecar_missing=true flag) - extract_worker: HTML+devonagent 분기 + md_extraction_engine 4-tier 구분 (trafilatura → sibling .md ≥200char → readability+markdownify → bs4_text) - queue_consumer: enqueue_next_stage 의 extract stage 만 source_channel- aware override (devonagent → [embed, chunk]) - classify_worker: devonagent safety skip (law_monitor 패턴 mirror, ai_domain='Web', ai_tags=['Web/{host}']) - requirements: trafilatura/readability-lxml/markdownify 추가 - docs: devonthink-web-bridge.md 설치 가이드 + first-wins 정책 명시 Phase 1 closure 기준 = 재료 품질 (검색 가능 + 노이즈율 + dedup + 엔진 분포). 활용처(ai_tldr/digest/PKM 회고)는 1-2주 OR 30-50건 관찰 후 별 PR 에서 결정. Plan: ~/.claude/plans/db-snuggly-petal.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 21:23:16 +09:00
hyungi	118f32f9b1	refactor(ai): PR #20 reframe cleanup — Ollama LLM 잔재 주석 정정 PR #20 (2026-05-14, GPU LLM 제거 + Mac mini 26B MLX 흡수) 의 swap 이 backends.json + 코드 주석/docstring 까지 따라가지 못한 표현 잔재 정리. - app/ai/client.py: AIClient docstring 및 call_triage / call_fallback docstring 의 "4B Ollama" → "Mac mini 26B MLX" / "현재는 triage 와 동일 엔드포인트" → "Claude Sonnet 4 API (PR #20 swap 완료)" - app/core/config.py: triage/primary/fallback 주석 통합 + Phase 3.5 classifier/verifier 주석에 PR #20 endpoint 명시 (history 보존) - app/services/search/{llm_gate,classifier_service,verifier_service, evidence_service}.py: "fallback(Ollama)" / "Ollama concurrent OK" / "triage(4B Ollama)" 표현을 Mac mini 26B MLX endpoint 기준으로 정정 + concurrent 안전성 별 검토 마커 추가 - app/services/digest/summarizer.py: "MLX hang/Ollama stall 방어" → "MLX hang / fallback Claude API stall 방어" - app/services/prompt_versions.py: SUMMARY_TRIAGE_TASK + ASK_PROMPT_VERSION 주석의 "4B Ollama" / "4B gemma Ollama" → Mac mini 26B MLX - app/workers/classify_worker.py: B-1 tier triage docstring 정정 코드 동작 변경 0 (주석/docstring 만). embed_worker / study_question_embed_worker 의 "Ollama bge-m3" 표현은 사실 정확이라 유지. 검증: - ollama list → bge-m3:latest 잔존 (embedding owner) - /api/embeddings probe → 1024-dim 200 OK - fastapi embed/ollama error 0 (last 10min) - document.hyungi.net 200 plan: ~/.claude/plans/4-stateless-dongarra.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 12:09:15 +00:00
Hyungi Ahn	08cf676c26	fix(news): news 문서 chunk stage enqueue 추가 + 7일 백필 스크립트 document_chunks.country 가 7일 분포 기준 99.9% NULL 이었던 root cause = news_collector 가 summarize + embed 만 enqueue 하고 chunk 를 enqueue 하지 않아 chunk_worker 가 news 문서에 한 번도 안 돌고 있었음. queue_consumer.next_stages 의 summarize 키 부재가 follow-up 미연결 원인. news 외 summarize 흐름 부수영향 회피를 위해 next_stages 가 아니라 news_collector RSS/API 양쪽에 chunk enqueue 1줄씩 명시 추가. days_old <= 30 가드 안에서 embed 와 동일 정책. scripts/news_chunk_country_backfill.py — doc 단위 small batch, 실패 doc skip, 50건마다 progress. queue 우회 직접 chunk_worker.process 호출로 timing 통제. Gate (PR closure): A) chunked_doc_pct > 95% 최근 7일 news doc 중 chunk 보유 비율 B) country null_pct < 5% 최근 7일 news chunk country NULL 비율 plan: ~/.claude/plans/7-whimsical-crab.md (PR-News-Prep-Layer-1) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 16:35:53 +09:00
hyungi	5125f82d4a	feat(study): Mac mini derived-worker (PR-MacMini-Derived-Worker-1) GPU = RAG context provider, Mac mini = LLM 가공 공장. GPU 측 변경: - app/api/internal_study.py: GET /internal/study/explanation-context/{qid} Bearer auth, gather_explanation_context + _render_envelope_prompt 재호출. 204=evidence missing, 410=deleted/ready. - app/workers/study_queue_consumer.py: settings.study_explanation_enabled false 시 explanation 분기 skip (status/attempts 미변경, pending 유지 → Mac mini 흡수). - app/core/config.py: study_explanation_enabled + internal_worker_token 2 setting. - app/main.py: internal_study_router include (prefix /internal/study). - docker-compose.yml: fastapi ports → 100.110.63.63:8000 Tailscale bind, STUDY_EXPLANATION_ENABLED + INTERNAL_WORKER_TOKEN env 추가. Mac mini 측: ~/derived-worker/ (별도 push 0, 어제 작성). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 03:13:43 +00:00
Hyungi Ahn	261036c7b2	ops(file-watcher): idle fire 로그 가시화 watch_inbox() 가 new_count/changed_count 둘 다 0 일 때 silent — PR-NAS-Watch-Folder 검증 시 fire 추적 부재 확인 후 보완. else 분기 추가해 매 5min fire 마다 "변경 없음 (idle)" info 로그 한 줄. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 13:32:38 +09:00
Hyungi Ahn	52f86acda7	feat(auth): voice-memo bot 365d access token (PoC v1) bot 계정(`voice-memo-bot`) 한정 long-expiry access token 발급 경로 추가. 일반 사용자 흐름 영향 0 (env gate default false). - core/auth.py: create_voice_memo_bot_token() 신규 (env gate + username hard-match) - api/auth.py: login route 에 bot 분기 (bot 이면 long token 반환, 일반은 기존 흐름) - docker-compose.yml: 3 env (VOICE_MEMO_BOT_TOKEN_ENABLED/_USERNAME/_EXPIRE_DAYS) default false OpenClaw `/voice-memo` plugin → DS `/memos/` Bearer proxy 의 auth 기반. 정식 service-account/api_keys 테이블은 Phase 2 (multi-service 인입 추가 시점). plan: ~/.claude/plans/rosy-launching-otter.md project: ~/.claude/projects/-Users-hyungiahn/memory/project_voice_memo_pipeline.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 12:24:18 +09:00
hyungi	1293c7094a	Merge pull request 'feat/news-tech-ai-sources' (#17 ) from feat/news-tech-ai-sources into main Reviewed-on: #17	2026-05-13 07:54:59 +09:00
hyungi	4b8120d83f	feat(briefing): date picker + 카드별 읽음/하이라이트 액션 사용자 요청 (2026-05-13): - 오늘 briefing 만 보여주고 과거 못 보는 게 아쉬움 → 날짜 선택 UI - 시간대 별 나열은 오히려 불편 → date dropdown 1단계 선택 - 각 카드에 읽음/하이라이트 토글 Schema (migrations 263~266, 단일 statement): - briefing_topics.is_read BOOL NOT NULL DEFAULT false - briefing_topics.read_at TIMESTAMPTZ - briefing_topics.highlighted BOOL NOT NULL DEFAULT false - briefing_topics.highlighted_at TIMESTAMPTZ API (app/api/briefing.py): - TopicResponse 에 id / is_read / read_at / highlighted / highlighted_at 추가 - GET /api/briefing/dates → 사용 가능 날짜 목록 (60일 cap) · briefing_date / total_topics / total_articles / status / read_count / highlighted_count - PATCH /api/briefing/topics/{id}/read body {value: bool} → 읽음 토글 - PATCH /api/briefing/topics/{id}/highlight body {value: bool} → 하이라이트 토글 - 토글 시 *_at 컬럼 자동 설정/NULL UI (frontend/src/routes/news/+page.svelte): - 헤더 우측 <select> date dropdown — 최신 + N일치 (highlighted_count 별 표시) - 선택 시 /api/briefing?date=… 로 해당 날짜 briefing 로드 - 카드 우측 상단 ★ (하이라이트) + 읽음 버튼 - 하이라이트 = Card class ring-2 ring-yellow-400 - 읽음 = 외부 div class opacity-60 (시각 차분화, 펴기 가능) - 토글 즉시 PATCH 호출 + 로컬 state 갱신 each key topic.topic_rank → topic.id 변경 (이미 unique).	2026-05-12 22:05:06 +00:00
hyungi	1d3d61d31e	fix(briefing): lower clustering threshold 0.78 → 0.70 배포 후 관측 결과 (2026-05-13 새벽): - 126 docs / 7 countries 인데 THRESHOLD=0.78 로 raw_clusters=124, dropped_min_articles=122, kept=1. - 거의 매 article 이 별 cluster 로 갈려 토픽 묶음 실패. - 같은 cron 어제 (5/12) 는 101 docs 에서 6 topics 성공 — 그날 뉴스가 우연히 같은 토픽으로 더 모인 case. 수동 측정 (5/13 동일 docs): - 0.78 → kept=1 - 0.70 → kept=5 (allowed) 영구 변경 = THRESHOLD=0.70. cross-country 필터 (MIN_COUNTRIES≥2) + min_articles(≥2) 그대로 유지하므로 noise topic 위험은 제한적. 원본 주석 (0.75~0.80 중간값) 도 갱신.	2026-05-12 21:44:00 +00:00
hyungi	2dbbeac1c7	fix(daily_digest): cast today to date object for KST comparison 매일 20:00 KST cron fire 시 fail: UndefinedFunctionError: operator does not exist: date = character varying 원인: today 가 strftime("%Y-%m-%d") 로 string, func.date(created_at) 가 date 타입. PostgreSQL 가 date = string 비교 거부. Fix: today = datetime.now(ZoneInfo("Asia/Seoul")).date() — date 객체로. KST 기준은 scheduler cron 이 KST 20:00 에 fire 되므로 자연 일치. scope: app/workers/daily_digest.py:24	2026-05-12 21:30:41 +00:00
hyungi	138f689c98	fix(scheduler): pass KST timezone to all CronTriggers AsyncIOScheduler(timezone="Asia/Seoul") 의 scheduler-level timezone 이 CronTrigger 에 자동 전파되지 않아 6 cron 모두 UTC 로 fire 되던 버그. 영향 (모두 9h 오차): - morning_briefing 의도 05:10 KST → 실제 14:10 KST - daily_digest 의도 20:00 KST → 실제 05:00 KST (다음날) - global_digest 의도 04:00 KST → 실제 13:00 KST - law_monitor 의도 07:00 KST → 실제 16:00 KST - mailplus_morning 의도 07:00 KST → 실제 16:00 KST - mailplus_evening 의도 18:00 KST → 실제 03:00 KST (다음날) Fix: 모든 CronTrigger 에 timezone=KST (= ZoneInfo("Asia/Seoul")) 명시. 검증 (재시작 후): law_monitor next: 2026-05-13 07:00 KST mailplus_morning next: 2026-05-13 07:00 KST mailplus_evening next: 2026-05-13 18:00 KST daily_digest next: 2026-05-13 20:00 KST global_digest next: 2026-05-14 04:00 KST morning_briefing next: 2026-05-14 05:10 KST	2026-05-12 21:30:34 +00:00
Hyungi Ahn	55e39818ec	feat(briefing): register 05:10 KST APScheduler cron 매일 KST 05:10 morning_briefing_run 자동 실행. scheduler timezone=Asia/Seoul 이라 hour=5 minute=10 만 명시. Phase 4 04:00 cron 종료 후 70분 buffer + MLX semaphore 충돌 회피.	2026-05-12 14:54:20 +09:00
Hyungi Ahn	6966be9cf6	fix(briefing): backfill country_perspectives[].article_ids from cluster members LLM 이 article_ids 를 자율적으로 비워두는 케이스 (2026-05-12 첫 briefing 6 topics 모두 빈 list) 를 서버에서 보정. 후처리 정책 (_resolve_article_ids): 1. LLM 이 준 id ∩ cluster member id (엉뚱한 id 차단, hallucination 방어) 2. 비어있으면 같은 country cluster member top weight N 개 자동 주입 3. cluster 안 country 매칭 멤버 0 → [] per-country cap = MAX_ARTICLE_IDS_PER_COUNTRY = 5. weight 내림차순. API 계약 강화: country_perspectives 가 있는 topic 은 article_ids ≥ 1 보장 (같은 country cluster member 존재 시). frontend / 외부 채널 / archive UI 모두 신뢰 가능. tests 3 케이스 추가.	2026-05-12 13:15:26 +09:00
Hyungi Ahn	431d4fe010	feat(briefing): add morning briefing schema + services + api (historical off) 야간 수집 뉴스 (KST 00:00~05:00) topic×country 비교 분석 1페이지 카드. Phase 4 Global Digest 와 코드/로직/테이블 분리, 알고리즘만 services/clustering_common 공유. Backend 신규: - migrations/255_morning_briefings.sql: morning_briefings + briefing_topics (briefing_date UNIQUE, UNIQUE(briefing_id,topic_rank), FK CASCADE, historical_* 3컬럼 nullable, cluster_members JSONB, country_perspectives JSONB, status 4-state success\|partial\|failed\|empty) - app/models/briefing.py: SQLAlchemy ORM - app/services/briefing/loader.py: KST 5h 윈도우 + news_sources prefix fallback (Phase 4 패턴 미러) + historical candidate pool 로더 - app/services/briefing/clustering.py: cluster_global topic-first (LAMBDA=ln(2)/2h, MIN_COUNTRIES_PER_TOPIC=2, MAX_TOPICS=7) - app/services/briefing/comparator.py: call_primary 26B + JSON envelope sanitize (cap perspectives 10 / divergences 3 / convergences 2 / quotes 5) + fallback row 고정 형태 + retrieve_historical cosine top-K - app/services/briefing/pipeline.py: load→cluster→select(K=7,λ=0.6) →historical→compare→status 4-state→delete+insert transaction - app/workers/briefing_worker.py: APScheduler/수동 호출 공용 진입점, 600s hard cap - app/prompts/briefing_comparative.txt: 한국어 비교 분석 JSON 프롬프트, {articles_block} + {historical_block} 2섹션, 인용 금지 라벨 - app/api/briefing.py: GET /latest, GET ?date=, POST /regenerate?date= (admin, sync delete+insert tx, regenerated:true) Backend 수정: - app/main.py: briefing_router 등록 (/api/briefing prefix). scheduler 등록은 PR-3 에서. - app/services/digest/selection.py: select_for_llm 매개변수화 (K, λ caller 주입). Phase 4 동작은 default 값으로 보존. Historical 정책: - BRIEFING_HISTORICAL_ENABLED env flag, default off. - flag off → historical_* 컬럼 모두 NULL, prompt {historical_block} 빈 라벨, retrieval 호출 안 함. - flag on (PR-1b 에서 enable) → cluster centroid 와 과거 30일 doc embedding cosine top-K 5 (sim≥0.70), prompt 에 주입. Country canonical (실측 확인 후): - documents.country 컬럼 부재 확정 - document_chunks.country 매칭률 0% (chunks 자체가 뉴스에 안 만들어짐) - 유일 country 신호 = news_sources prefix 매핑 (Phase 4 와 동일) Tests: - tests/test_briefing_historical.py: 3 경로 회귀 (flag off/on with fixture/on zero match) + sanitize cap + fallback row 형태. Verification: PR-1.8 에서 GPU 컨테이너 pytest + 수동 regenerate.	2026-05-12 12:58:50 +09:00
Hyungi Ahn	1ca6d8b522	refactor(digest): extract clustering helpers to clustering_common Phase 4 Global Digest 의 클러스터링 핵심 알고리즘 (time-decay weight, adaptive threshold, greedy cosine assign + EMA centroid, importance normalize) 을 `app/services/clustering_common.py` 로 추출. country 축은 caller 책임 — Phase 4 cluster_country 는 그대로 country 별 호출, 신규 morning briefing 모듈이 country 없이 cluster_global 로 호출 예정. selection.py 의 중복 _normalize 도 공통 util 로 통일. 동작 변경 0: - LAMBDA / threshold / EMA alpha / MIN_ARTICLES 모두 Phase 4 기본값 유지 - docs.sort (in-place) → sorted (copy) 변경했으나 caller 가 정렬된 docs 를 재사용하지 않으므로 무관 (dict element 의 weight 부여는 reference 라 그대로 반영) 다음 commit 에서 Phase 4 회귀 검증 (digest regenerate diff 0).	2026-05-12 12:38:32 +09:00
Hyungi Ahn	3dc78e4f94	fix(memos): voice memo file_type → 'immutable' (doc_type enum 호환) GPU 서버 main pull 후 /api/memos/?archived=false 가 500 — doc_type enum 에 'audio' 값 없음 (immutable/editable/note 만). list_memos WHERE file_type IN ('note', 'audio') 가 invalid_text_representation. 수정: - voice upload Document.file_type = 'audio' → 'immutable' (기존 audio 컨테이너 인입과 같은 패턴: file_type='immutable' + category='audio' + source_channel='voice') - list_memos 필터에서 file_type 조건 제거 (source_channel IN ('memo','voice') 만으로 분리 — file_type='immutable' 필터는 일반 PDF 까지 끌어옴, 위험) - module docstring + voice upload 주석 업데이트 원본 plan 의 file_type='audio' 결정은 doc_type enum 미확인이 원인. enum 확장(ALTER TYPE ADD VALUE 'audio') 대신 기존 패턴 재사용 — 안전 + 회귀 X.	2026-05-11 12:28:58 +09:00
Hyungi Ahn	6490050b04	feat(memos): promote memo to event + voice memo upload endpoint PR-2B/2C backend 2/2. plan v9 commit 분할 2~3 통합 (memos.py 단일 파일 변경). PR-2B promote-to-event: - POST /api/memos/{memo_id}/promote-to-event — 메모 → events 1-click 승급 · kind 결정: body.kind > documents.ai_event_kind > 400 · activity_log 면 status=done + ended_at=now() 자동 (5초 행동 기록 UX) · calendar_event + start_at 있으면 status=scheduled · Event row + events_history(create) 자동 생성 · memo_document_id 자동 link + source='memo' + raw_metadata 에 AI 추천값 보존 · 한 메모 → N events 가능 (사용자 의도에 따라 dedup 없음) - POST /api/memos/{memo_id}/dismiss-event-suggestion — '그냥 메모' (ai_event_kind='note' 강제) · MVP: AI 추천값과 사용자 확정값 같은 컬럼 (정확도 측정 흐려질 수 있음) · 백로그: user_event_kind 별 컬럼 분리 (plan Memo Intake Upgrade 백로그) - MemoResponse 확장: ai_event_kind / ai_event_confidence / source_channel / file_type / file_path - list_memos 필터 완화: file_type IN (note, audio) + source_channel IN (memo, voice) → voice 메모도 같은 inbox list 에 표시 (사용자 의도: 메모 = 모든 입력의 inbox) PR-2C voice upload: - migration 254: ALTER TYPE source_channel ADD VALUE 'voice' - POST /api/memos/voice (multipart audio + recorded_at + device_hint) · 검증: Content-Type audio/* + size ≤ 50MB + 확장자 화이트리스트 · NAS 저장: /documents/PKM/Recordings/{YYYY-MM}/{uuid}.{ext} · fsync + rename(atomic) 패턴 (NAS soft mount 안전) · Document row: file_type='audio' + source_channel='voice' + category='audio' · enqueue stt 큐 → 기존 stt_worker → classify (PR-2B triage) → embed → chunk · extract_meta 에 device_hint / recorded_at 보존 - 응답: MemoResponse (file_path 포함, frontend audio player 용) 원칙: AI worker 는 events row 직접 생성 X. 본 endpoint 가 사용자 의도 channel.	2026-05-11 12:06:41 +09:00
Hyungi Ahn	63990ac632	feat(memos): add AI event-kind triage fields PR-2B (Memo Inbox Triage) backend 1/2. plan: beszel-tingly-sloth.md 라운드 13. 사용자 비전 = 메모는 inbox, AI 는 triage assistant. AI worker 는 events row 직접 생성 X. Migrations 250–253 (실측 N=250): - 250 CREATE TYPE event_kind_hint AS ENUM (note\|task\|calendar_event\|activity_log\|reference) - 251 ALTER TABLE documents ADD ai_event_kind event_kind_hint - 252 ALTER TABLE documents ADD ai_event_confidence NUMERIC(3,2) + CHECK 0–1 - 253 CREATE INDEX idx_documents_ai_event_kind partial WHERE ai_event_kind IS NOT NULL ORM: - Document.ai_event_kind / ai_event_confidence 컬럼 추가 (Enum SQLAlchemy 동기) - source_channel enum 에 'voice' 추가 (PR-2C 와 호환) Worker: - classify_worker Phase 3 (Gemma 4B triage) 확장 · TriageOutput 에 event_kind_hint + event_kind_confidence 필드 추가 · 4B 응답에 hint 가 있을 때만 Document 에 저장 (enum 외 값은 무시) - prompt p3a_short_summary.txt 확장 — note/task/calendar_event/activity_log/reference 분류 기준 + confidence + default='note' 명시 원칙: AI worker 는 hint 만 제공. events 생성은 다음 commit 의 promote endpoint 에서만.	2026-05-11 12:04:21 +09:00
hyungi	a842dc682e	Merge pull request 'wip/gpu-main-snapshot-2026-05-11' (#7 ) from wip/gpu-main-snapshot-2026-05-11 into main Reviewed-on: #7	2026-05-11 08:11:44 +09:00
Hyungi Ahn	6d71116553	feat(events): PR-2 UI MVP — 4-tab + 빠른 행동 기록 + 상세/생성/이력 plan v6 PR-2 scope. 5초 행동 기록 UX 가 핵심 가설. Backend: - GET /api/events/{id}/history — events_history timeline 조회 (lifecycle op 자동 기록) Frontend (SvelteKit 5 runes mode): - /events 메인 — 4-tab (오늘/Inbox/예정/활동) + 빠른 행동 기록 widget · 단일 입력 + Enter → POST /api/events kind=activity_log · status=done + 시간 default 채워짐 (서버 측) → Activity 탭 즉시 반영 · 새 항목을 list 최상단 prepend (refetch 불필요) · 연속 입력 위해 입력 ref focus 유지 · lifecycle 버튼 (complete/defer/cancel/reactivate) — activity_log 는 lifecycle 대상 X - /events/[id] 상세 — PATCH 허용 필드 edit (title/desc/시간/priority/project_tag) + history timeline · PATCH 금지 필드는 UI 노출 X (status/completed_at/cancelled_at/defer_until 은 별 버튼) - /events/new — kind 선택 (task/calendar_event/activity_log) 후 필드 분기 form · task: due_at + start_at (선택, "14:00 전화" 같은 시각 task 허용 — 라운드 10) · calendar_event: start_at 필수 + end_at + all_day · activity_log: started_at/ended_at 비우면 서버 default now() - Sidebar 메모 옆에 events 진입점 (CalendarCheck icon) API helpers: frontend/src/lib/utils/events.ts (createEvent / logActivity / list* / lifecycle ops / kind&status enum label/color). quickref doc: docs/events_api_quickref.md (이전 commit, PR-2 frontend reference). PR-2 핵심 가설 검증 = 빠른 입력 → 저장 → Activity 즉시 반영 → 새로고침 유지. PR-1 deferred HTTP behavior 5건도 본 UI 의 자연 사용으로 닫힘.	2026-05-11 07:56:31 +09:00
Hyungi Ahn	9d9b3359b0	feat(events): PR-1 Events Core — schema + ORM + 최소 API 개인 운영 로그 / 일정 / 할 일 / 회고용 1차 컨테이너 도메인 신설. plan: ~/.claude/plans/beszel-tingly-sloth.md (라운드 12 v6). Schema: - enum 5종 (event_kind / event_status / event_source / event_actor / history_change_kind) - events 테이블: kind(task\|calendar_event\|activity_log) + lifecycle 7-state status - events_history: lifecycle op 자동 기록, FK RESTRICT (이력은 시점 사실) - CHECK: calendar_event → start_at NOT NULL / activity_log → started_at\|ended_at NOT NULL - partial unique (source, source_ref) — 외부 source dedup (PR-4 활용) - partial index (active status / activity_log timeline) API: - POST /api/events (kind=activity_log shortcut: status=done + ended_at=now() default) - GET /api/events/{id} \| /api/events?kind&status&from&to&project_tag&source - PATCH /api/events/{id} (extra=forbid + 시간 필드 변경 시 reschedule history) - POST /api/events/{id}/{complete,cancel,defer,reactivate} (history 자동) - GET /api/events/today (Asia/Seoul default, deferred 는 defer_until<=now() 만) - GET /api/events/inbox \| /api/events/activity?from&to 제외 (PR-2~5 또는 백로그): - DELETE (회고 데이터 → /cancel 일관화) - log shortcut / upcoming endpoint (POST + GET ?from&to 로 흡수) - /ingest (PR-4 MailPlus forward 시 정확한 요구로 추가) - iCal export / ntfy 알림 / recurrence / 일반 edit history	2026-05-11 07:19:04 +09:00
Hyungi Ahn	aca2f0d62c	feat(canonical): restore GPU STT owner and extend KGS watch paths D9 Track B revised (2026-05-08): 1) STT owner GPU 정식 복귀: - docker-compose.yml: stt-service profiles:[legacy] 제거 → 상시 활성 - fastapi STT_ENDPOINT = http://stt-service:3300 (compose 내부 DNS) - 정책: Mac mini = Gemma 26B 전용 우선이므로 STT/Whisper 는 호출량 무관 GPU 서버 소유. 이전 "Mac mini 이전본" 주석은 trace 오인 기반. 2) KGS Code 등 외부 학습 자료 추가 스캔 경로: - ADDITIONAL_WATCH_TARGETS env (쉼표 구분, PKM 상대경로) - app/core/config.py: additional_watch_targets list 설정 추가 - app/workers/file_watcher.py: 추가 watch path 처리 - app/workers/classify_worker.py: KGS Code 분류 분기 (가스기사 학습 자료) - 모두 expected_category=library 처리 (md/pdf/docx 만) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 05:47:20 +00:00
Hyungi Ahn	8ca27eb573	fix(markdown): img auth via ?token= query param (Authorization header 미지원) `<img src=>` 가 Authorization header 를 못 보내서 /api/documents/{id}/images/{key}/raw 가 401 반환 → 이미지 안 보임. 기존 /file?token= iframe 패턴과 동일하게 access token 쿼리 파라미터로 전달. backend: get_current_user 의존성 제거하고 token 쿼리 파라미터 직접 검증 (기존 /file 엔드포인트와 동일 흐름). frontend: MarkdownDoc 의 swap selector 가 img.src 에 ?token={getAccessToken()} 부여. 로그아웃 상태면 placeholder 유지. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 14:47:09 +09:00
Hyungi Ahn	68fa86ea52	feat(markdown): persist extracted images with auth routes Markdown Canonical Phase 1B.5 — marker 가 추출하던 이미지를 NAS 에 영구 저장하고 DB 메타 + 인증 라우트 + 프론트 swap 까지 wiring. 핵심 변경: - marker-service /convert 응답에 base64 image 리스트 포함 (stateless 유지, NAS write 권한 X) - marker_worker 가 NAS `/documents/extracted_images/{doc_id}/` 에 persist + UPSERT + 고아 row DELETE + md_content ref 를 `docimg:img_NNN` stable scheme 으로 정규화 - /api/documents/{id}/images/{key}/raw 인증 라우트 (Cache-Control private + ETag = content_hash) - frontend MarkdownDoc 가 placeholder card 안의 docimg ref 를 실제 <img> 로 swap 원칙: - 이미지 binary = NAS, metadata = Postgres (학습 섹션 패턴 동일) - image_key sequence 기반 결정적 → 재변환 idempotent - MARKDOWN_IMAGE_PERSIST=false env 로 rollback 가능 (placeholder card 폴백 자연 유지) 기존 28건 marker success 문서는 본 PR 에서 건드리지 않음 — deploy + 신규 업로드 1건 + sample 5건 검증 후 scripts/marker_reprocess_existing_success.py 로 targeted reprocess. plan: ~/.claude/plans/piped-humming-crystal.md Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-10 14:05:41 +09:00
Hyungi Ahn	5b62c59f8a	fix(canonical): marker_worker transport 계층 오류는 transient retry 분류 기존: (ConnectError, TimeoutException) 만 transient → raise → queue retry. ReadError / WriteError / RemoteProtocolError 같은 다른 transport 류는 'except Exception' 이 잡아 _fail 처리 → max_attempts 무시하고 final fail. Phase 1D pilot 에서 5111/5115 두 건이 'Server disconnected without sending a response' (RemoteProtocolError) 로 retry 없이 final fail. Fix: except (ConnectError, TimeoutException) → except TransportError. TransportError 가 Connect/Read/Write/RemoteProtocol/Timeout 의 공통 부모 라서 모든 transport 계층 오류가 transient queue retry 대상이 됨. 5135 의 ReadTimeout (queue exhausted) 는 본 fix 와 별개 — 8.4MB PDF 가 MARKER_TIMEOUT=300s 안에 못 끝나 3번 retry 다 timeout. timeout 자체를 늘리거나 큰 PDF 분할 처리하는 별도 결정 필요. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 15:29:47 +09:00
Hyungi Ahn	5185501bbd	feat(search): PR-RAG-Time-1 freshness decay (news/law_monitor) 뉴스/법령 알림 retrieval 결과에 시간 가중치 soft multiplier 적용. reranker 이후 final score 합성 단계에서 운영 정책 단계로 분리. - news (source_channel='news'): half-life 90일 - law_monitor (source_channel='law_monitor'): half-life 365일 - 비적용: manual / drive_sync / inbox_route / memo / Manual / Reference / Academic_Paper / Checklist / KGS Code / Study / content_origin='ai_drafted' - formula: decay = exp(-ln(2) * age / HL); final = base * (0.7 + 0.3 * decay) - floor 0.7 (완전 demote 금지) - 가드: missing date / future date / unknown source 모두 no-op - 임시 date source: documents.created_at (published_date 컬럼 부재 — 후속 PR) debug 메타 (?debug=true 응답 + logs/search.log): base_score / age_days / decay_factor / freshness_adjusted_score / freshness_policy / freshness_date_source 신규: app/services/search/freshness_decay.py hook: app/services/search/search_pipeline.py:303 (apply_diversity 직후, normalize 직전) schema: app/api/search.py SearchResult.freshness_debug (Optional[dict]) tests: tests/test_freshness_decay.py 24 case (정책 디스패처 9 + age/decay/score 11 + apply integration 6 — guard 1~6 all) Episode/Fact layer 와 contradiction detection 은 본 PR 스코프 외. plan: ~/.claude/plans/pr-rag-time-1-freshness-decay.md	2026-05-03 08:38:09 +09:00
Hyungi Ahn	7d0fca267d	feat(marker): handwritten 자동 skip — Phase 1D pilot 결과 반영 1D pilot (2026-05-02 야간 sweep, 25 controlled_backfill 결과) 에서 필기 PDF 3건 (4798 / 4813 / 4815) 이 status='success' 로 변환됐으나 사용자 quality 평가에서 좋은 자료 추출 불가 판정. 근본 원인은 Marker 설정 부족이 아니라 입력 자체 (애플펜슬 손글씨 + 사용자 글씨체 = OCR/ layout 모델 한계 영역). Marker 튜닝으로 해결될 영역이 아니므로 enqueue 단계에서 자동 skip. 가드 로직: marker_worker.process() 의 doc_type SKIP 직후 (1.5 단계) title/path 의 보수적 키워드 4개 (필기, 손글씨, handwritten, handwriting) 매칭 시 _set_skipped() 호출. md_content/md_content_hash NULL clear, md_extraction_error='skipped: handwritten note (title/path heuristic)', content_origin='extracted'. 키워드 선정 (보수적): 포함: 필기 / 손글씨 / handwritten / handwriting 제외 (false positive 위험): - 노트 (노트북 매뉴얼 / release notes / Note_240528_워크숍 같이 필기 아닌 정상 문서까지 잡음) - scan / 스캔 (스캔 PDF 中 정상 변환되는 케이스 있음, 1D 결과 doc 5127 표준기계설계(KS)_08_핀 density 1.59 / scan_likely 인데 성공) logger: markdown_skip_handwritten_hint id=<id> keyword=<matched> title=<...> regex 단위 테스트 15 케이스 (실 production fastapi venv) 전부 통과: 매칭: Note_240805_용접교육 필기 / Note_240827_필기 / 손글씨 모음 / Handwritten Notes 2024 / handwriting practice / path/필기/* / path/handwritten_collection/* (8건) 비매칭: 다이아프람워크숍 / 노트북 매뉴얼 / Release notes v2 / PIPE FABRICATORS / 표준기계설계 / scan documentation / 스캔 문서 (7건) 이번 가드는 enqueue 시점 적용. 이미 success 인 4건의 md_content 는 보존 (사용자가 직접 보고 싶을 때 표시 가능). 정리 필요 시 별건. 후속 (별 PR): - A2 (정식 doc_type='필기노트' 라벨): 1D 3건 sample 너무 적어 라벨 정의 보류. 필기 PDF 누적 후 별도 검토. - C (Phase 2 풀 backfill plan): 본 PR 머지 후 별도 라운드. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-03 08:11:42 +09:00
Hyungi Ahn	6b52d57bac	feat(study): Phase 4-A explanation_md 길이 cap + prompt 강화 운영 데이터에서 ready 박힌 풀이가 793/838/866자 — 권장 200~400 대비 큰 편. 1차 운영 후 결과 화면 가독성 + 토큰 사용량 통제 위해 prompt 강화 + 저장 전 cap. Prompt (study_explanation_envelope.txt): - explanation_md 권장 300~600자, 최대 900자 명시 - 핵심 개념 + 정답 근거 + 헷갈리는 1~2개 오답만 — 모든 오답 풀이 X - explanation_md 안 줄바꿈 최소화 (parse_json fix 와 결합 — invalid escape 줄임) - LaTeX 수식 자제 — \\circ/\\text/\\, 매크로 가능하면 평문 ('0°C', 'C') - 출력은 raw JSON 한 객체만 — 코드 펜스/thinking/메타 X 강조 Worker (study_explanation_worker.py): - _cap_explanation_md(text, max_chars=1200) 헬퍼 신규 · 1200자 이하 passthrough · 초과 시 마지막 200자 안에서 \\n\\n / \\n / '. ' / '다.' / '요.' 경계 탐색 · 경계에서 자르기 + '…' (단어 중간 자르기 회피) · 경계 못 찾으면 단순 자르기 + '…' - save 전 cap 적용. ai_explanation_status='ready' 유지 (cap 됐다고 failed X) - payload 에 운영 분석 metadata: explanation_len_original / _saved / capped 플래그 검증: - tests/test_explanation_cap.py (6 케이스) · short passthrough / exact at limit / paragraph boundary / sentence boundary · no boundary fallback / empty input - scripts/phase4_health.sql 섹션 8/9 추가 · ai_explanation 길이 p50/p95/max (study_questions.ready) · cap 작동 빈도 (job.payload 의 explanation_capped/_original/_saved) cap 1200 = 800 (4-B summary_md) 보다 여유 — 기사시험 풀이는 공식+오답+개념 묶이면 800 빡빡함. 운영 후 800~1000 으로 조정 검토. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 08:33:18 +09:00
Hyungi Ahn	b3dbf1a11e	fix(ai): parse_json_response — string literal 안만 fix 하는 stateful walker 직전 fallback 의 무차별 newline replace 가 string 외부 (object 구조) 의 raw newline 까지 escape 해서 JSON 거부. 또 LaTeX 수식 (\circ, \text, \, etc) 의 invalid backslash 는 newline 이슈와 별개라 별도 fix 필요. state machine: in_string 토글 (`\"` 만남). string literal 안에서만: - raw LF/CR/TAB → \\n/\\r/\\t 로 변환 - backslash 다음에 valid escape char (\"\\/bfnrtu) 면 그대로 - backslash 다음에 invalid (\\c, \\,) 면 backslash 자체를 \\\\ 로 escape - string 외부 raw newline 은 JSON whitespace 라 보존 운영 데이터 id=243 의 raw 940자에 \\circ \\text \\, \\approx \\times 등 다수 LaTeX + markdown 줄바꿈 → 새 walker 가 두 케이스 모두 fix. 다른 worker (classify/triage/ study_explanation/evidence/study_session_analysis) 자동 혜택. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 08:00:20 +09:00
Hyungi Ahn	95b127fd8d	fix(ai): parse_json_response — raw newline escape fallback (5단계) Phase 4-A debug 결과 study_question_jobs.parse_fail 33건의 raw preview 분석: - 모델이 explanation_md 안에 raw newline (LF) 그대로 박음 ('### [풀이]\n\n**자료...') - JSON 표준상 string literal 안 raw control char 금지 → json.loads 거부 - 4단계 fallback (greedy slice) 도 이 때문에 실패 5단계 fallback 추가: candidate 의 \r\n/\n/\r 을 ``\\n``/``\\r`` escape 로 치환 후 재시도. 이미 escape 된 ``\\n`` (Python str = backslash+n 두 글자) 는 raw newline 아니라 영향 없음. 다른 worker (classify/triage/study_explanation/evidence/study_session_analysis) 모두 같은 파서를 공유하므로 자동으로 혜택. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:56:01 +09:00
Hyungi Ahn	ff41feb3e3	fix(study): Phase 4-A parse_fail 디버깅 — 파서 fallback + raw 저장 운영 데이터에서 4-A study_question_jobs 의 33/114 가 'envelope JSON parse failed' 로 종결. parse_json_response 의 balanced 정규식이 못 잡는 케이스 다수 추정. 원인 분류 위해: 1. 파서 보강 (app/ai/client.py) - 기존 4단계 파싱 (fenced / balanced finditer / 전체 cleaned) 보존 - 5단계 fallback 추가: first '{' ~ last '}' greedy slice → json.loads - envelope JSON 안에 내부 따옴표/뉴라인/escape 때문에 balanced 가 못 잡는 케이스 방어. 모델이 JSON 앞뒤 자유 텍스트 섞어도 본체만 추출. - 회귀 위험 낮은 추가만 (앞 단계 성공 시 즉시 반환) 2. parse_fail 시 raw preview 저장 (study_explanation_worker) - 3개 inline parse_fail 분기 (not_dict / invalid_answer_choice / empty_explanation_md) 모두 _save_raw_preview() 헬퍼 호출 - job.payload.debug_raw_preview = raw_text[:1000] - job.payload.parse_fail_reason = 분류 키 - 향후 parse_fail row 의 payload 분석으로 원인 정확히 분류 가능 다음 단계: 배포 후 재발생 추이 + raw preview 분석 → prompt 추가 강화 또는 parser 추가 보강. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:48:10 +09:00
Hyungi Ahn	8074be6b6d	feat(study): Phase 4-D 운영 관찰 + confidence calibration Phase 4-B v1 첫 검증 결과 자료 부족 토픽인데도 모델이 confidence='high' 박는 케이스 발견. 정의 (high = 자료 + 다른 ai_explanation 으로 패턴 명확) 보다 과신 — UX 신뢰도 위험. 자동 cap 보정 + 운영 관찰 SQL 추가. confidence calibration (services/study/session_summary_guard): - calibrate_confidence(c, ctx_docs_count, ready_explanation_count) 신규 · ctx_docs_count == 0 AND ready_explanation_count == 0 → 'low' cap · ctx_docs_count == 0 (ready 만 있음) → 'medium' cap · ctx_docs_count >= 1 → 모델 값 그대로 - 모델이 정의보다 더 보수적인 값 박은 경우 (모델 'low' + cap 'medium') 는 보존 — 더 보수적인 값을 절대 올리지 않음 worker 적용 (study_session_analysis_worker): - ctx_docs_count = len(ctx_docs) - ready_explanation_count = sum(1 for a in prompt_attempts if a.get('ai_explanation')) - calibrate_confidence 호출 → study_quiz_session_analysis.confidence 박힘 - job.payload 에 운영 분석 metadata 보존: · ctx_docs_count / ready_explanation_count · model_confidence_raw (모델 응답) vs calibrated_confidence (cap 후) · prompt_attempts / valid_attempts_total / summary_len → SQL 4 번 쿼리가 cap 작동 빈도 측정 scripts/phase4_health.sql (신규 운영 점검 SQL 7 섹션): 1. 4-A study_question_jobs status × error_code 분포 2. 4-B study_quiz_session_jobs status × error_code 분포 3. 4-B confidence 분포 (calibrated) 4. 4-B model_confidence_raw vs calibrated 차이 (cap 작동 빈도) 5. 4-A/4-B 최근 7일 처리 지연 p50/p95/max/avg 6. 4-A/4-B skipped 사유 분포 7. 4-B guard_fail / parse_fail / llm_timeout 비율 ship gate (단위 테스트): - test_calibrate_confidence_no_evidence_caps_to_low (3 케이스) - test_calibrate_confidence_only_explanations_caps_to_medium (3 케이스) - test_calibrate_confidence_with_documents_passthrough (3 케이스) - test_calibrate_confidence_normalizes_invalid_first (2 케이스) Plan: ~/.claude/plans/nifty-sparking-spindle.md (Phase 4-B v1 후속) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:33:57 +09:00
Hyungi Ahn	1186537ecf	fix(study): Phase 4-B v1 worker — completed 박을 때 error_code 명시 clear 이전 attempt 가 llm_timeout/parse_fail 박은 후 다음 attempt 가 정상 완료해도 error_code 가 잔존해서 운영 분석 시 혼선. status='completed' 박는 시점에 error_code = None / error_message = None 으로 명시 reset. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:28:43 +09:00
Hyungi Ahn	ea6b2cf351	fix(study): Phase 4-B v1 prompt cap — 큰 세션 LLM timeout 방어 세션 1 (wrong+unsure 84건) 에서 prompt 가 23K자 넘어 30초 timeout. plan 가정 (5~30건) 대로 MAX_ATTEMPTS_IN_PROMPT=30 cap 추가. 가장 최근 attempts 우선 (answered_at asc 정렬의 뒤쪽). 기존 valid_attempts 카운트 검증 (5건 미만 skip) 은 그대로 유지 — cap 은 prompt 입력만, 검증은 전체 기준. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:25:14 +09:00
Hyungi Ahn	6785d53d3d	feat(study): Phase 4-B v1 세션 단위 종합 분석 (자유 마크다운) Phase 4-A 가 wrong/unsure 한 문제씩 풀이 캐시. 4-B 는 세션 전체 wrong/unsure 5~30건을 묶어 200~400자 자연어 요약 1건 생성. 결과 화면 헤더 카드. 큐 인프라는 4-A study_question_jobs 와 분리 — FK 단일 의미 + 운영 SQL 명확성 + 4-A/4-B 가드/payload/재시도 정책 차이. 신규 study_quiz_session_jobs (큐) + study_quiz_session_analysis (결과 캐시 PK=session_id, UPSERT) + 전용 consumer. Backend: - migrations/233 — study_quiz_session_jobs (FK study_quiz_sessions NOT NULL, status pending/processing/completed/failed/skipped, max_attempts=2) - migrations/234 — partial unique idx (session_id) WHERE pending/processing - migrations/235 — study_quiz_session_analysis (session_id PK, summary_md, confidence, model_name, generated_at, is_stale) - models/study_quiz_session_job — ORM + enqueue_session_analysis_job() (멱등) - models/study_quiz_session_analysis — ORM (PK = session_id) - services/study/session_summary_guard — GUARD_PATTERN (정규식) + normalize_confidence() 단일 source, worker + tests 가 import 공유 - services/study/session_summary_rag — gather_session_summary_context() documents 만 (PR-3 _gather_document_evidence 재사용). evidence 없어도 호출 허용 (4-A 와 다른 정책 — 세션 기록 자체가 evidence) - services/study/session_analysis_enqueue — auto (finalize/fallback) + request_session_analysis_regenerate (manual). manual 은 wrong/unsure < 5 즉시 차단, active job 차단, 기존 analysis 있으면 is_stale=true 박기 - prompts/study_session_summary_envelope.txt — envelope JSON {summary_md, confidence}. 정량 정수만 인용 가능, 비율/추세/범위/날짜 금지 - workers/study_session_analysis_worker — terminal status 분기: · wrong/unsure < 5 → status=skipped, error_code=insufficient_attempts · question_text/outcome 부족 → skipped, evidence_missing · GUARD_PATTERN match → failed, guard_fail · 800자 hard cap + confidence normalize · timeout/parse/unknown → 재시도 후보 · UPSERT study_quiz_session_analysis ON CONFLICT DO UPDATE (PK session_id) - workers/study_session_queue_consumer — 4-A consumer 패턴 복제. BATCH_SIZE=1 + STALE_MINUTES=10. MLX gate 4-A 와 공유 (Semaphore(1)) - main.py — APScheduler add_job(consume_study_session_queue, ..., 1분 주기) - session_finalize — 끝에서 enqueue_session_analysis_auto (best-effort) - api/study_topics: · QuizSessionAnalysisOut + ai_session_analysis 응답 필드 (analysis row + 최신 job status/error_code) · GET fallback enqueue (기존 analysis 또는 active job 없으면만, non-blocking) · POST /quiz-sessions/{sid}/regenerate-summary — manual 트리거 Frontend (quiz-sessions/[sid]/+page.svelte): - 결과 헤더에 세션 요약 카드 (AI 풀이 indicator 직후, 바로 할 일 직전) - summary_md 박혔으면 markdown 렌더, 없으면 job_status / error_code 분기: · pending/processing → "AI 가 세션 분석 중" · insufficient_attempts → "오답·모르겠음 5건 미만" · evidence_missing → "자료 부족" · guard_fail → "환각 검증 차단" + 재생성 링크 - confidence='low' 배지 + is_stale "재생성 중" 배지 - 재생성 버튼 + regenerateSummary() — reason 별 toast 분기 ship gate: - tests/test_session_summary_guard_pattern.py — 허용 5 + 차단 7 케이스 + normalize_confidence 표준/비표준 검증. python3 직접 실행 패스. Plan: ~/.claude/plans/nifty-sparking-spindle.md (Phase 4-B v1) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 07:20:29 +09:00
Hyungi Ahn	c7630b9815	feat(study): Phase 4-A 결과 화면 inline indicator — AI 풀이 진척 노출 결과 화면에서 사용자가 [AI 해설 보기] 누를 때 캐시 hit/miss 가 불투명함. 헤더에 한 줄 indicator 추가 — 오답·모르겠음 대상 N건 중 ready 박힌 카운트 + 진행 중/실패/자료 부족 분포. Backend (study_topics.py get_quiz_session): - questions[i].ai_explanation_status 응답에 추가 (q.ai_explanation_status 그대로) · frontend 가 attempts.outcome (wrong/unsure) 와 결합해 카운트 Frontend (quiz-sessions/[sid]/+page.svelte): - $derived aiExplProgress — wrong/unsure attempts 와 question.ai_explanation_status 결합 카운트 (target / ready / pending / failed / skipped) - 헤더에 Sparkles 아이콘 + "AI 풀이 자동 생성: N/M (P%)" 한 줄 · pending > 0: "생성 중 N" (warning 색) · failed > 0: "실패 N" (error 색) · skipped > 0: "자료 부족 N" (dim) · 셋 다 0인데 ready < target: "대기열 처리 대기" (worker 1분 주기 안내) 이 indicator 는 GET fallback enqueue 와 함께 작동 — 결과 화면 진입 시점에 backfill 이 누락된 wrong/unsure 가 이미 enqueue 되고, 1분 주기로 ready 박힘. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 12:15:35 +09:00
Hyungi Ahn	3db5d331de	feat(study): Phase 4-A 운영 가시화 — 통계 대시보드 AI 풀이 카드 Phase 4-A 가 wrong/unsure 풀이를 background batch 로 캐시하는데, 사용자/운영자 입장에서 (1) 지금까지 얼마나 캐시 채워졌는지, (2) 환각 차단/파싱 실패/자료 없음 같은 worker 결과 분포를 볼 수 없었음. 통계 대시보드에 카드 추가. Backend (study_question_progress.py /stats): - StatsAiExplanation 신규 응답 섹션 · status_distribution — 토픽 전체 study_questions.ai_explanation_status 분포 (none/ready/failed/skipped/stale/pending 6 키 default 0) · target_total / target_ready — wrong/unsure progress 의 ready 비율 (캐시 hit 가능성 추정 핵심 지표) · recent_jobs — 최근 7일 study_question_jobs 의 (status, error_code) 분포 ('completed', 'failed:guard_fail', 'failed:parse_fail', 'skipped:evidence_missing' 같은 합성 키) Frontend (/study/topics/[id]/stats): - 신규 Card "AI 풀이 캐시" — Sparkles 아이콘 · 큰 숫자 + 진행률 바: ready / wrong+unsure · 토픽 전체 status 분포 inline (한국어 라벨) · 최근 7일 worker 결과 grid (환각 차단 / 파싱 실패 / 자료 없음 skip 등 분리) - statusLabel / jobLabel 헬퍼 — 운영자 친화 한국어 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 11:59:20 +09:00
Hyungi Ahn	43097e6fd9	fix(study): Phase 4-A envelope 프롬프트 — answer_choice 사용자 정답 강제 검증 결과 모델이 envelope 안에서 자료 근거로 정답 번호를 재판단해서 거의 매번 guard_fail (answer_choice != correct_choice). 환각 가드는 정확히 작동했지만 caching 효율 0%. PR-3 의 free-form 풀이는 "사용자 정답 우선, 충돌 명시" 라 정상 ready 박혔지만 envelope.txt 가 "자료 근거 우선" 으로 충돌. 환각 가드의 본질 — 모델이 envelope 형식을 어겨 임의로 다른 번호를 박는 케이스 차단 — 을 유지하되, answer_choice 값은 사용자 정답 (correct_choice) 을 그대로 박도록 명시. 자료 근거와 사용자 정답이 다를 경우 explanation_md 안에 짧게 명시만 하고 answer_choice 는 보존. 정답 자체를 바꾸는 게 환각 가드의 차단 대상이라고 강조. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 11:47:44 +09:00
Hyungi Ahn	e8da53490c	feat(study): Phase 4-A wrong/unsure AI 풀이 prefetch batch PR-3 의 결과 화면 [AI 해설 보기] 실시간 호출이 클릭 시 8~30초 대기. 풀이 직후 백그라운드 batch 로 미리 생성해 캐시 hit. 환각 가드는 PR-3 보다 강화 — envelope JSON {answer_choice, explanation_md, confidence} + answer_choice == correct_choice 검증 + evidence 의무. processing_queue 가 documents.id FK 라 study_questions 에 직접 재사용 불가 → 별도 study_question_jobs 테이블 + 별도 consumer. Backend: - migrations/231 — study_question_jobs CREATE TABLE (13컬럼, kind 권장값 'explanation' / 'session_summary' 예약, status pending/processing/completed/ failed/skipped, max_attempts=2) - migrations/232 — partial unique idx (qid, kind) WHERE status IN (pending, processing) — active 행 중복 차단, terminal 이력 누적 허용 - models/study_question_job — ORM + enqueue_study_question_job() 헬퍼 (on_conflict_do_nothing 멱등) - prompts/study_explanation_envelope.txt — envelope 형식 프롬프트 (answer_choice 1~4 강제, confidence high/medium/low) - workers/study_explanation_worker — terminal status 분기: · evidence 둘 다 빈 리스트 → job/question 모두 skipped (LLM 호출 X) · answer_choice != correct_choice → guard_fail / failed (재시도 X) · timeout/parse → 재시도 후보 (max_attempts=2) · catch-all except → unknown 명시 + retryable 분기 · question.ai_explanation_status='ready' 이미 박혀있으면 즉시 completed · confidence 는 job.payload 에 보존 (운영 분석) - workers/study_queue_consumer — APScheduler 1분 주기, BATCH_SIZE=1, MLX gate Semaphore(1) 공유. STALE_MINUTES=10 자체 복구 - main.py — scheduler.add_job(consume_study_queue, ..., id='study_queue_consumer') - services/study/explanation_enqueue — finalize + GET fallback 공유 헬퍼: filter_needs_explanation (study_questions status + 최신 job error_code 필터, guard_fail/evidence_missing 인 마지막 job 은 자동 재enqueue 제외) + enqueue_explanation_for_qids (max_count cap) - session_finalize — 끝에서 wrong/unsure qid prefetch enqueue (best-effort, 실패해도 finalize 자체 안 깨짐) - api/study_topics get_quiz_session — done 세션에서 backfill enqueue (max=30, non-blocking, debug 로그) 대상 조건: ai_explanation_status IN ('none', 'failed') OR ai_explanation IS NULL. stale / skipped / pending / ready 는 자동 enqueue 대상 X. stale 재생성은 PR-3 명시 [다시 생성] 또는 후속 Phase 에서. Plan: ~/.claude/plans/nifty-sparking-spindle.md (Phase 4-A) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 11:42:08 +09:00
Hyungi Ahn	711b81f8f0	feat(study): Phase 2-F due_at 정체 정리 — overdue redistribute 사용자가 며칠 안 들어오면 due_today 가 누적되어 학습 페이스 압박. Phase 1 plan 위험 항목 처리. 자동 batch 대신 사용자 명시 액션으로 통제권 보장. Backend: - POST /study-topics/{tid}/review-queue/redistribute — overdue 를 round-robin 분산. days_offset = i % spread_days + 1 (오늘 + 1~7일). 같은 날 안에서도 i*7분 spread 로 시간 분산. review_stage 는 보존 (재배치만, stage 리셋 X). body { spread_days: 1~14, default 7 }. 응답 { redistributed_count, spread_days }. - GET /review-queue?tab=due_today 응답에 overdue_count: int 옵션 필드 — UI 가 경고 + [정리] 노출 판단. due_at < today 0시 (UTC) + stage<4 카운트. Frontend (review-queue): - due_today 탭에서 overdue_count>0 시 노란 banner — "정체 N건" + [정리] 버튼. - 정리 클릭 → confirm → POST → toast (N건을 7일에 분산) → 카운트/목록 reload. - 다른 탭에서는 banner 미노출 (backend 가 overdue_count=0 응답). Plan: ~/.claude/plans/crispy-petting-dijkstra.md (Phase 2-F) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 10:48:00 +09:00
Hyungi Ahn	f42f6ff480	feat(study): Phase 2-E 복습함 멀티 셀렉트 → 복습 세션 복습함 카드 단위 체크박스 + sticky bottom bar 로 N개 골라 한 quiz_session. backend QuizSessionStartRequest 에 question_ids 파라미터 추가 — 우선순위 stage > question_ids > 기존 subject 경로. 명시되면 selection 우회 + 검증 (user × topic 소속 + 미삭제 + 최대 200 + 중복 제거 순서 보존). Backend: - question_ids: list[int] \| None — Field 한도 200 - valid_set 검증: 다른 user/topic 또는 deleted_at 인 qid 는 silent drop - subject_distribution 자동 계산 (결과 카드용) - 빈 wanted / 무효 qid → 400 Frontend (review-queue 페이지): - 카드 좌측 체크박스 (분리 영역, 본문 클릭은 기존대로 문제 페이지) - "이 페이지 전체 선택 / 해제" 토글 - 선택 N>0 시 sticky bottom bar — `{N}개 풀이 시작` 버튼 - 탭 변경 시 선택 초기화 (다른 의도 묶음 가능성) - 페이지 이동 시 선택 유지 (Set<question_id>) - 진행 중 in_progress 세션 있으면 confirm 후 abandon - 200 한도 도달 시 toast 경고 Plan: ~/.claude/plans/crispy-petting-dijkstra.md (Phase 2-E) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 10:39:46 +09:00
Hyungi Ahn	d39882c38e	feat(study): Phase 2-D 학습 통계 대시보드 — 6 섹션 신규 라우트 /study/topics/[id]/stats — backend 단일 endpoint 호출로 6 섹션: 진척도 / 학습 상태 분포 / 복습 큐 / 세션 추이 / 일별 풀이량 / 과목별 약점. 차트는 SVG 직접 렌더 (의존성 0). Backend (app/api/study_question_progress.py): - GET /study-topics/{tid}/stats — 6~7 쿼리 묶음 · 문제 진척도 (study_questions count + progress count) · pattern_state 분포 (NULL → unattempted + 토픽 미시도분 합산) · review_stage 분포 (0/1/2/3/mastered≥4) · due 분류 (today / this_week / later / mastered) — datetime 비교 + filter · 최근 done 세션 추이 (Phase 2-B 4 컬럼 활용, limit 20) · 일별 풀이량 30일 (cast Date + group) · 과목별 약점 (subject 별 attempted/correct/pending_review/chronic) Frontend (/study/topics/[id]/stats): - Card grid 6개. 진행률 바 + stacked horizontal bar + SVG sparkline + bar chart. - 패턴 분포: 7색 stacked bar + 범례 grid. - 복습 큐: 4 카운트 박스 + stage 분포 inline. - 세션 추이: SVG sparkline (50% baseline) + 최근 5세션 표 (회복/퇴행/새로 맞힘 인라인). - 일별 풀이량: SVG bar (max 동적) + title tooltip + start/end 날짜 라벨. - 과목별: 정답률 진행률 바 + 미확인/반복 오답 인라인. 진입: 토픽 페이지 헤더 [통계] 버튼. Plan: ~/.claude/plans/crispy-petting-dijkstra.md (Phase 2-D) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 10:04:03 +09:00
Hyungi Ahn	d3bf963a66	feat(study): Phase 2-B 결과 화면 변화 카운트 + 확인완료 progress 통합 Phase 1 finalize 가 계산하던 SessionSummary 가 응답에 포함되지 않고 discard 되던 것을 quiz_session row 4 컬럼으로 영속화. 결과 화면 헤더에 회복/퇴행/ 새로 맞힘/반복 오답 누적 변화 카운트 + "바로 할 일" 콜아웃 (지금 시점 progress 기반 동적 카운트 — pending_review/chronic/regressed). 동적 카운트는 결과 GET 호출 시점에만 계산 (목록 endpoint 비용 회피). 확인완료 통합 — 결과 카드의 [학습완료] 버튼이 attempts.reviewed_at 만 박던 것을 progress.last_reviewed_at + (wrong/unsure 면 due_at 최초 부여) 도 같이 박도록. reviewed=false 토글은 attempts 만 되돌림 (다른 attempt 가 검토 표시 했을 수 있어 progress 의 last_reviewed_at 은 보존). - migrations/230 — quiz_sessions 4 컬럼 ADD (단일 ALTER TABLE) - StudyQuizSession 모델 + finalize_session 가 row 영속화 - QuizSessionSummary 응답에 4 스냅샷 + 3 동적 필드 (default 0) - _build_session_summary include_progress_counts=True 시 SQL 3회 - review-mark 가 reveiwed=true 시 progress 동기화 - 결과 화면: 헤더 변화 카운트 줄 + 바로 할 일 콜아웃 (값 있을 때만) Plan: ~/.claude/plans/crispy-petting-dijkstra.md (Phase 2-B) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 09:49:01 +09:00

1 2 3 4 5 ...

254 Commits