hyungi_document_server

Author	SHA1	Message	Date
hyungi	688532b1fa	fix(briefing): surface held as 409 + study attempt naive datetime→UTC (R8) - briefing.regenerate: surface held (a normal, policy-driven hold) as 409, matching the canonical behavior in digest.py. Previously briefing_worker.run() returned None for held, timeout, and exception alike, so the API misreported all three as 500 (silent-state-conflation). Added an entry guard on 'briefing' in pipeline_held_stages. - study_question.answered_at: naive default datetime.now → lambda: datetime.now(timezone.utc). The container was measured to run in UTC, so stored values are identical and no backfill is needed; this removes a latent dependency that would have skewed values by 9h if the container TZ changed. Verification: py_compile passes. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-16 13:51:42 +09:00
hyungi	0a7402b327	feat(study): study memorization notes Phase 1: card_extract extraction pipeline (purely additive) study_memo_cards extraction pipeline + version-key poller + needs_review columns. Production SR code (session_finalize/quiz_selection) untouched. - migrations 287~298: study_memo_cards/_evidence/_jobs/_progress (dormant in P1), study_reminders, study_topics.focused_at, 3 needs_review columns on study_questions. dedup PARTIAL UNIQUE(deleted_at IS NULL). - Worker: in-process RAG gather → MLX {cards} → card guards (quantitative values must appear in the evidence source text; cue/cloze leakage; dedup) → supersede retires old versions → append. Isolated from the existing study_queue via a separate consumer. - Poller study_card_enqueue: idempotent via version-key NOT EXISTS(source_version) + ai_explanation_generated_at NOT NULL guard + per-poll LIMIT (thundering-herd). - Verification: all 12 migrations applied OK on a real prod schema dump + dedup/supersede/active-unique functional checks 7/7 PASS + normalization util 15/15. plan: PKM plans/2026-06-05-study-memo-card-p1-plan.html Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-06 21:33:12 +09:00
Hyungi Ahn	219e233a48	feat(study): related-types DB cache: stop recomputing HNSW on every request - migrations 220/221: add related_repeat/similar JSONB + count/grade/computed_at/threshold_version + partial idx to study_questions - Embedding worker: right after a row becomes ready, compute and store related in the same transaction + invalidate ready rows in the same topic by setting related_computed_at=NULL - New cron study_q_related_refresh (1 min, batch=20): bulk-recomputes stale caches - API list_related_types: on a cache hit (computed_at set + threshold version matches), respond with a single SELECT; on a miss, compute and store immediately, then respond - update_question PATCH: set related_computed_at=NULL when the body or exam_round changes - soft delete: invalidate ready rows in the same topic. When the threshold changes: bump related_types.THRESHOLD_VERSION + run UPDATE WHERE version != '<new>' SET computed_at=NULL once, and the cron recomputes everything automatically. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-01 07:22:31 +09:00
Hyungi Ahn	7f4d64c6df	feat(study): quiz sessions + result cards + study-complete check (PR-10) - study_quiz_sessions table (partial unique: at most one in_progress session per topic) - quiz_session_id + reviewed_at columns on study_question_attempts - Quiz progress has the server as the single source of truth (cursor), so a session can be resumed after leaving - Unified view: in-progress card (resume) + recently completed result cards (badge showing N unreviewed) - New /quiz-sessions/[sid] result page (3 categories + AI explanation + subject-area explanation + study-complete toggle) - The /review page handles solving only; after the last question it redirects to the result page - Migrations 206~209 (single-statement, asyncpg compatible) - API: POST/GET/PATCH /study-topics/{tid}/quiz-sessions(/{sid}), PATCH /study-question-attempts/{aid}/review-mark - Added AttemptCreate.quiz_session_id: submit_attempt increments the session cursor + count in the same transaction, and on the last question sets status='done' + finished_at Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 16:49:21 +09:00
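Hyungi Ahn	d968b2d901	feat(study): quiz mode overhaul + outcome classification + subject-area explanation (PR-9) - Label "Start review" → "Solve questions" - attempts.outcome column + selected_choice nullable (correct/wrong/unsure) - While solving, the answer, explanation, AI output, and similar questions are all hidden; clicking an answer advances automatically - Added a 5th option, "Not sure" - Result screen = 3 category tabs (correct/wrong/not sure); click a card to expand - Wrong → PR-3 AI explanation (RAG) - Not sure → on-demand AI explanation of the subject area (subject+scope) + cache (new in PR-9) - Subject-area explanation RAG: chunks of mapped documents + other questions and explanations in the same area → bge-reranker - Migrations 200~205 (single-statement, asyncpg compatible) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 15:58:35 +09:00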
Hyungi Ahn	8803e6a0fd	feat(study): exam, round, and question management (PR-6) Scenario: filling in 100 questions per round of an engineer certification exam. Upgrades the question entry page from a simple form into a tool for tracking and resuming per-round progress. Data model (migrations 195~197): - study_topics: exam_round_size INT CHECK 1~300 (questions per round, NULL=unset) + exam_subjects JSONB DEFAULT '[]' (subject list, used as dropdown options on the entry page) - study_questions: exam_question_number SMALLINT CHECK >0 (question number within a round) - partial idx (study_topic_id, exam_round, exam_question_number) WHERE deleted_at IS NULL AND exam_round IS NOT NULL, to speed up per-round max+count Backend: - POST /questions: when exam_round is given and exam_question_number is omitted, the server auto-fills max+1 for the same topic and round - New GET /api/study-topics/{id}/exam-rounds: per-round progress aggregate {exam_round_size, items: [{exam_round, question_count, max_question_number, next_question_number, is_complete}]} - exam_round_size and exam_subjects added to StudyTopic Create/Update/Response/Meta - exam_question_number added to StudyQuestion Create/Update/Response - Changes to exam_question_number are excluded from the embedding stale trigger (no semantic impact) Frontend: - Topic create/edit modal: "Exam info" section (questions per round + subject list with add/remove chips) - New /study/topics/[id]/exam-rounds page: round cards + progress bar + [Continue from question N] button + [Start new round] modal - [View rounds] entry point in the unified view's questions section header - /questions/new page fully reworked: - exam name auto-prefilled from topic.name - subject dropdown (topic.exam_subjects + existing distinct values, with an "Enter manually" toggle) - round dropdown (existing distinct values + "New round") - question number auto-filled (next_question_number when a round is selected, 1 for a new round) - progress bar (current/exam_round_size) - source/memo auto-composed as "round, no. N" (editable) - "Save and continue" → resets body/choices/answer, keeps the round, question number +1 - on a round change, the question number resets to 1 - on reaching exam_round_size, the round is highlighted and "Save and continue" is disabled - supports query string ?exam_round=&start_qnum= (resume entry from the round list) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 09:31:06 +09:00
Hyungi Ahn	9d4aa201a8	feat(study): automatic embedding of study_questions (PR-4) Question body + choices 1~4 → bge-m3, 1024 dimensions. The status column itself acts as the queue (no separate queue table, so zero impact on the ProcessingQueue infrastructure). An APScheduler 1-minute cron processes rows with status in {none, failed, stale} at batch=10. New questions default to 'none' and are backfilled automatically. Data model (migrations 193~194): - study_questions: embedding vector(1024), embedding_status VARCHAR(20) DEFAULT 'none' (none/pending/ready/failed/stale), embedding_updated_at, embedding_model - HNSW partial index (vector_cosine_ops) WHERE deleted_at IS NULL AND embedding IS NOT NULL: bge-m3 cosine, ops consistent with documents.embedding (ivfflat) Recompute trigger: ready→stale automatically when question_text / choice_1~4 change. Changes to correct_choice / explanation / subject / scope do not trigger recomputation (no effect on semantic search). Worker (workers/study_question_embed_worker.py): - race-safe pending marking (conditional UPDATE WHERE status IN none/failed/stale) - calls AIClient.embed(text) with bge-m3, 15s timeout - on failure status='failed', the previous embedding is preserved, and the row is retried on the next cron tick - body = "문제: ...\n보기:\n1. ...\n2. ...\n3. ...\n4. ..." (문제 = question, 보기 = choices; subject/scope intentionally excluded because category names are noise for semantic search) Follow-up PRs planned: similar-question search UI / duplicate entry detection / RAG accuracy improvements / wrong-answer clustering. This PR covers only embedding storage, recomputation, and backfill. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 08:54:02 +09:00
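Hyungi Ahn	e1a2cdc677	feat(study): AI explanation generation: manual trigger + RAG (PR-3) The AI generates an explanation for a 4-choice question only when the user explicitly requests it, either after submitting a review answer or from the edit screen. Automatic bulk generation is prohibited (MLX load when 100 questions are entered in a day, and the risk of explaining incorrectly entered questions). Data model (migrations 191~192): - 4 columns added to study_questions: ai_explanation TEXT, ai_explanation_status VARCHAR(20) DEFAULT 'none' (none/pending/ready/failed/stale), ai_explanation_generated_at, ai_explanation_model - partial idx (study_topic_id, ai_explanation_status) WHERE status != 'none' Automatic stale transition on PATCH: when question_text/choice_*/correct_choice change, only status='ready' moves to 'stale'. The text is preserved; the UI shows a badge + a "Regenerate" path. New endpoint: POST /api/study-questions/{id}/ai-explanation - regenerate=false + ready/stale → cached result returned immediately (no MLX call, is_stale flag) - pending → 409 (a race-safe conditional UPDATE blocks concurrent calls) - otherwise → new generation RAG input pool: - Priority 1: chunks of documents mapped to the study_topic + ai_summary, bge-reranker top-5 - Priority 2: other questions in the same topic (excluding itself; ai_explanation is included only when ready, to prevent recursive hallucination), reranker top-3 - Excluded: handwriting OCR / external web / Premium models Model: Mac mini MLX gemma-4-26b as the sole primary. Goes through get_mlx_gate() Semaphore(1), 30s timeout. On failure status='failed' + the previous text is preserved. Prompt (app/prompts/study_question_explanation.txt): source priority, citation format, and absolute anti-hallucination rules (never assert law names, articles, figures, or standard numbers; state "not confirmed in the sources" explicitly). Frontend: - On the review screen, inline expand after answer submission. Buttons branch by status (ready: cached / stale: "Previous explanation"+"Regenerate" / failed: "Retry") - Separate card on the edit screen. Status badge + separate "View previous explanation" / "Regenerate" actions - Reference evidence toggle (per-source_type icon 📄/❓ + title + snippet) Deferred to follow-up PRs: wrong-answer notes/statistics, AI bulk background generation, handwriting OCR RAG, Premium/Claude regeneration, /api/search/ask retrieval scope integration. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 08:41:46 +09:00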
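Hyungi Ahn	4b7156061e	feat(study): question bank + review mode (study_questions) Adds a 4-choice question bank asset track to the study_topic workspace. Scenario: preparing for the written part of an engineer certification exam, with fast repeated entry + review sampled evenly across subjects + accumulated right/wrong history. Data model (migrations 186~190): - study_questions: study_topic 1:N, soft delete, is_active toggle, correct_choice SMALLINT CHECK 1~4 - study_question_attempts: one row accumulated per answer submission. The study_question_id FK is ON DELETE RESTRICT (history preservation principle: prevents losing attempt records through an accidental hard delete) Design principles: - Question deletion is soft delete only at the API level. The attempts FK RESTRICT also protects at the DB level - Changing correct_choice does not recompute existing attempts.is_correct (point-in-time facts are preserved) - Review default = random even sampling of target_per_subject(20) per subject. If a subject has too few questions, only what is available is used - wrong_only=true means questions whose most recent attempt was wrong (latest-wrong, not ever-wrong) - Answers and explanations are withheld from the question-serving response and revealed only on answer submission - No strict enum for subject/scope (free text; autocomplete is a follow-up) API: /api/study-topics/{id}/questions, /review/questions, /api/study-questions/{id}, /attempt. Adds sections.questions / stats.question_count to the unified view (/study-topics/{id}) response. The existing question_set_count is kept for a follow-up PR (round/mock-exam bundles). Frontend: questions section on /study/topics/[id] + "New question"/"Start review" entry points. /questions/new (save and continue + sessionStorage persistence), /questions/[qid]/edit (banner noting that attempts are not recomputed when the answer changes), /review (start options → solving → final summary). Follow-up PRs planned: wrong-answer notes/weak-subject report, AI explanations/clustering, spaced repetition, image OCR input, CSV import, study_question_sets bundles. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 08:00:37 +09:00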

9 Commits
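
The naive-datetime fix in commit 688532b1fa can be sketched with the standard library alone. `AttemptRecord` and `utc_now` are hypothetical stand-ins (the real model definition is not shown in the log); the point is only that the default is a callable returning a timezone-aware UTC value, rather than a naive `datetime.now`.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


def utc_now():
    """Timezone-aware current time in UTC, independent of the host TZ."""
    return datetime.now(timezone.utc)


@dataclass
class AttemptRecord:
    """Hypothetical record; answered_at mirrors the column in the commit."""
    question_id: int
    # default_factory is called per instance, like a callable column default.
    answered_at: datetime = field(default_factory=utc_now)


record = AttemptRecord(question_id=1)

# A naive value carries no offset, so its meaning depends on the host TZ.
naive_value = datetime.now()
```

Because `record.answered_at` carries an explicit UTC offset, changing the container's time zone would not shift stored values, whereas `naive_value` has no `tzinfo` at all.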