feat(search): add document_chunks page/source columns + unique idx

migrations 279-281: page_start/end + source_type/chunker_version/source_hash/chunk_content_hash, legacy backfill (30,952 rows), unique (doc_id,source_type,chunker_version,chunk_index). PR-DocSrv-LargeDoc-Split-Markdown-1 commit 1.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
hyungi
2026-05-24 07:01:34 +00:00
parent 329c9eac76
commit 72190cf90a
3 changed files with 23 additions and 0 deletions
@@ -0,0 +1,5 @@
-- UNIQUE (doc_id, source_type, chunker_version, chunk_index).
-- dup-check (doc_id, chunk_index)=0 확인 (2026-05-24) → legacy backfill 후 키 유효.
-- 비-CONCURRENTLY: runner 트랜잭션 안전 + ~30K rows 규모 현실적.
CREATE UNIQUE INDEX IF NOT EXISTS uq_document_chunks_source_version_index
ON document_chunks (doc_id, source_type, chunker_version, chunk_index);