Matt d80043c4aa
Some checks failed
Build and Push Docker Image / build (push) Has been cancelled
Strip null bytes from extracted text to fix PostgreSQL UTF-8 errors
Some PDFs contain \x00 null bytes in their text which PostgreSQL rejects
with "invalid byte sequence for encoding UTF8: 0x00". Sanitize extracted
text in both document-analyzer and file-content-extractor services.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 11:34:05 +01:00
Description
No description provided
22 MiB
Languages
TypeScript 99.4%
JavaScript 0.2%
CSS 0.2%
Shell 0.2%