-
Vũ Hoàng Anh authored
feat(ocr): prioritize direct DOCX parse to resolve Google Drive numbering corruption, add text export + regex fallback for PDFs
371783cc
feat(ocr): prioritize direct DOCX parse to resolve Google Drive numbering corruption, add text export + regex fallback for PDFs
| Name |
Last commit
|
Last update |
|---|---|---|
| static | Loading commit data... | |
| test | Loading commit data... | |
| .gitignore | Loading commit data... | |
| Dockerfile | Loading commit data... | |
| app.py | Loading commit data... | |
| docker-compose.yml | Loading commit data... | |
| docx_converter.py | Loading commit data... | |
| final_test.txt | Loading commit data... | |
| format_contract.py | Loading commit data... | |
| ocr_drive.py | Loading commit data... | |
| out.txt | Loading commit data... | |
| requirements.txt | Loading commit data... | |
| test_drive_ocr.txt | Loading commit data... | |
| test_e2e.py | Loading commit data... | |
| test_ocr.txt | Loading commit data... | |
| test_out.txt | Loading commit data... | |
| text_to_markdown.py | Loading commit data... |