Commit Graph

711 Commits

Author SHA1 Message Date
admin d828a91bf3 Use embedded line numbers instead of built-in line numbering
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove w:lnNumType; line numbers are now plain text in each paragraph.
- Ensures first line is always '1' across Word, LibreOffice, Google Docs.
- Each paragraph: line number + tab + content.
2026-06-16 19:15:47 +00:00
admin 670c6d3e2b Fix first-page line numbering off-by-one in transcript DOCX
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove docGrid element to prevent phantom grid-based line offset.
- Ensure exactly one lnNumType element (no duplicates).
- First visible line on page 1 now correctly numbered as 1.
2026-06-16 19:09:26 +00:00
admin f20102d564 Fix transcript DOCX line numbering (spacing and column fixes)
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Ensure single column layout (cols num='1')
- Set explicit single line spacing (before/after=0, line=240 twips)
- Prevents Word from counting extra lines due to spacing/columns
2026-06-16 18:08:46 +00:00
admin 0e6bc53cf8 Fix duplicate pgMar causing line numbering issue
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Update existing pgMar instead of appending a second one
- Prevents Word from miscounting lines on first page
2026-06-16 18:03:39 +00:00
admin c43076efd4 Increase timeouts for large-file transcription
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- LocalAI client timeout: 600s -> 3600s
- Summarizer timeout: 600s -> 3600s
- Add task_time_limit=14400s (4h) and soft_time_limit=13500s to transcription task
2026-06-16 17:18:09 +00:00
admin 03d66219d9 Rebuild transcript DOCX generation flow
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Clean, single-pass implementation for transcript and summary DOCX
- Explicit margins, font, line numbering per OOXML spec
- Disable docGrid to prevent off-by-one line numbering
- Ensure first content line is line 1
2026-06-16 16:54:48 +00:00
admin 0c0e52dfb8 Fix syntax error in speaker identification prompt string
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-16 16:05:02 +00:00
admin 604bfa3f41 Ensure identified speaker names/roles are printed in ALL CAPS
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-16 16:02:55 +00:00
admin 8ff473f3e6 Fix transcript DOCX line numbering starting at 2 (docGrid)
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Disable document grid (w:type='none') when enabling line numbering
- Prevents Word from treating an empty grid line as line 1
2026-06-16 16:00:09 +00:00
admin 0b3f737e5b Update speaker identification to use real names or roles instead of random names
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-16 15:49:39 +00:00
admin 598f8630de Fix transcript DOCX line numbering (invalid 'eachPage' value)
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Replace invalid 'eachPage' with valid 'newPage' for w:lnNumType restart attribute
- This ensures Word starts line numbering at 1 on the first page
2026-06-16 15:41:12 +00:00
admin 7fac0e7d9c Fix transcript DOCX line numbering starting at line 2 (robust)
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Fully clear default paragraphs from document body so Word's line numbering starts at the first real line
2026-06-15 16:26:28 +00:00
admin 5dd56a3368 Fix missing subject on emails with attachments
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Ensure Subject header is set on the outermost MIME part when attachments are present
- Restructure send_email to use multipart/mixed as root with headers when attachments exist
2026-06-15 15:03:50 +00:00
admin 7364d572d5 Fix transcript DOCX line numbering starting at line 2
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove initial empty paragraph so Word's line numbering starts at first real line
2026-06-15 14:54:32 +00:00
admin d51b006a19 Fix Gradio launch error and adjust upload template
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove unsupported 'enable_api' argument from app.launch()
- Hide API link via CSS instead
- Remove queue-position paragraph from upload_notification_template.html
2026-06-15 04:06:55 +00:00
admin ea5a0752df Update README to reflect current behavior
Mirror and run GitLab CI / build (push) Has been cancelled
- Remove PDF-related references
- Clarify DOCX format: no cover pages, transcript line-numbered
- Align output files and env vars with current implementation
2026-06-15 03:58:56 +00:00
admin b0a1bc059b Simplify email subject handling and remove duplicate functions
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove send_success_email/send_error_email from email_sender.py
- Centralize subject logic in tasks.py using _get_subject()
- Use EMAIL_SUBJECT_SUCCESS and EMAIL_SUBJECT_ERROR with clear defaults
- Ensure subject is always set and logged before sending
2026-06-15 03:52:19 +00:00
admin e27e5b8522 Revert PDF generation; simplify to DOCX + MD + JSON only
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Remove PDF helpers, LibreOffice, PyPDF2, reportlab
- Transcript DOCX: standalone, no cover page, with line numbering
- Summary DOCX: standalone, no cover page, no line numbering
- Attachments:
  - Transcribe: JSON, transcript MD, transcript DOCX
  - Transcribe & Summarize: JSON, transcript MD, transcript DOCX, summary MD, summary DOCX
2026-06-15 03:38:12 +00:00
admin 6233a41f61 Remove Gradio API page and 'Use via API' link from web UI
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Set enable_api=False in app.launch()
- Hide API-related links via CSS
2026-06-15 03:26:34 +00:00
admin 237bd4b37c Refactor PDF generation and attachment logic
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Generate PDFs by:
  - Creating individual .docx components (cover, transcript, summary)
  - Converting each .docx to PDF
  - Merging PDFs in correct order
  - Adding page numbers to final PDFs

- Transcribe & Summarize:
  - Attach: JSON, transcript MD, summary MD, TRANSCRIPT.pdf, SUMMARY.pdf, COMBINED.pdf

- Transcribe only:
  - Attach: JSON, transcript MD, TRANSCRIPT.pdf

- Ensure transcript line numbering is isolated to its own .docx before PDF merge
2026-06-15 03:16:53 +00:00
admin 7ece1a50c2 Update Web UI: rename option, increase title font, default identify speakers
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Rename 'Transcript & Summarize' to 'Transcribe & summarize'
- Increase title font size to 60px via CSS
- Set 'Identify speakers' checkbox to default selected
2026-06-15 03:02:19 +00:00
admin 46fbcf80af Ensure success and error emails always have a subject
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Use EMAIL_SUBJECT_SUCCESS env var for success emails
- Use EMAIL_SUBJECT_ERROR env var for error emails
- Provide safe defaults if env vars are missing or blank
- Add final guard in send_email() to prevent blank subjects
2026-06-15 02:57:09 +00:00
admin 42a155aeaa Add PDF-based document generation with LibreOffice; fix line numbering and margins
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
- Add LibreOffice Writer and DejaVu fonts to Dockerfile for PDF generation
- Add PyPDF2 and reportlab to requirements.txt
- Refactor email_sender.py:
  - Enforce 1-inch margins on all sides
  - Isolate line numbering to transcript section only
  - Add generate_pdf_documents() to build:
    - TRANSCRIPT.pdf (cover + transcript)
    - SUMMARY.pdf (cover + summary)
    - COMBINED.pdf (transcript cover + summary + TRANSCRIPT header + transcript)
  - Add page numbers (bottom-right) to all PDFs via reportlab
- Update tasks.py:
  - Use generate_pdf_documents() after creating DOCX files
  - Attach source JSON, MD files, and compiled PDFs in success email
- Add test_docx_generation.py for transcript/summary/combined DOCX testing
2026-06-15 02:19:17 +00:00
admin b0a23b32e1 Fix page numbering: correct field insertion for PAGE and NUMPAGES
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 23:08:51 +00:00
admin 2e2bc3fb29 Fix page numbering: use correct python-docx field insertion for PAGE and NUMPAGES
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 23:03:12 +00:00
admin 2f9299389b Fix line numbering: only transcript pages; ensure page numbering fields are set correctly
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 22:25:26 +00:00
admin e0d2fd6963 Fix combined .docx: line numbering only for transcript, centered cover pages, correct date format, reliable page numbering
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 22:07:36 +00:00
admin 4651c5f8b2 Ensure success email subject is never blank; add final guard in send_email
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:56:04 +00:00
admin 6c11a8f19a Add 'Page X of Y' footer to all .docx files
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:51:12 +00:00
admin 2a2a5e024c Update combined .docx order: cover page, page break, summary, page break, transcript
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:47:36 +00:00
admin 7adca3d921 Add cover pages to transcript/summary .docx with AI-generated descriptions; include combined .docx when both requested
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:33:15 +00:00
admin efb34dd9ff Translate markdown headings to WYSIWYG styles in summary .docx
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:18:42 +00:00
admin 11e5309a8e End underline at colon in transcript .docx, not over following space
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:14:45 +00:00
admin a3ca1f3505 Ensure success email subject is wired to EMAIL_SUBJECT_SUCCESS and never blank
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:11:53 +00:00
admin 154cac6c7b Ensure success email subject is wired to EMAIL_SUBJECT_SUCCESS and never blank
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 21:09:25 +00:00
admin 18f4a4e8de Reduce logo to 75px desktop / 50px mobile; increase title font by 3pt
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 21:05:19 +00:00
admin 2f304e3ed1 Fix header.html template escaping so title and logo render correctly
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 21:02:59 +00:00
admin fd94e2daa0 Center logo above title in header for desktop and mobile
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 21:00:04 +00:00
admin e74bc04cb3 Show timestamp and speaker name on same line as text in transcript .docx
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 20:57:32 +00:00
admin c792fa17e8 Fix .logo-container: remove flex, limit to 75px
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 20:54:19 +00:00
admin e55f36a131 Improve queue position accuracy and wording in upload email
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 20:51:34 +00:00
admin 572587bb85 Fix syntax error in tasks.py speaker identification prompt
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 20:43:37 +00:00
admin cfc38b21ed Reduce header logo size to 25% while keeping responsive layout 2026-06-14 20:42:31 +00:00
ScrAIbe Admin 1582b90ddb Fix header template escaping; ensure title and logo render from env vars
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 20:11:28 +00:00
ScrAIbe Admin 9ec4c4ccba Restore title and logo in header with responsive layout
Mirror and run GitLab CI / build (push) Has been cancelled
2026-06-14 19:48:57 +00:00
ScrAIbe Admin 8ecae8f648 Optimize Web UI for mobile: fix logo overlap and responsive layout
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 19:00:40 +00:00
ScrAIbe Admin 49e999f0ee Add Identify speakers option: AI infers names and replaces Speaker IDs in transcript
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 18:05:37 +00:00
ScrAIbe Admin eb9b2f9126 Fix 'Section' object has no attribute '_element' in create_transcript_docx
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 17:53:01 +00:00
admin 50c7ec90a0 Always send numeric queue position in upload notification email
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 17:00:06 +00:00
admin f7c9c70bfc Robustly wire email subjects from env vars with safe fallbacks and logging
Mirror and run GitLab CI / build (push) Has been cancelled
Ruff / ruff (push) Has been cancelled
2026-06-14 16:31:03 +00:00