3.6 KiB
3.6 KiB
Verification Workflow
Use data/verified_author_overrides.tsv for manual metadata corrections.
Using generate_abs_mock_report.py
The script generates a non-destructive TSV report with proposed Audiobookshelf paths. It does not rename or move files.
What it does:
- scans the audiobook library tree
- detects audiobook roots based on audio files
- tries to infer author, title, series, sequence, year, and narrator from folder names and sidecar OPF files
- applies manual corrections from
data/verified_author_overrides.tsv - writes a TSV report with proposed target paths for Audiobookshelf
What it does not do:
- does not rename files
- does not move directories
- does not modify the library itself
Basic usage:
python3 generate_abs_mock_report.py
Default behavior:
- reads the library from
/mnt/nextcloudExtDS/Ksiazki/Audiobooki - writes the report to
reports/audiobookshelf_mock_report.tsv - applies manual corrections from
data/verified_author_overrides.tsv
Available options:
python3 generate_abs_mock_report.py --help
--root ROOT Path to the current audiobook library
--output OUTPUT TSV output path
--overrides OVERRIDES Optional TSV with verified metadata overrides
Examples:
python3 generate_abs_mock_report.py \
--root /mnt/nextcloudExtDS/Ksiazki/Audiobooki
python3 generate_abs_mock_report.py \
--root /path/to/library \
--output reports/custom_report.tsv \
--overrides data/verified_author_overrides.tsv
Typical workflow:
- Run
python3 generate_abs_mock_report.py. - Open
reports/audiobookshelf_mock_report.tsv. - Review rows with
status=reviewfirst, then ambiguousunverifiedrows. - Add confirmed metadata to
data/verified_author_overrides.tsv. - Run the script again to regenerate the report with overrides applied.
What the script prints after completion:
library_rootused for the scanreportpath to the generated TSVbooksnumber of detected audiobook rootsreadyrows with enough metadata to propose a target pathreviewrows that still need manual verification
Main output file:
reports/audiobookshelf_mock_report.tsv
Important columns in the TSV:
statuscurrent_pathauthorseriessequencetitleproposed_abs_pathnotesverification_statusverification_source
How to read the main status fields:
status=readymeans the row has enough metadata to build a proposed target path.status=reviewmeans the row still needs manual verification.verification_status=unverifiedmeans no manual override was applied yet.verification_status=verified_webmeans the row was corrected or confirmed from a web source stored inverification_source.
Notes about paths:
current_pathis the detected source folder in the current library.proposed_abs_pathis the suggested logical Audiobookshelf path relative to the author/series/title structure.- The script creates the parent directory for the output TSV automatically if it does not exist.
Source preference:
- Prefer a direct audiobook/store/catalog page when it clearly confirms the metadata.
lubimyczytac.plis an approved auxiliary source for verifying author, title, and series/cycle names.- Use
lubimyczytac.plespecially when path-derived guesses are ambiguous or when storefront metadata is incomplete.
Recommended fields to confirm:
- author
- title
- series
- sequence
When adding an override:
- Put the confirming page URL in
verification_source. - Keep the note in
verification_noteshort and only add it when it explains a correction or ambiguity.