c72d19599a
Implements the §6.2 enrichment pipeline: embedded tags → Chromaprint
fingerprint → AcoustID lookup. Well-tagged files get correct
artist/album/title offline; the rest are identified via AcoustID
(which also yields a MusicBrainz recording id in one call).
- domain: AudioTags/Fingerprint/RecordingMatch value objects; ports
AudioTagReader, AudioFingerprinter, AcoustIdClient; TrackRepository
.apply_enrichment (gap-fill, never erases) + AlbumRepository.get_or_create
- infrastructure/metadata: MutagenTagReader, FpcalcFingerprinter,
AcoustIdHttpClient (rich meta=recordings+releasegroups, throttled)
- application: MetadataEnrichmentService — tags preferred, AcoustID fills
gaps; resolves artist/album; status enriched/failed; skips manual;
every external step wrapped (graceful degradation)
- workers: enrich_task registered; enqueue_enrich is best-effort and
deferred so the caller's txn commits before the worker reads the row
- wiring: upload enqueues after add; import returns imported_ids and
enqueues post-commit (mid-scan would race the worker); manual
POST /tracks/{id}/metadata/enrich endpoint
- deps: add mutagen (fpcalc/ffmpeg already in the image)
Tests: metadata service orchestration, AcoustID parser, tag helpers.
125 passed; mypy strict + ruff clean.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
35 lines
1.1 KiB
Python
35 lines
1.1 KiB
Python
"""arq worker settings — the queue runtime. Task functions register here.
|
|
|
|
Run with: ``arq app.workers.arq_worker.WorkerSettings``.
|
|
Tasks (download, transcode) are appended to ``functions`` in later steps.
|
|
"""
|
|
|
|
from typing import Any, ClassVar
|
|
|
|
from arq.connections import RedisSettings
|
|
|
|
from app.core.config import get_settings
|
|
from app.core.logging import configure_logging, get_logger
|
|
from app.workers.tasks.enrich_task import enrich_track
|
|
from app.workers.tasks.import_task import scan_local_folder
|
|
|
|
log = get_logger("worker")
|
|
|
|
|
|
async def startup(_ctx: dict[str, Any]) -> None:
|
|
settings = get_settings()
|
|
configure_logging(level=settings.log_level, json=settings.log_json)
|
|
log.info("worker_startup", environment=settings.environment)
|
|
|
|
|
|
async def shutdown(_ctx: dict[str, Any]) -> None:
|
|
log.info("worker_shutdown")
|
|
|
|
|
|
class WorkerSettings:
|
|
functions: ClassVar[list[Any]] = [scan_local_folder, enrich_track]
|
|
on_startup = startup
|
|
on_shutdown = shutdown
|
|
max_jobs = get_settings().max_parallel_downloads
|
|
redis_settings = RedisSettings.from_dsn(str(get_settings().redis_url))
|