c72d19599a
Implements the §6.2 enrichment pipeline: embedded tags → Chromaprint
fingerprint → AcoustID lookup. Well-tagged files get correct
artist/album/title offline; the rest are identified via AcoustID
(which also yields a MusicBrainz recording id in one call).
- domain: AudioTags/Fingerprint/RecordingMatch value objects; ports
AudioTagReader, AudioFingerprinter, AcoustIdClient; TrackRepository
.apply_enrichment (gap-fill, never erases) + AlbumRepository.get_or_create
- infrastructure/metadata: MutagenTagReader, FpcalcFingerprinter,
AcoustIdHttpClient (rich meta=recordings+releasegroups, throttled)
- application: MetadataEnrichmentService — tags preferred, AcoustID fills
gaps; resolves artist/album; status enriched/failed; skips manual;
every external step wrapped (graceful degradation)
- workers: enrich_task registered; enqueue_enrich is best-effort and
deferred so the caller's txn commits before the worker reads the row
- wiring: upload enqueues after add; import returns imported_ids and
enqueues post-commit (mid-scan would race the worker); manual
POST /tracks/{id}/metadata/enrich endpoint
- deps: add mutagen (fpcalc/ffmpeg already in the image)
Tests: metadata service orchestration, AcoustID parser, tag helpers.
125 passed; mypy strict + ruff clean.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
57 lines
2.0 KiB
Python
57 lines
2.0 KiB
Python
"""arq task: enrich one track's metadata (plan §6.2, §1D).
|
|
|
|
Wires the §6.2 pipeline adapters to :class:`MetadataEnrichmentService` and runs
|
|
it in the worker's own transactional session. Enqueued (deferred) after upload
|
|
and after a local-folder import. Idempotent and best-effort — a missing track or
|
|
a ``manual`` one is a clean no-op.
|
|
"""
|
|
|
|
import uuid
|
|
from typing import Any
|
|
|
|
from app.application.metadata_service import MetadataEnrichmentService
|
|
from app.core.config import get_settings
|
|
from app.core.logging import get_logger
|
|
from app.infrastructure.db import session_scope
|
|
from app.infrastructure.db.repositories import (
|
|
SqlAlchemyAlbumRepository,
|
|
SqlAlchemyArtistRepository,
|
|
SqlAlchemyTrackRepository,
|
|
)
|
|
from app.infrastructure.metadata.acoustid import AcoustIdHttpClient
|
|
from app.infrastructure.metadata.fingerprint import FpcalcFingerprinter
|
|
from app.infrastructure.metadata.tags import MutagenTagReader
|
|
from app.infrastructure.storage.provider import get_file_storage
|
|
|
|
log = get_logger("worker.enrich")
|
|
|
|
|
|
async def enrich_track(_ctx: dict[str, Any], *, track_id: str) -> dict[str, Any]:
|
|
settings = get_settings()
|
|
api_key = (
|
|
settings.acoustid_api_key.get_secret_value() if settings.acoustid_api_key else None
|
|
)
|
|
acoustid = AcoustIdHttpClient(
|
|
api_key=api_key,
|
|
user_agent=settings.musicbrainz_user_agent,
|
|
api_url=settings.acoustid_api_url,
|
|
)
|
|
|
|
async with session_scope() as session:
|
|
service = MetadataEnrichmentService(
|
|
tracks=SqlAlchemyTrackRepository(session),
|
|
artists=SqlAlchemyArtistRepository(session),
|
|
albums=SqlAlchemyAlbumRepository(session),
|
|
storage=get_file_storage(),
|
|
tag_reader=MutagenTagReader(),
|
|
fingerprinter=FpcalcFingerprinter(settings.fpcalc_path),
|
|
acoustid=acoustid,
|
|
)
|
|
result = await service.enrich(uuid.UUID(track_id))
|
|
|
|
return {
|
|
"track_id": str(result.track_id),
|
|
"status": result.status,
|
|
"mbid": result.matched_mbid,
|
|
}
|