Content Analysis
The digital archive of a library is to be verified. As a first step, all files are sorted whether they are readable or damaged - in the latter case, those files should be repaired, if possible.

Uncorrectable files are routed to a manual check for a possible later manual repair. Other files are then checked for duplicates and general errors. Additionally, the music to speech ratio is computed. For every file, there is a report written, which contains the format type and most important attributes.
Ideally, after this workflow, the audio files are converted to a uniform format - or possibly multiple formats - see the Case Study "Content for Internet Portals".