user2493303
user2493303

Reputation: 113

How do I proceed in an attempt to find audio files which play the same song but are in different compressed formats?

all i want is suppose i have same song named as song.mp3 and song.aac now i want my program to identify that they are same, i know this is non-trivail task to do.

so far i have tried fingerprinting audio using dejavu python library which produces 2 different fingerprints for our case song.mp3 and song.aac, hence it doesnt suit need of my program.

I also tried MD5 using FFMPEG but as expected it gives different hash for even same songs downloaded from different websites

Do you guys have any idea how do I proceed? It would be even great to provide me step wise procedure and library to achieve my goal. thank you

Upvotes: 1

Views: 202

Answers (1)

Dr. Jan-Philip Gehrcke
Dr. Jan-Philip Gehrcke

Reputation: 35796

Audio fingerprinting is incredibly complex, and difficult to get right. You do not really want to come up with your own algorithm just like that, because it likely is much worse than established methods (being better than established methods requires doing some research ;-)).

One of the open source solutions for audio fingerprinting which I found is http://echoprint.me/codegen

You can use that in your application, either by calling directly into the libcodegen API, or by spawning subprocesses for audio analysis.

Upvotes: 1

Related Questions