About

Audio Atlas is a fuzzy search tool for sound effects. It takes in text query and returns a list of sound effects that match the query. The core of Audio Atlas is the CLAP (Contrastive Language-Audio Pretraining) model made by LAION-AI.

How does it work?

Here's a simple diagram to explain how Audio Atlas works:

Diagram of how Audio Atlas works

Preprocessing

Audio Atlas preprocesses the existing sound effects data by saving the sound data in a format that is easy to search through. The sound data is then converted into vector embeddings using CLAP's Audio Encoder.

Query

When a user enters a query, Audio Atlas preprocesses the query and converts it into a vector embedding using CLAP's Text Encoder which has the same dimension as the sound embeddings. It is then compared with the sound embeddings to find the most similar sound effects.

The sound effects are then ranked based on their similarity to the query and passed through a pagination algorithm to return the most relevant sound effects.

Download

When a user clicks on a sound effect, Audio Atlas fetches the sound effect from the server based on its ID and plays it for the user. The user can then download the sound effect if they like it.

Technologies Used

Audio Atlas is built using the following technologies: