Skip to content
#

qbv

Here is 1 public repository matching this topic...

This repository provides the code for "Improving Query-by-Vocal Imitation with Contrastive Learning and Audio Pretraining", presented at DCASE 2024. The paper addresses the challenge of audio retrieval using vocal imitations as queries, proposing a dual encoder architecture that leverages pretrained CNNs and an adapted NT-Xent loss for fine-tuning.

  • Updated Jan 6, 2025
  • Python

Improve this page

Add a description, image, and links to the qbv topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the qbv topic, visit your repo's landing page and select "manage topics."

Learn more