diff --git a/README.md b/README.md
index ccac4b4..597cbe2 100644
--- a/README.md
+++ b/README.md
@@ -4,7 +4,7 @@
CLAP (Contrastive Language-Audio Pretraining) is a model that learns acoustic concepts from natural language supervision and enables “Zero-Shot” inference. The model has been extensively evaluated in 26 audio downstream tasks achieving SoTA in several of them including classification, retrieval, and captioning.
-
+
## Setup