diff --git a/README.md b/README.md index 5ecd491..c3ddcee 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ CLAP (Contrastive Language-Audio Pretraining) is a model that learns acoustic concepts from natural language supervision and enables “Zero-Shot” inference. The model has been extensively evaluated in 26 audio downstream tasks achieving SoTA in several of them including classification, retrieval, and captioning. -clap_diagrams +clap_diagrams ## Setup diff --git a/docs/diagram.png b/docs/diagram.png new file mode 100644 index 0000000..040e719 Binary files /dev/null and b/docs/diagram.png differ