Multimodal AI with audio and text

Working with different language models.

read more