Multimodal Generative AI: Vision, Speech, and Assistants
About this Course
We are introducing a new course to replace the \"Coding with ChatGPT\" course in the Generative AI specialization. This updated course will cover materials, models, and content released in 2024. Some of the new additions include material on using AI for image-to-text (vision), text-to-speech, speech-to-text, and the Assistant API. All these topics come with new labs, lessons, and exercises.Created by: Codio

Related Online Courses
This is a self-paced lab that takes place in the Google Cloud console. Learn how to develop an application to create PDFs on Google Cloud using Serverless technologies and Go.Created by: Google Cloud more
This is a self-paced lab that takes place in the Google Cloud console. Google Cloud Platform (GCP) Virtual Private Cloud (VPC) Network Peering allows private connectivity across two VPC networks... more
This comprehensive course on Persuasive Communication is designed to guide students from fundamental concepts to advanced techniques necessary for enhancing their communication skills in a... more
The Large Language Models Specialization equips learners with a solid foundation and advanced skills in NLP, covering LLM fundamentals, data preparation, fine-tuning, and advanced techniques.... more
This specialization covers the foundations of visualization in the context of the data science workflow. Through the application of interactive visual analytics, students will learn how to extract... more