Multimodal Generative AI: Vision, Speech, and Assistants
About this Course
We are introducing a new course to replace the \"Coding with ChatGPT\" course in the Generative AI specialization. This updated course will cover materials, models, and content released in 2024. Some of the new additions include material on using AI for image-to-text (vision), text-to-speech, speech-to-text, and the Assistant API. All these topics come with new labs, lessons, and exercises.Created by: Codio

Related Online Courses
This 3-course Specialization from Google Cloud and New York Institute of Finance (NYIF) is for finance professionals, including but not limited to hedge fund traders, analysts, day traders, those... more
This Specialization will provide learners with the knowledge and skills to recognize key shifts in the industry and to have an agile perspective on how these shifts might impact their... more
The course provides the principles of modelling and simulation of modern mechatronic systems, which are mechanical systems integrated with several types of sensors and actuators. The aim of the... more
Gemini for Google Workspace is an add-on that provides customers with generative AI features in Google Workspace. In this mini-course, you learn about the key features of Gemini and how they can be... more
The Google Certified Professional Cloud Architect specialization is for individuals aspiring to become proficient Google Cloud Platform (GCP) Professional Architects. The specialization helps you... more