Ready to level up your GenAI skills? Step into the exciting world of multimodal AI, where language, images, and speech come together to build smarter, more interactive applications.

Build Multimodal Generative AI Applications
Seize the savings! Get 40% off 3 months of Coursera Plus and full access to thousands of courses.

Build Multimodal Generative AI Applications
This course is part of IBM RAG and Agentic AI Professional Certificate


Instructors: Hailey Quach
8,548 already enrolled
Included with
48 reviews
Recommended experience
What you'll learn
Build the job-ready skills you need to build multimodal generative AI applications in just 3 weeks
Understand the fundamental concepts and challenges in multimodal AI, including the integration of text, speech, images, and video
Build multimodal AI applications using state-of-the-art models and frameworks such as IBM’s Granite, Meta’s Llama, OpenAI’s Whisper, DALL·E and Sora
Develop multimodal AI solutions, including chatbots and image/video generation models, using IBM watsonx.ai, Hugging Face, Flask and Gradio
Skills you'll gain
Tools you'll learn
Details to know

Add to your LinkedIn profile
6 assignments
See how employees at top companies are mastering in-demand skills

Build your Software Development expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate from IBM

There are 3 modules in this course
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Offered by
Explore more from Software Development
Status: Free Trial
Status: Free Trial
Status: Free Trial
Status: Free Trial
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Learner reviews
- 5 stars
85.41%
- 4 stars
10.41%
- 3 stars
2.08%
- 2 stars
0%
- 1 star
2.08%
Showing 3 of 48
Reviewed on Oct 26, 2025
Wow, It was next Level Experience to learn the Multimodal Gen AI Development. Truly Amazing.

Open new doors with Coursera Plus
Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 3,400 global companies that choose Coursera for Business
Upskill your employees to excel in the digital economy

