Skip to main content
Win Tun Lin avatar

Win Tun Lin has completed

Multi-Modal Systems with the OpenAI API

Start course For Free
2 hr
2,000 XP
Statement of Accomplishment Badge

Loved by learners at thousands of companies


Course Description

Develop Multi-Modal Applications with the OpenAI API

To build applications that are accessible to as wide an audience as possible, it's important that they can accept inputs in different modalities, like text and speech. In this course, you'll build on your knowledge of OpenAI's text generation models and learn to master their audio models!

Master Audio Applications

Did you know OpenAI developed audio models? In this course, you'll learn to create reliable text transcripts in multiple languages using their speech-to-text models. You'll also flip the script and generate realistic human audio (and in multiple languages!) with their text-to-speech models.

Protect and Moderate AI Applications

AI applications are often designed for a very specific reason, but a malicious actor may try to misuse it in different ways. In this course, you'll utilize OpenAI's moderation models to detect inappropriate user and AI-generated content.

Build Functional Customer Chatbots

You'll complete a case study to create an end-to-end customer service chatbot that will receive customer queries, and use internal resources to respond back to the customer. Not only that, it will respond in their native language and with AI-generated spoken audio!
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.
DataCamp for BusinessFor a bespoke solution book a demo.
  1. 1

    Beyond Text Generation

    Free

    OpenAI provides models that go far beyond text generation. In this chapter, you'll use OpenAI's text-to-speech and text-to-speech audio models. You'll also moderate user content to detect misuse.

    Play Chapter Now
    Speech-to-text
    50 xp
    Creating a podcast transcript
    100 xp
    Transcribing a non-English language
    100 xp
    Translating Portuguese
    100 xp
    Text-to-speech (TTS)
    50 xp
    OpenAI's text-to-speech (TTS)
    100 xp
    TTS in other languages!
    100 xp
    Content moderation
    50 xp
    Why use text moderation models?
    100 xp
    Requesting moderation
    100 xp
    Examining moderation category scores
    50 xp
For Business

Training 2 or more people?

Get your team access to the full DataCamp platform, including all the features.

collaborators

Collaborator's avatar
Eduardo Oliveira
Collaborator's avatar
Francesca Donadoni

prerequisites

Working with the OpenAI API
James Chapman HeadshotJames Chapman

AI Curriculum Manager, DataCamp

James is a Curriculum Manager at DataCamp, where he collaborates with experts from industry and academia to create courses on AI, data science, and analytics. He has led nine DataCamp courses on diverse topics in Python, R, AI developer tooling, and Google Sheets. He has a Master's degree in Physics and Astronomy from Durham University, where he specialized in high-redshift quasar detection. In his spare time, he enjoys restoring retro toys and electronics.

Follow James on LinkedIn

See More
Stan Konkin HeadshotStan Konkin

ML Content Developer, DataCamp

Stan is a Machine Learning enthusiast with a Master’s degree in Applied Mathematics. He’s passionate about applying Data Science, Analytics, and AI to real-world challenges. In his free time, he enjoys hiking and basketball.
See More

Join over 18 million learners and start Multi-Modal Systems with the OpenAI API today!

Create Your Free Account

or

By continuing, you accept our Terms of Use, our Privacy Policy and that your data is stored in the USA.