No. It is a Scrimba Pro course requiring a subscription. The free AI starting points are Learn to Code with AI and Intro to Mistral AI.

Two things: generating and editing images with DALL-E, and analysing or interpreting images with GPT-4 Vision, both inside AI-powered apps.

Guil Hernandez, who also teaches Learn to Code with AI and Learn RAG on Scrimba.

Intro to Dall-E and GPT Vision

Q: Do I need prior AI experience?

Yes. It assumes you can already call a model from code. Newcomers should do Intro to AI Engineering first.

Q: Is it worth taking if my app has no images?

Probably not. It is a specialised topic. Without an image use case, your time is better spent on the AI fundamentals, RAG, or agents.

Scrimba's short Pro course on the visual side of AI apps, taught by Guil Hernandez in about an hour. You generate images with DALL-E and interpret them with GPT-4 Vision.

Quick answer

Intro to Dall-E and GPT Vision is Scrimba's intermediate, Pro-tier course on multimodal AI: roughly 1 hour across 16 lessons, taught by Guil Hernandez. You use DALL-E to create and edit images, and GPT-4 Vision to analyse and interpret them, then bring both into AI-powered apps. It is a narrow, focused addition for when text alone is not enough.

Taught by Guil Hernandez (opens in a new tab)

Generate and edit images with DALL-E and interpret them with GPT-4 Vision inside your AI apps.

Duration: 1.0 hrsLevel: Intermediate16 lessons

View on Scrimba

Is it worth your time?

If your app needs to make or understand images, this is a quick, direct way to learn both sides of that without wading through a longer course. At an hour it is easy to slot in, and Guil Hernandez keeps the examples concrete.

The honest caveat is how narrow it is. Image generation and vision are specialised, and most AI apps never touch them. If you do not have a clear use case, the time is better spent on the fundamentals or on RAG and agents. This is an add-on, not a core skill.

What you'll learn

The course splits into the two halves of multimodal work. On the generation side, you use DALL-E to create and edit original images. On the understanding side, you use GPT-4 Vision to analyse and interpret images the app receives. Both are shown in the context of AI-powered apps, so the focus is on wiring the capability in rather than admiring the output.

Who it's for, and who should skip it

It fits developers building AI apps that specifically need to generate or read images, and anyone curious about multimodal models. It is a clean, short introduction to a specialised area.

Skip it if you have no image use case; it will not generalise to text-only apps. Newcomers should also build the AI fundamentals first, since this assumes you can already wire a model into an app.

Prerequisites

JavaScript and basic experience calling an AI model from code. The AI Engineering fundamentals are the right grounding before adding multimodal work.

Where it fits

This is an optional, specialised stop on the AI Engineer Path. It is not a milestone everyone needs; take it when a project calls for images. It sits alongside the other applied AI courses rather than before them.

Free or Pro

This is a Pro course requiring a Scrimba subscription. Pro also covers the full AI Engineer Path, the challenges, the Discord, and certificates. See current plans for pricing in your region.

Strengths and limits

What it does well: it covers both generation and vision in one short, practical hour with a clear instructor.

Where it is limited: it is highly specialised, short, and irrelevant to the many AI apps that never touch images, so it only pays off with a real use case.

Intro to AI Engineering, the fundamentals to take first
Learn RAG, for grounding text-based AI apps
Learn AI Agents, for AI that acts rather than describes
Intro to Mistral AI, a free way to practise LLM app patterns

Is this course free?

What does it cover?

Do I need prior AI experience?

Who teaches it?

Is it worth taking if my app has no images?

View Intro to Dall-E and GPT Vision on Scrimba

Quick answer​

Is it worth your time?​

What you'll learn​

Who it's for, and who should skip it​

Prerequisites​

Where it fits​

Free or Pro​

Strengths and limits​

Related courses and comparisons​

Related Guides

All Courses