Eric Berget

Blog Details

Post :

Eric Berget

Date :

December 11, 2024

Category :

Generative AI

As we reach the midpoint of 2024, AI continues to evolve at a rapid pace. The advancements in generative AI are particularly transformative, offering new tools and capabilities that are revolutionizing a wide variety of fields and industries. L&D is no exception. As the Creative Solutions Lead at ttcInnovations, I’ve had the privilege of exploring some of the most cutting-edge generative AI technologies available today.

‍

Wow #1 :
MidJourney

MidJourney has redefined what’s possible in digital imagery, offering an unprecedented level of quality and versatility. This AI-driven jpg-maker generates stunning visuals based on simple text prompts, making it a game-changer for learning experience designers. Using MidJourney, I’ve been able to elevate the visual appeal of my scenario-based custom eLearning modules, creating more engaging and immersive experiences for learners. The tool’s ability to not only execute complex visual requests, but to do so with continually ever-improving aesthetic instincts, is nothing short of magical.

Want to create wow-worthy learning experiences? Watch our on-demand webinar to seamlessly AI-generated images into your content development workflow.

‍

With Midjourney you can direct the feeling, style, or essence of an image with shocking creative clarity,

But more specifically, where is the wow?

My observation thus far is that the true power of Midjourney (in terms of image conjuring) is image quality – not image specificity. This is not a deus ex machina that will immediately solve any and all visual needs with one push of the button. Even the best image generator still gets rudimentary object details or dimensions (think: 5 legged chair) askew from time to time.

Midjourney is generating less six-fingered hands than earlier versions, but object specificity is still questionable. On the other “hand,” the wow resides within the creative control of image quality. Mood, tone, lighting, style are all levers ready to be pulled. You can direct the feeling, style, or essence of an image with shocking creative clarity.

Wow #2:
Text-to-Video from Sora and Runway

Sora looks to be a game-changer in video generation, leveraging generative AI to create realistic and dynamic videos from text descriptions. Despite not being able to actually use this functionality yet, seeing examples of this text to video capability was unquestionably a wow moment. Though I have not been able to test drive this platform, the output samples from BETA users have elicited more than one audible “wows” from me.

I’m not yet thinking of Sora as a replacement for video production—I’m imagining Sora as a replacement stock imagery replacement. A way to breathe life into what I might have otherwise designed with a still image. Although many of the demo samples for video generation are weird and sometimes extreme, to wield these video tools well, subtly will be key. Quick, easy video snippet to aid visual communication.

Comparatively, Runway AI a “high-fidelity, controllable video generation” tool just announced Introducing Gen-3 Alpha and it looks to be on par with Sora.

‍

Wow #3
Generating 360-degree images with Blockade Labs.

Blockade Labs has taken the concept of virtual spaces to a new level with their 360° image generator. By simply inputting text prompts, users can create detailed virtual environments quickly and easily. This functionality is particularly exciting when integrated with tools like Articulate Storyline, which supports 360° images.

Incorporating these immersive environments into eLearning modules offers a refreshing change of pace, providing learners with an interactive and engaging way to explore content. Whether it’s a simulated workplace environment. Though the use cases for 360-degree User Experience are somewhat limited, the potential for delight and surprise is high.

There are rumors that Midjourney is also working on 360° image generation. Imagine 360-degree image generation with Midjourney level image quality dropped right into Articulate Storyline with expanded click and reveal interactive customization. That would open up incredible creative learning design opportunities.

‍

Wow #4 – Magnific AI

Magnific AI is an under-the-radar gem that brings unprecedented detail and realism to digital imagery. Although not widely talked about, (likely due to its cost) this tool has three wow-level capabilities. ZoomandEnhance (that’s my term) and Style Transfer.

Magnific AI’s ability to enhance images with lifelike precision is a significant leap forward for eLearning content. As AI technology continues to evolve, tools like Magnific AI will undoubtedly play a crucial role in shaping the future of visual storytelling, enabling designers to create more immersive and realistic learning experiences.

‍

Wow #5 – Generative Fill Functionality in Photoshop

Photoshop’s generative fill functionality is perhaps the most mainstream of the emerging generative AI capabilities. This feature allows designers to seamlessly add, remove, or alter elements within an image based on simple text prompts.

For learning designers, this means less time spent on tedious photo editing tasks and more time focusing on creating impactful content. The ease and precision with which generative fill operates make it an indispensable tool for anyone looking to streamline their workflow and enhance their visual designs.

‍

Bonus Wow Moment:
Generative Audio

Generative audio is another exciting frontier, with tools like Suno and Udio for music and ElevenLabs for realistic voiceover (VO) generation leading the charge. Udio allows designers to create custom soundtracks tailored to their eLearning content, while ElevenLabs produces lifelike voice overs that can be customized in terms of tone, pitch, and style. The text to speech capability was an improvement, but still too often robotic. The real “wow” moment was Speech to Speech functionality.

I had some fun making songs for my (homeschool) kids' history lessons. They're learning about the renaissance.

I made a song to help them remember "Ad Fontes" (i.e. "Back to the Sources!)

https://suno.com/song/e326ced4-a9fb-4ca0-81ab-c82a19197d7f

‍

Summary

These five generative AI tools represent the cutting edge of what’s possible in eLearning design. Each tool brings unique capabilities that enhance storytelling and engagement, from MidJourney’s stunning visuals to Sora’s dynamic video generation, Blockade Labs’ immersive environments, Magnific AI’s detailed imagery, and Photoshop’s innovative editing features.

As learning designers, our role is to harness these tools to delight and inspire learners; to create compelling narratives and engaging learning experiences. The future of generative AI in eLearning is incredibly bright, promising even more advanced and intuitive tools on the horizon. A place where the barrier between what’s in your imagination and the screen output, is nearly invisible. Explore these new technologies, experiment with their capabilities, and share your own wow moments. Be creative. Be inspired. Play. Together, we can push the boundaries of what’s possible in eLearning design.

‍

Blog Details

Wow #1 :MidJourney

Wow #2:Text-to-Video from Sora and Runway

Wow #3 Generating 360-degree images with Blockade Labs.