The Los Angeles Post
California & Local U.S. World Business Lifestyle
Today: December 23, 2024
Today: December 23, 2024

Google’s new AI tool uses image prompts instead of text

Two women look at their mobile phones displaying the logo of Google in Ankara, Turkiye on September 3.Dilara Irem Sancar/Anadolu/Getty Images via CNN Newsource
December 17, 2024

(CNN) — Google’s newest artificial intelligence tool, “Whisk,” lets people upload photos to get back a combined, AI-generated image – even without users inputting any text to explain what they want.

Users can input images depicting subjects, setting and style before Whisk combines everything into one image.

Whisk is a “creative tool” for quick inspiration, Google said in a blog post, as opposed to a “traditional image editor.” In essence, Whisk is intended as a fun AI feature, rather than as something that’s supposed to be refined professional work.

Big Tech companies like Google and OpenAI are racing to release consumer products that can showcase uses for the snazzy new technology, even as naysayers warn that the lack of guardrails around the development of AI poses dangers for humanity.

Since OpenAI initially launched its text-to-image creation tool, Dall-E, in 2021, the concept of AI-generated artwork has swamped social media and become a focus of consumer products. Google’s Whisk is an image-to-image generator, building upon the popular concept of text-to-image generators.

People using Whisk can “remix” the final image by editing their inputs and mixing the categories to produce different images like a plushie toy, enamel pin or sticker. Users can add in text if they want to direct certain details, but it is not required to create an image.

“Whisk is designed to allow users to remix a subject, scene and style in new and creative ways, offering rapid visual exploration instead of pixel-perfect edits,” Thomas Iljic, a director of product management at Google Labs, said in a statement.

Google’s Whisk is built upon the generative AI developed by DeepMind, the AI lab that Google acquired in 2014.

Whisk works by using Google’s core AI offering, Gemini, which debuted in December 2023, and pairing it with Imagen 3, the latest text-to-image generator released by DeepMind in December.

When users upload their images, Gemini generates a caption which is fed into Imagen 3. The process captures the “essence” of the subject as opposed to an exact replica, which allows for remixing the final image but also means the end product might stray from the prompt.

For example, the generated image might have a different height, hairstyle or skin tone as the prompt images, Google said in a blog post.

When Google first rolled out Gemini’s text-to-image creator in February, the company faced initial backlash because the tool produced historically inaccurate images.

Whisk is first available as a website on Google Labs for users in the US and is in its early stages of development, the company said.

OpenAI also recently released a text-to-video generator called Sora, highlighting the competition for consumer products.

Dan Ives, managing director and senior equity analyst at Wedbush Securities, told CNN that Whisk is another “flex the muscles moment” for Google in the AI and tech race.

“DeepMind is a key asset for Google,” Ives said, noting that AI products are a part of Google’s “treasure chest” of new products for 2025, which also include a new Android operating system built in collaboration with Samsung and Qualcomm.

The-CNN-Wire
™ & © 2024 Cable News Network, Inc., a Warner Bros. Discovery Company. All rights reserved.

Related

Arts|Entertainment|US

What is a biblically accurate angel? And do you need one to top your Christmas tree?

About 7 in 10 U.S. adults say they believe in angels, but what, exactly, is an angel

What is a biblically accurate angel? And do you need one to top your Christmas tree?
Arts|Business|Entertainment|Lifestyle

Lovestruck Books opens in Cambridge, creating a community for romance readers

Lovestruck Books opens in Cambridge, creating a community for romance readers

Lovestruck Books opens in Cambridge, creating a community for romance readers
Arts|Crime|Entertainment|Political|US

How a European industrial rock band opposed to violence got tied to school shootings in America

How a European industrial rock band opposed to violence got tied to school shootings in America

How a European industrial rock band opposed to violence got tied to school shootings in America
Arts|Education|Political|US

Her surprise bestseller offers a holiday message Americans need to hear

Her surprise bestseller offers a holiday message Americans need to hear

Her surprise bestseller offers a holiday message Americans need to hear
Share This

Popular

Arts|Asia|Entertainment|Political|World

Makers of Taiwan's 'Zero Day' TV series set around invasion fear backlash from China

Makers of Taiwan's 'Zero Day' TV series set around invasion fear backlash from China
Americas|Arts|Entertainment|World

Santa braves the sticky heat of the Amazon jungle to bring gifts to children in Brazilian village

Santa braves the sticky heat of the Amazon jungle to bring gifts to children in Brazilian village
Arts|Europe|Travel

Rome's iconic Trevi Fountain reopens after renovation work in time for the Jubilee Holy Year

Rome's iconic Trevi Fountain reopens after renovation work in time for the Jubilee Holy Year
Arts|Entertainment|Europe|Travel|World

Rome's Trevi Fountain restored in time for Jubilee year

Rome's Trevi Fountain restored in time for Jubilee year