Meet “Muse”, a text-to-image generating model from GoogleAI

Amongst the current AI-trend sweeping the internet, Google has released a new text-to-image generating tool called Muse. The transformer-based AI image creator is able to create high-quality images at record speed. The new tool from Google AI is faster and more efficient than many competitors. But, what exactly does that mean?
A group from Google Research introduced Muse as a tool that’s on par with most current models. However, the researchers went on to say that Muse is significantly more efficient than existing diffusion models like Stable Diffusion, Dalle-E 2 and even Google Parti. However, what is this efficiency scale based on?
During research and extensive testing by GoogleAI, the researchers found that Muse delivers similar quality images, much faster. When evaluated, Muse was compared to Parti-3B and Imagen. Against these competitors, Muse was able to produce images matching them in quality, variety and text alignment. However, Muse stood out as significantly faster than these competitors.
Muse has a 1.3 second generation time to create images compared to Stable Diffusion which has a 3.7 second generation time. This is a significant difference, making Muse much faster.
The research team was able to achieve this speed in Muse using a compressed discrete latent space and parallel decoding. Regarding text comprehension, Muse uses a frozen T5 language model. This means it fully processes a text prompt instead of focussing on certain words or phrases. This makes it easier to get ahead with using AI-driven tools.
Muse also sports a new architecture that changes how images are adjusted or edited. This new range of image editing applications makes it easier to edit images using text prompts. This makes it possible to make changes in your generated images without needing to use complex masks, but instead you just use prompts.
During independent testing by other human applicants, Muse images were rated as better suited to text input than Stable Diffusion 1.4. This was the general consensus in 70.6% of testers. Testers also found that Muse is above average when it comes to incorporating predefined words into images. Muse has also shown to be more accurate in its composition than many competitors. This means it’s able to display image elements from the prompt more exactly, ie. three wine bottles or five yellow boxes.
The Muse team has pointed out that depending on the unique use case there is the ‘potential for harm’. This is not strange when it comes to scientific work on AI systems, especially when it relates to language and images. The tool could be used to reproduce social biases or spread misinformation if used maliciously. As such, the team has decided not to publish the code for Muse. They’ve also held off on releasing a publicly available demo as Muse is a closed model at the moment. If you’re looking for a good image-to-text AI generator in the meantime, why not have a look at ChatGPT? This AI-driven tool can help with more than just images.
Advertisement
Uhh, this has already been possible – I am not sure how but remember my brother telling me about it. I’m not a whatsapp user so not sure of the specifics, but something about sending the image as a file and somehow bypassing the default compression settings that are applied to inbound photos.
He has also used this to share movies to whatsapp groups, and files 1Gb+.
Like I said, I never used whatsapp, but I know 100% this isn’t a “brand new feature”, my brother literally showed me him doing it, like… 5 months ago?
Martin, what happened to those: 12 Comments (https://www.ghacks.net/chatgpt-gets-schooled-by-princeton-university/#comments). Is there a specific justifiable reason why they were deleted?
Hmm, it looks like the gHacks website database is faulty, and not populating threads with their relevant cosponsoring posts.
The page on ghacks this is on represents the best of why it has become so worthless, fill of click-bait junk that it’s about to be deleted from my ‘daily reads’.
It’s really like “Press Release as re-written by some d*ck for clicks…poorly.” And the subjects are laughable. Can’t wait for “How to search for files on Windows”.
> The page on ghacks this is on represents the best of why it has become so worthless, fill of click-bait junk…
Sadly, I have to agree.
Only Martin and Ashwin are worth subscribing to.
Especially Emre Çitak and Shaun are the worst ones.
If ghacks.net intended “Clickbait”, it would mark the end of Ghacks Technology News.
Ghacks doesn’t need crappy clickbaits. Clearly separate articles from newer authors (perhaps AIs and external sales person or external advertising man) as just “Advertisements”!
We, the subscribers of Ghacks, urge Martin to make a decision.
because nevermore wants to “monetize” on every aspect of human life…
“Threads” is like the Walmart of Social Media.
How hard can it be to clone a twitter version of that as well? They’re slow.
Yes, why not mention how large the HD files can be?
Why, not mention what version of WhatsApp is needed?
These omissions make the article feel so bare. If not complete.
Sorry posted on the wrong page.
such a long article for such a simple matter. Worthless article ! waste of time
I already do this by attaching them via the ‘Document’ option.
I don’t know what’s going on here at Ghacks but it’s obvious that something is broken, comments are being mixed whatever the article, I am unable to find some of my later posts neither. :S
Quoting the article,
“As users gain popularity, the value of their tokens may increase, allowing investors to reap rewards.”
Besides, beyond the thrill and privacy risks or not, the point is to know how you gain popularity, be it on social sites as everywhere in life. Is it by being authentic, by remaining faithful to ourselves or is it to have this particular skill which is to understand what a majority likes, just like politicians, those who’d deny to the maximum extent compatible with their ideological partnership, in order to grab as many of the voters they can?
I see the very concept of this Friend.tech as unhealthy, propagating what is already an increasing flaw : the quest for fame. I won’t be the only one to count himself out, definitely.
@John G. is right : my comment was posted on [https://www.ghacks.net/2023/08/23/what-is-friend-tech/] and it appears there but as well here at [https://www.ghacks.net/2023/07/08/how-to-follow-everyone-on-threads/]
This has been lasting for several days. Fix it or at least provide some explanations if you don’t mind.
> Google Chrome is following in Safari’s footsteps by introducing a new feature that allows users to move the Chrome address bar to the bottom of the screen, enhancing user accessibility and interaction.
Firefox did this long before Safari.
Basically they’ll do anything except fair royalties.