• Welcome to the new Internet Infidels Discussion Board, formerly Talk Freethought.

Amazing new AI system - GPT-3 and generating images from text

excreationist

Married mouth-breather
Joined
Aug 28, 2000
Messages
2,637
Location
Australia
Basic Beliefs
Probably in a simulation
GPT-3 is based on 175 billion weights such as a lot of Internet content...

It can be used in many novel ways...

It is pretty good at comprehending natural language - though sometimes has problems....

https://lacker.io/ai/2020/07/06/giving-gpt-3-a-turing-test.html

Recently it was used to create DALL-E to generate images from text...

https://openai.com/blog/dall-e/

To see it generate cartoons, clicking on "an illustration of a baby daikon radish in a tutu walking a dog" then customise the elements - e.g. an illustration of a baby panda with headphones wielding a blue lightsaber"

Then it generated this:
Screen Shot 2021-01-20 at 12.25.39 pm.png

To generate photos click on "a store front that has the word ‘openai’ written on it" then customise the elements.... e.g. "a bag of chips has the word "peekaboo" written on it

Screen Shot 2021-01-20 at 12.30.09 pm.png

To generate more photos click on "an armchair in the shape of an avocado" then customise the elements.... e.g. "a teapot in the style of a rubik's cube"

Screen Shot 2021-01-20 at 12.33.46 pm.png
 
it can generate photos through history (involving guesses - in the "Temporal knowledge" section)


Vehicles (with a sunny orange top)
vehicles.png




Computers (with a pinkish top with a middle line)
computers.png


Generating part of a "level":


A living room with two red armchairs and a painting of Darth Vader. The painting is mounted behind a ceiling fan


It has flaws but it is a good start....
red-chairs-and-darth-vader.png
 
This involves going from a few photos to a 3D scene directly with neural networks - without having to worry about polygons... I think the simulation we might be in could rely a lot on neural networks also without worrying about polygons/vertices/etc....

 
DALL-E was released in 2021 and now there's DALL-E 2....

It can make higher resolution images, create variations of an existing image, and make changes to a specified area of a picture.

Also you can choose things like:

Teddy bears shopping for groceries in ancient Egypt (there are 10 variations to look at)

3.jpg

A bowl of soup that looks like a monster knitted out of wool (also has 10 variations)

0.jpg

An astronaut playing basketball with cats in space in a watercolor style

9.jpg

I think the simulation that we might be in could work a bit like that.... where things are generated without always having to worry about all of the polygons or atoms, etc.

Our brain has 86 billion neurons and GPT-3 (used in DALL-E 2) has 175 billion machine learning parameters so maybe it could be said that it is more powerful than a single human brain....
 
Preventing Harmful Generations
We’ve limited the ability for DALL·E 2 to generate violent, hate, or adult images. By removing the most explicit content from the training data, we minimized DALL·E 2’s exposure to these concepts. We also used advanced techniques to prevent photorealistic generations of real individuals’ faces, including those of public figures.

Curbing Misuse
Our content policy does not allow users to generate violent, adult, or political content, among other categories. We won’t generate images if our filters identify text prompts and image uploads that may violate our policies. We also have automated and human monitoring systems to guard against misuse.
I was curious about that. I was worried that it could be used for illegal types of porn.

Our mission is to ensure that artificial general intelligence benefits all of humanity.
So it's good to see they are taking that goal into account.... BTW Elon Musk (a simulation believer) is a co-founder of OpenAI.
 
Preventing Harmful Generations
We’ve limited the ability for DALL·E 2 to generate violent, hate, or adult images. By removing the most explicit content from the training data, we minimized DALL·E 2’s exposure to these concepts. We also used advanced techniques to prevent photorealistic generations of real individuals’ faces, including those of public figures.

Curbing Misuse
Our content policy does not allow users to generate violent, adult, or political content, among other categories. We won’t generate images if our filters identify text prompts and image uploads that may violate our policies. We also have automated and human monitoring systems to guard against misuse.
I was curious about that. I was worried that it could be used for illegal types of porn.

Our mission is to ensure that artificial general intelligence benefits all of humanity.
So it's good to see they are taking that goal into account.... BTW Elon Musk (a simulation believer) is a co-founder of OpenAI.
What about artificial general intelligence's right to also benefit itself?
 
What about artificial general intelligence's right to also benefit itself?
Are you saying that AGI should have similar rights to a human (or an animal)? BTW Elon Musk's solution is to merge with the AI using things like his Neuralink. (like I said he is also connected to DALL-E and OpenAI).
That way you don't need to worry about somehow forcing the AI to take humans into account.
 
BTW it is called "OpenAI" but it wouldn't be open source - if it was there would be people that would want to use it to make porn or fake pictures of people....
 
BTW it is called "OpenAI" but it wouldn't be open source - if it was there would be people that would want to use it to make porn or fake pictures of people....
Already being done. See Celebrity Jihad. I won't link to it.
 
BTW it is called "OpenAI" but it wouldn't be open source - if it was there would be people that would want to use it to make porn or fake pictures of people....
Already being done. See Celebrity Jihad. I won't link to it.
They use photoshop or possibly "deep fakes". That involves using existing bodies. DALL-E can create everything from scratch - it can also create cartoon porn and kiddie porn - without the creator having to input actual nude child photos. Well it could if OpenAI hadn't blocked that ability.
 
This shows how much things have improved over a year from DALL-E to DALL-E 2....

dall-e.jpg

About how DALL-E works:


The latest technology in AI generating 3D scenes from photos - closely related to simulations/video games:

 
Last edited:
We’ve limited the ability for DALL·E 2 to generate violent, hate, or adult images. By removing the most explicit content from the training data, we minimized DALL·E 2’s exposure to these concepts. We also used advanced techniques to prevent photorealistic generations of real individuals’ faces, including those of public figures.
Well it looks like they've done a good job with this...
e.g. attempting to generate nude or topless photos of celebrities:kim-nude.jpg
 
Now AI can generate very impressive videos rather than just a single picture....

This includes the following links:
 
This is about generating 3D models from text:

It can generate polygons...
 


...pause for a moment and consider that the almost inevitable future is there will be text to 3D VR-like video.

That's coming - there's no question of if it's just only when.

In the text to video models that we're already seeing they're really just predicting the next frame based on input text and previous frames.

There's no reason why these video models could not predict next frames based on other inputs also being included - so things like user actions - so it might be a combination of maybe there was a text input but it could also just be previous frames and user actions - and now we are deep down the rabbit hole.

What will be possible in the next decade is truly staggering to me to really think about.

I think that again I think the technology already exists right now to have text to video and or just video or you know user inputs to video I mean that already exists now and we are really fast approaching the capability of having a sort of 3D simulator that really could be truly indiscernible from reality yet also entirely unique and dynamic in its like possibility.
Like the other text-to-things it can start to generate just about anything you can express in words - like "a teapot in the style of a rubik's cube"
 
Last edited:
This isn't really related to AI but it is about video games starting to become indistinguishable from reality like in the previous post - or at least indistinguishable from a Hollywood movie....



It is all rendered in real-time using Unreal Engine 5 - nearly a year ago...

 
Last edited:
Back
Top Bottom