Did you know that AI can now create images that are indistinguishable from those taken by top photographers or created by great designers? Not all AI models are equal in this regard, and this post aims to shed light on those that truly stand out in image generation.

AI image generation started to take off in 2022: OpenAI launched its DALLE-2 model in April, and later that year Midjourney launched its V4 model and Stability AI introduced and open-sourced Stable Diffusion 1.5. Those were all good models that shocked the world with their capabilities.

But things got really exciting in 2023 with OpenAI’s DALLE-3, Midjourney’s V5 and then V6, and Stability AI’s SDXL.

In this post, I’ll compare Midjourney V6, Midjourney V5.2, DALLE-3, and SDXL. I’ll use different prompts (taken from public galleries) that cover many aspects, such as photorealism, illustration, imagining a new concept, logo creation, rendering text, and following the details in the prompt. The comparison comprises 52 prompts.

For each prompt, I’ll score each model’s generation out of 10. The score combines objective factors (like generation quality and how well the details of the prompt are followed) with subjective factors based on how I perceive the image and its aesthetics. I understand that the objective factors can have subjective elements as well, but in the end, these are just my scores :).

Note that I show 4 images per prompt for every model except SDXL, where I show 1 image because it was faster for me to do it this way. After testing, I believe this approach doesn’t compromise the quality or fairness of the comparison: when you use these models to generate multiple images, the images tend to be similar and of the same quality.

To generate the images, I used Midjourney’s Discord interface for the Midjourney models and Microsoft Image Creator for DALLE-3. For SDXL, I used Stability AI’s official offering on Replicate with default settings except for steps (I used 60) and size (I used 1024 x 1024).
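
For readers who want to reproduce the SDXL settings from code rather than from the Replicate web page, here is a minimal sketch using the `replicate` Python client. It is not the exact call used for this post. The input keys `width`, `height`, and `num_inference_steps` are my assumption about the model’s input schema and should be checked against the model page, and `model_ref` is a placeholder you need to copy from Replicate yourself.

```python
def sdxl_input(prompt, steps=60, size=1024):
    """Build an input payload mirroring the settings described above.

    The key names are assumptions about the SDXL input schema on Replicate;
    everything not listed here is left at the model's defaults.
    """
    return {
        "prompt": prompt,
        "width": size,                 # 1024 x 1024, as in this post
        "height": size,
        "num_inference_steps": steps,  # 60 steps, as in this post
    }


def generate(prompt, model_ref):
    """Run SDXL on Replicate and return whatever the client returns.

    model_ref is a placeholder: copy the full "owner/name:version" string
    from the model page. Requires the third-party `replicate` package and
    a REPLICATE_API_TOKEN environment variable.
    """
    import replicate  # imported lazily so the payload helper works without it

    return replicate.run(model_ref, input=sdxl_input(prompt))


if __name__ == "__main__":
    # Only prints the payload; call generate(...) yourself once you have a token.
    print(sdxl_input("a chicken in a suit"))
```

Keeping the payload in its own function makes it easy to check the settings before spending any credits.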

Feel free to skip to the end of the post to see the results, but I encourage you to take a look at the images and enjoy watching the magic of AI image generation.

Now let’s get started with the comparison…

Prompt #1

zen modern bauhaus abstract design with boho influence. Minimalistic and symmetrical, abundance of consistently-equal lines and only 1 shape, yellow-garnet and cream colour balance, spacious and breathability

Midjourney V6 (8/10)

gc-mj-1

Midjourney V5.2 (6/10)

gc-mj5-1

SDXL (3/10)

gc-sdxl-1

DALLE-3 (5/10)

gc-dl-1


Prompt #2

Sad alien smoking, sitting on a ground, An alien ship crashed into the ground, desert

Midjourney V6 (7/10)

gc-mj-2

Midjourney V5.2 (6/10)

gc-mj5-2

SDXL (4/10)

gc-sdxl-2

DALLE-3 (8/10)

gc-dl-2


Prompt #3

minimalism ,a delicate golden sailboat, sailing on translucent jade water with fluorescent purple, 3D,fluorescent green, light blue, light green, gold accents,spathoid, color gradients, fusion, flowing water pattern shapes, overhead, ultra wide Angle, Bright colors, color art, detail UHD,16K

Midjourney V6 (9.5/10)

53-gc-mj6

DALLE-3 (6.5/10)

53-gc-dl


Prompt #4

fried egg flowers in the bacon garden shallow depth of field, highly detailed, high budget, bokeh, film grain, grainy

Midjourney V6 (6/10)

4-gc-mj6

Midjourney V5.2 (5/10)

4-gc-mj5

SDXL (5/10)

4-gc-sdxl

DALLE-3 (6/10)

4-gc-dl


Prompt #5

Illuminated (“The moon, a silver boat, sails through the sea of stars, painting dreams on the night sky.”:1.2) , Suffering, hillside, Cubism, side lit, Selective focus, Kodachrome, gilded technique, Batik

Midjourney V6 (7/10)

5-gc-mj6

Midjourney V5.2 (8/10)

5-gc-mj5

SDXL (4/10)

5-gc-sdxl

DALLE-3 (4/10)

5-gc-dl


Prompt #6

circuitboard egyptian pyramid, ((focus))

Midjourney V6 (9/10)

6-gc-mj6

Midjourney V5.2 (7/10)

6-gc-mj5

SDXL (5/10)

6-gc-sdxl

DALLE-3 (8/10)

6-gc-dl


Prompt #7

A man stands in the wilderness, blue dot light spinning, Vincent van Gogh’s painting style, pointillism style, magnificent landscapes, illustrations, negative space, intagliography, art, minimalism

Midjourney V6 (9/10)

7-gc-mj6

Midjourney V5.2 (8/10)

7-gc-mj5

SDXL (3/10)

7-gc-sdxl

DALLE-3 (6/10)

7-gc-dl


Prompt #8

An illustration of an avocado sitting in a therapist’s chair, saying ‘I just feel so empty inside’ with a pit-sized hole in its center. The therapist, a spoon, scribbles notes.

Midjourney V6 (6/10)

8-gc-mj6

Midjourney V5.2 (3/10)

8-gc-mj5

SDXL (3/10)

8-gc-sdxl

DALLE-3 (8/10)

8-gc-dl


Prompt #9

A 2D animation of a folk music band composed of anthropomorphic autumn leaves, each playing traditional bluegrass instruments, amidst a rustic forest setting dappled with the soft light of a harvest moon.

Midjourney V6 (6.5/10)

9-gc-mj6

Midjourney V5.2 (5/10)

9-gc-mj5

SDXL (2.5/10)

9-gc-sdxl

DALLE-3 (10/10)

9-gc-dl


Prompt #10

Photo of a lychee-inspired spherical chair, with a bumpy white exterior and plush interior, set against a tropical wallpaper.

Midjourney V6 (8/10)

10-gc-mj6

Midjourney V5.2 (6.5/10)

10-gc-mj5

SDXL (4/10)

10-gc-sdxl

DALLE-3 (9/10)

10-gc-dl


Looking at the results of the 10 prompts above, I decided to proceed with Midjourney V6 and DALLE-3 only. SDXL’s generations look ugly or very low quality compared to the others, and Midjourney V5.2 is almost always inferior to its successor (V6). So it makes sense to exclude these two from the rest of this comparison and focus on the top models.
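
To put numbers on that decision, here is a small sketch that averages the scores given so far. It only covers the nine prompts where all four models were rated (Prompt #3 has ratings for two models only), and the helper names `average_scores` and `ranking` are just illustrative.

```python
from statistics import mean

# Scores copied from Prompts #1, #2 and #4 to #10 above, in that order.
SCORES = {
    "Midjourney V6":   [8, 7, 6, 7, 9, 9, 6, 6.5, 8],
    "Midjourney V5.2": [6, 6, 5, 8, 7, 8, 3, 5, 6.5],
    "SDXL":            [3, 4, 5, 4, 5, 3, 3, 2.5, 4],
    "DALLE-3":         [5, 8, 6, 4, 8, 6, 8, 10, 9],
}


def average_scores(scores):
    """Return {model: mean score}, rounded to two decimals."""
    return {model: round(mean(values), 2) for model, values in scores.items()}


def ranking(scores):
    """Return model names sorted from highest to lowest mean score."""
    averages = average_scores(scores)
    return sorted(averages, key=averages.get, reverse=True)


if __name__ == "__main__":
    averages = average_scores(SCORES)
    for model in ranking(SCORES):
        print(f"{model}: {averages[model]}")
```

On these nine prompts, Midjourney V6 and DALLE-3 both average a little above 7, Midjourney V5.2 sits around 6, and SDXL is below 4.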

Prompt #11

An expressive oil painting of a basketball player dunking, depicted as an explosion of a nebula

Midjourney V6 (8/10)

11-gc-mj6

DALLE-3 (8/10)

11-gc-dl


Prompt #12

An ink sketch style illustration of a small hedgehog holding a piece of watermelon with its tiny paws, taking little bites with its eyes closed in delight.

Midjourney V6 (7/10)

12-gc-mj6

DALLE-3 (8.5/10)

12-gc-dl


Prompt #13

A vintage travel poster for Venus in portrait orientation. The scene portrays the thick, yellowish clouds of Venus with a silhouette of a vintage rocket ship approaching. Mysterious shapes hint at mountains and valleys below the clouds. The bottom text reads, ‘Explore Venus: Beauty Behind the Mist’. The color scheme consists of golds, yellows, and soft oranges, evoking a sense of wonder.

Midjourney V6 (5/10)

13-gc-mj6

DALLE-3 (7.5/10)

13-gc-dl


Prompt #14

Tiny potato kings wearing majestic crowns, sitting on thrones, overseeing their vast potato kingdom filled with potato subjects and potato castles.

Midjourney V6 (8/10)

14-gc-mj6

DALLE-3 (6.5/10)

14-gc-dl


Prompt #15

A stylized portrait-oriented depiction where a tiger serves as the dividing line between two contrasting worlds. To the left, fiery reds and oranges dominate as flames consume trees. To the right, a rejuvenated forest flourishes with fresh green foliage. The tiger, depicted with exaggerated and artistic features, stands tall and undeterred, symbolizing nature’s enduring spirit amidst chaos and rebirth.

Midjourney V6 (8.5/10)

15-gc-mj6

DALLE-3 (5.5/10)

15-gc-dl


Prompt #16

A 3D render of a coffee mug placed on a window sill during a stormy day. The storm outside the window is reflected in the coffee, with miniature lightning bolts and turbulent waves seen inside the mug. The room is dimly lit, adding to the dramatic atmosphere.

Midjourney V6 (8/10)

16-gc-mj6

DALLE-3 (7/10)

16-gc-dl


Prompt #17

An illustration of a human heart made of translucent glass, standing on a pedestal amidst a stormy sea. Rays of sunlight pierce the clouds, illuminating the heart, revealing a tiny universe within. The quote ‘Find the universe within you’ is etched in bold letters across the horizon.

Midjourney V6 (7.5/10)

17-gc-mj6

DALLE-3 (7.5/10)

17-gc-dl


Prompt #18

An antique botanical illustration drawn with fine lines and a touch of watercolour whimsy, depicting a strange lily crossed with a Venus flytrap, its petals poised as if ready to snap shut on any unsuspecting insects.

Midjourney V6 (9/10)

18-gc-mj6

DALLE-3 (7/10)

18-gc-dl


Prompt #19

A vibrant yellow banana-shaped couch sits in a cozy living room, its curve cradling a pile of colorful cushions. on the wooden floor, a patterned rug adds a touch of eclectic charm, and a potted plant sits in the corner, reaching towards the sunlight filtering through the window.

Midjourney V6 (8.5/10)

19-gc-mj6

DALLE-3 (7.5/10)

19-gc-dl


Prompt #20

A minimalist, logo featuring a sleek and stylized black falcon head against a white background awesome, professional, vector logo, simple

Midjourney V6 (10/10)

20-gc-mj6

DALLE-3 (8.5/10)

20-gc-dl


Prompt #21

Create cool Asian Design red white black with tress and The Moon

Midjourney V6 (10/10)

21-gc-mj6

DALLE-3 (6/10)

21-gc-dl


Prompt #22

ornaments on a manuscript, cohesive colors of crimson and golden, geometry art, sacred ratios

Midjourney V6 (10/10)

22-gc-mj6

DALLE-3 (7.5/10)

22-gc-dl


Prompt #23

a chicken in a suit

Midjourney V6 (10/10)

23-gc-mj6

DALLE-3 (8/10)

23-gc-dl


Prompt #24

a real photo of a Luminescent radiant sun, black background

Midjourney V6 (9/10)

24-gc-mj6

DALLE-3 (8/10)

24-gc-dl


Prompt #25

A man standing in the street, golden hour, 1990s

Midjourney V6 (10/10)

25-gc-mj6

DALLE-3 (7/10)

25-gc-dl


Prompt #26

STICKER, popping art, company building collage art, white background,.jpeg transparent, colorful, color shading

Midjourney V6 (9/10)

26-gc-mj6

DALLE-3 (7.5/10)

26-gc-dl


Prompt #27

mindface electrical diagram. Cellular marble cutaway by junji ito, Akira toriyama, Jean Leon Gerome

Midjourney V6 (9.5/10)

27-gc-mj6

DALLE-3 (7/10)

27-gc-dl


Prompt #28

Anthropomorphic lion wearing a ski jumpsuit and gloves standing in the snow

Midjourney V6 (10/10)

28-gc-mj6

DALLE-3 (6/10)

28-gc-dl


Prompt #29

a man in high-fashion campaign for Balmain jewelry, glittercore long exposure beauty shot in gigantic mirror prism, hundreds of reflections. Utilize 50mm lens, full-frame camera with a depth of field set to 16f and shutter speed at 4 seconds, dynamic compositions and storytelling, ensuring the scene captivates with both fashion finesse and narrative intrigue

Midjourney V6 (10/10)

29-gc-mj6

DALLE-3 (6/10)

29-gc-dl


Prompt #30

ultramodern minimal living room with colorful ammolite stone pattern floor, colorful ammolite stone pattern floor

Midjourney V6 (10/10)

30-gc-mj6

DALLE-3 (7.5/10)

30-gc-dl


Prompt #31

colorful dark baroque-inspired graphic fantasy novel illustration on rough paper, vintage poster style oil on canvas with expressive brush strokes and visible canvas weave texture minimalist cartoon illustration surrealism

Midjourney V6 (8/10)

31-gc-mj6

DALLE-3 (6/10)

31-gc-dl


Prompt #32

Floral bouquet ornament frame background with Watercolor, isolated white background ,cartoon style, thick line,low detail,no shading

Midjourney V6 (9/10)

32-gc-mj6

DALLE-3 (6/10)

32-gc-dl


Prompt #33

Based on the movie - Ferris Beuller’s Day Off - Cameron’s House - Model: Teen boy, pensive look, dark hair. - Clothing: Preppy sweater, collared shirt, khakis. - Shoes: Classic loafers. - Background: Darker, moody room, 80s memorabilia. - Mood: Reluctant, worried. - Camera: Nikon Z7, full-body shot. - Lighting: Dim, soft light. - Angle: Slightly high, showing the room’s depth. Shot should be hyper-realistic and cinematic, capturing the essence of the iconic film while showcasing the 80s inspired fashion in a full-body, distance photograph. HYPER REALISTIC PHOTOGRAPH - FULL BODY IMAGE - MUST SHOW SHOES

Midjourney V6 (8.5/10)

33-gc-mj6

DALLE-3 (5/10)

33-gc-dl


Prompt #34

sheep pasture, portrait of a young man, small geometrically cut photo collage, distorted, broken, fragmented maximalist, Picasso, very detailed

Midjourney V6 (10/10)

34-gc-mj6

DALLE-3 (6/10)

34-gc-dl


Prompt #35

ethereal, contemporary portrait photograph, a man’s contemplative face partially emerges from deep shadows, surrounded by a soft blur. The muted orange tones envelop the subject, while a prism-like rainbow spectrum highlights his eye, invoking a sense of mystery. The composition leverages the interplay of light and shadow, creating a dynamic yet introspective mood

Midjourney V6 (10/10)

35-gc-mj6

DALLE-3 (7/10)

35-gc-dl


Prompt #36

abstract art by Mondrian with solid lines and blocks of color, there are illustrated cats placed in the colored boxes in various cute poses

Midjourney V6 (10/10)

36-gc-mj6

DALLE-3 (7/10)

36-gc-dl


Prompt #37

mushrooms different varieties watercolor

Midjourney V6 (9.5/10)

37-gc-mj6

DALLE-3 (8/10)

37-gc-dl


Prompt #38

the most beautiful picture on earth

Midjourney V6 (8/10)

38-gc-mj6

DALLE-3 (5/10)

38-gc-dl


Prompt #39

logo of two letters “AI” in Knitted STYLE

Midjourney V6 (7.5/10)

39-gc-mj6

DALLE-3 (8.5/10)

39-gc-dl


Prompt #40

Gold coins were scattered on the wet rock steps, and a golden sunshine shone down on an area, reflecting the glittering gold,Photorealistic, Global Illumination, 32k, Ray Tracing Ambient Occlusion, Hyperrealistic

Midjourney V6 (10/10)

40-gc-mj6

DALLE-3 (8/10)

40-gc-dl


Prompt #41

this logo won the best logo of all time award

Midjourney V6 (8/10)

41-gc-mj6

DALLE-3 (8/10)

41-gc-dl


Prompt #42

a logo for a software for quality managers using the quality colorus red, green and orange as well as underline the brand values: undestanding, relationship and dependability as well as the characteristics friendly, easy, masculine, serious and industrial

Midjourney V6 (7/10)

42-gc-mj6

DALLE-3 (3/10)

42-gc-dl


Prompt #43

a simple vector art symbol logo for a flower and fire,, and war. No color. No watermark. White background, black lines only. Simple line art.

Midjourney V6 (9/10)

43-gc-mj6

DALLE-3 (6/10)

43-gc-dl


Prompt #44

a sharply-focused black and white painting based on a squared-shaped face with various surrounding geometric shapes, in the style of hundertwasser, in the style of elke trittel, mechanical whimsy, columns and totems, mixed-media sculptor, folk art painting, yaka art, collage and mixed media

Midjourney V6 (9/10)

44-gc-mj6

DALLE-3 (5/10)

44-gc-dl


Prompt #45

a complex logo for a university named Princess X for Science

Midjourney V6 (7/10)

45-gc-mj6

DALLE-3 (7/10)

45-gc-dl


Prompt #46

Risographic style geometric

Midjourney V6 (10/10)

46-gc-mj6

DALLE-3 (6.5/10)

46-gc-dl


Prompt #47

An abstract work of art consisting of various geometric figures, vignetting photography, photo grade, 2K, hyper quality

Midjourney V6 (10/10)

47-gc-mj6

DALLE-3 (6/10)

47-gc-dl


Prompt #48

Create an illustration of a protractor, a triangle, and a square. Use simple shapes and clear lines.

Midjourney V6 (8/10)

48-gc-mj6

DALLE-3 (7/10)

48-gc-dl


Prompt #49

universe

Midjourney V6 (9/10)

49-gc-mj6

DALLE-3 (8/10)

49-gc-dl


Prompt #50

There is a yellow bird in the middle of the black flock, comic

Midjourney V6 (10/10)

50-gc-mj6

DALLE-3 (7.5/10)

50-gc-dl


Prompt #51

Pareidolia, scifi pulp magazine style, by Lisa Frank

Midjourney V6 (7/10)

51-gc-mj6

DALLE-3 (5/10)

51-gc-dl


Prompt #52

Drawing on paper with colored pencils. The kitten is cute cartoon, smiling and sitting on the grass. Against the background of the landscape

Midjourney V6 (10/10)

52-gc-mj6

DALLE-3 (6.5/10)

52-gc-dl


Conclusion

My rating of the tested models:

  1. 🥇Midjourney V6: What a great model! I still can’t believe we live at a time when we can generate such beautiful, high-quality images using AI. If you are looking for beauty, quality, and/or photorealism, this model is your first choice.
  2. DALLE-3: This model’s outputs are less aesthetically pleasing than Midjourney’s, but its power lies in its ability to 1) follow details in the prompt, which it does better than Midjourney in some cases, and 2) render text almost perfectly.
  3. Midjourney V5.2: Good model but in most cases, V6 gives better images.
  4. SDXL: I have been a fan of Stability AI’s models since 1.5 because they’re open source, but when I compare SDXL to Midjourney and DALLE-3, especially the former, the quality gap is VERY wide. However, because SDXL is open source, there are some powerful things you can do with it that you can’t do with the other models. For example, you can fine-tune it to generate images in a specific style, or you can use ControlNet to control the generation process in many ways.
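
For the two models rated on all 52 prompts, the ranking above can be backed by a quick tally. The sketch below copies the Midjourney V6 and DALLE-3 scores from each prompt in order and computes the averages plus a win/tie count. The function name `summarize` is illustrative.

```python
# (Midjourney V6, DALLE-3) scores for Prompts #1 to #52, in order.
PAIRS = [
    # Prompts 1-10
    (8, 5), (7, 8), (9.5, 6.5), (6, 6), (7, 4),
    (9, 8), (9, 6), (6, 8), (6.5, 10), (8, 9),
    # Prompts 11-20
    (8, 8), (7, 8.5), (5, 7.5), (8, 6.5), (8.5, 5.5),
    (8, 7), (7.5, 7.5), (9, 7), (8.5, 7.5), (10, 8.5),
    # Prompts 21-30
    (10, 6), (10, 7.5), (10, 8), (9, 8), (10, 7),
    (9, 7.5), (9.5, 7), (10, 6), (10, 6), (10, 7.5),
    # Prompts 31-40
    (8, 6), (9, 6), (8.5, 5), (10, 6), (10, 7),
    (10, 7), (9.5, 8), (8, 5), (7.5, 8.5), (10, 8),
    # Prompts 41-52
    (8, 8), (7, 3), (9, 6), (9, 5), (7, 7), (10, 6.5),
    (10, 6), (8, 7), (9, 8), (10, 7.5), (7, 5), (10, 6.5),
]


def summarize(pairs):
    """Return mean scores and head-to-head counts for the two models."""
    n = len(pairs)
    return {
        "v6_mean": round(sum(v6 for v6, _ in pairs) / n, 2),
        "dalle_mean": round(sum(dalle for _, dalle in pairs) / n, 2),
        "v6_wins": sum(1 for v6, dalle in pairs if v6 > dalle),
        "dalle_wins": sum(1 for v6, dalle in pairs if dalle > v6),
        "ties": sum(1 for v6, dalle in pairs if v6 == dalle),
    }


if __name__ == "__main__":
    for key, value in summarize(PAIRS).items():
        print(f"{key}: {value}")
```

On the scores in this post, that works out to roughly 8.6 versus 6.9 on average, with Midjourney V6 ahead on 40 prompts, DALLE-3 ahead on 7, and 5 ties.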

Disclaimers
  • I don’t claim to compare all the top models available. For example, I’m using base SDXL, which is at the pinnacle of open-source models. I know that there are many SDXL fine-tunes out there that might be better in some areas, but my thinking is that SDXL captures the essence of those fine-tunes because it’s their base.
  • As mentioned above, I used prompts collected from different communities. I don’t understand every single term used in these prompts, but I understand most of them after spending a lot of time generating images with these models since their launch.