FLUX.1：初次印象_AI阅读总结 — 包阅AI

包阅导读总结

1. `FLUX.1`、`AI 模型`、`图像生成`、`Flow 匹配`、`独特优势`

2. FLUX.1 是新的 AI 图像生成模型，采用“Flow 匹配”技术，在文本转图像方面有独特优势，能准确表达、把握光影和纹理等，还能理解艺术风格和创作复杂场景，其“Flow”美学给图像带来独特动感。

– FLUX.1 是新的 AI 模型，可在 Replicate 上使用，采用“Flow 匹配”技术而非常见的扩散技术生成图像。

– “Flow 匹配”直接学习将噪声映射为真实图像的精确变换，与扩散模型逐渐去噪的方法不同，在速度和控制上有优势。

– 对各种提示生成的图像观察：

– 能较好处理文字到视觉的转化，如精确的文本渲染。

– 对光、影和纹理理解出色。

– 能掌握多种艺术风格，进行创新诠释。

– 擅长创作复杂场景，构图合理。

– 其“Flow”美学带来独特的有机动感和流体感。

– 文中图像均由优化速度和本地执行的 FLUX.1 [schnell] 生成，鼓励大家在 Replicate 上尝试。

思维导图：

文章地址：https://replicate.com/blog/flux-first-impressions

文章来源：replicate.com

作者：Replicate’s blog

发布时间：2024/8/2 18:21

语言：英文

总字数：752字

预计阅读时间：4分钟

评分：90分

标签：人工智能图像生成,流匹配技术,文本到图像模型,复制平台,创意人工智能应用

以下为原文内容

本内容来源于用户推荐转载，旨在分享知识与观点，如有侵权请联系删除联系邮箱 media@ilingban.com

FLUX.1 is a new AI model (available on Replicate) that makes images from text. Unlike most text-to-image models, which rely on diffusion, FLUX.1 uses an upgraded technique called “flow matching.”

While diffusion models create images by gradually removing noise from a random starting point, flow matching takes a more direct approach, learning the precise transformations needed to map noise onto a realistic image. This difference in methodology leads to a distinct aesthetic and unique advantages in terms of speed and control.

We were curious to see how this approach impacts the generated images, so we fed it a variety of prompts, many created by other AI models. Here are some observations:

Text: It gets it (mostly)

One of the challenges in text-to-image generation is accurately translating words into visual representations. FLUX.1 handles this surprisingly well, even in complex scenarios like memes.

Prompt:

Photograph of letterpress serif type on thick rough creamy paper saying ‘REPLICATE.COM’

Photograph of letterpress serif type on thick rough creamy paper saying 'REPLICATE.COM'

This image of letterpress type highlights how FLUX.1 can combine precise text rendering with its “flow” aesthetic. The letters are crisp, the ink looks wet. The paper is less convincing.

Prompt:

A meme of a famous actor making a funny face with the text ‘When you forget your lines’ in a quirky font

A meme of a man who looks like an actor making a funny face with the text 'When y forget your lines' [sic] in a quirky font

While it didn’t quite nail a specific actor’s likeness, this meme shows that FLUX.1 understands the concept. Just look at his face.

Prompt:

This is fine dog meme underwater. Text: ‘Climate change is fine’

A white4 cartoon dog sitting on a brown armchair in the middle of a room that is half-filled with water. Text at the top in bold white font says 'Climate change is fine.' and at the bottom 'is fine.'

The “This is fine” dog meme, now underwater, is a perfect example of FLUX.1’s ability to seamlessly blend text into an image. Well, near-seamlessly. Is fine.

Light and texture look good

FLUX.1 consistently generates high-quality images with a keen understanding of light, shadow, and texture.

Prompt:

A detailed image of a garden where the flowers are made of delicate glass, reflecting the sunlight beautifully

A close-up photograph-like image of clear glass flowers in sunlight, with other non-glass orange and purple flowers out of focus in the background

These glass flowers demonstrate how FLUX.1 grasps the interplay of light and material. The focus is not simply on the texture of glass, but on how light refracts and transmits through the petals, creating a luminous effect.

Prompt:

Owl feathers merging with autumn leaves in wind

A photograph-like image of an owl looking away to the right of the frame, sitting on a branch with orange autumn leaves. Some leaves are flying away in the wind.

FLUX.1 captures fine detail with precision. Notice how the owl feathers and autumn leaves are rendered with organic, natural textures.

Artistic styles: More than mimicry

FLUX.1 doesn’t just imitate artistic styles; it seems to grasp their underlying principles, allowing for creative reinterpretations.

Prompt:

A cubist interpretation of a famous superhero in action

A superhero with the Superman S on his chest and a Batman mask on, with his hands up in a boxer position. The background has blocky shapes in gray, yellow, and blue.

This cubist rendition of a superhero showcases FLUX.1’s ability to apply artistic principles to diverse subjects.

Prompt:

watercolor of famous wave painting

A stylized blue wave cresting, with a white background, inspired by Katsushika Hokusai's painting Great Wave off Kanagawa.

This “watercolor” version of Hokusai’s Great Wave off Kanagawa offers intriguing insights into FLUX.1. Not only does it suggest the iconic wave is part of the model’s training data, but it also highlights how the “flow” technique approximates the movement of pigments through water, paper, and ink.

Compositions: Making sense of the scene

FLUX.1 excels at composing complex scenes, placing objects and characters in a way that feels both believable and visually engaging.

Prompt:

A realistic image of an enchanted library where books float in mid-air and the shelves are made of ancient, twisted roots.

An illustration of an enchanted library where books float in mid-air and the shelves are made of ancient, twisted roots. There are ornate windows at the far end.

This enchanted library, with its trees growing through the bookshelves and books suspended in mid-air, showcases FLUX.1’s ability to create believable yet fantastical environments.

Prompt:

A realistic photo of a giant coffee cup being used as a hot tub by a group of friends.

A photograph-like image of a giant orange teacup on an outdoor patio with some green plants behind it and a foggy white background. There are three men and one woman sitting inside the teacup in hot steaming water.

FLUX.1 effortlessly captures the absurdity of a giant coffee cup hot tub. The scene is well-composed, with a clear sense of scale and playful interaction between the characters.

“Flow”: A new visual language

Perhaps the most striking aspect of FLUX.1 is its “flow” aesthetic, a consequence of the underlying flow matching technique. This gives the images a unique sense of organic movement and fluidity, almost as if the pixels themselves are in motion.

Prompt:

Dog with swirling, Van Gogh-style fur patterns

An illustration of the head of a small white dog with pointed ears and brown eyes, with a background of blue and yellow swirls.

The energy in this dog’s fur is almost tangible, blending directly into the whorls of paint suggestive of Starry Night.

The “flow” aesthetic is difficult to define but immediately recognizable. It evokes traditional artistic techniques like oil painting and airbrushing, imbuing the images with a dreamlike quality that sets FLUX.1 apart.

Ready to Explore the Flow?

All the images in this post were generated with FLUX.1 [schnell], a version optimized for speed and local execution.

FLUX.1 [schnell] is an exciting new tool for artists, developers, and anyone interested in exploring the potential of AI image generation. Try it out on Replicate and see what you can create.

An oversaturated image of a brain floating on a black background, with white text outlined in black in front of the brain that says 'Using AI for work / Using AI for art / AI for art / Using AI for memes / Becoming the meme'

分类

以下为原文内容

Text: It gets it (mostly)

Light and texture look good

Artistic styles: More than mimicry

Compositions: Making sense of the scene

“Flow”: A new visual language

Ready to Explore the Flow?