Posted in

FLUX.1:黑森林实验室推出的新一代图像生成模型_AI阅读总结 — 包阅AI

包阅导读总结

1. 关键词:FLUX.1、Black Forest Labs、图像生成模型、Replicate、性能特点

2. 总结:FLUX.1 是 Black Forest Labs 开发的新开源图像生成模型,可在 Replicate 上使用,具有出色性能,有多种变体和定价,后续将有新功能。

3. 主要内容:

– FLUX.1 是 Black Forest Labs 继 Stable Diffusion 后的新模型

– 可在 Replicate 上运行,示例代码为 JavaScript

– 性能特点包括出色的提示遵循、视觉质量、图像细节和输出多样性

– 能处理复杂文本,准确呈现

– 擅长复杂构图指令

– 生成的手相对较好

– 有三个变体

– FLUX.1 [pro]:性能顶级

– FLUX.1 [dev]:用于非商业,高效

– FLUX.1 [schnell]:速度快,开源

– 定价按图像计算

– 已推出,后续将有关于 FLUX.1 的微调等新功能

思维导图:

文章地址:https://replicate.com/blog/flux-state-of-the-art-image-generation

文章来源:replicate.com

作者:Replicate’s blog

发布时间:2024/8/1 13:53

语言:英文

总字数:435字

预计阅读时间:2分钟

评分:91分

标签:图像生成,AI 模型,黑森林实验室,复制平台,开源


以下为原文内容

本内容来源于用户推荐转载,旨在分享知识与观点,如有侵权请联系删除 联系邮箱 media@ilingban.com

FLUX.1 is a new open-source image generation model developed by Black Forest Labs, the creators of Stable Diffusion. It’s available on Replicate today, and you can run it in the cloud with one line of code.

Here’s an example of how to run FLUX.1 on Replicate using JavaScript:

import Replicate from "replicate";const replicate = new Replicate();const model = "black-forest-labs/flux-dev";const prompt = "Purple striped narwhal devouring a fluffy high-resolution everything bagel";const output = await replicate.run(model, {input: { prompt }});console.log(output);

You can try out FLUX.1 right in your browser, or run it programmatically in your language of choice.

What makes FLUX.1 special?

FLUX.1 models have state-of-the-art performance in prompt following, visual quality, image detail, and output diversity. Here are some particular areas where we’ve been impressed:

Text! Unlike older models that often messed up similar-looking letters, Flux can handle tricky words with repeated letters. This makes it great for designs where text needs to be accurate. Check out this Black Forest Flux Schnell gateau:

Cake with text

Complex composition. Flux is amazing at following complex instructions about where things should go in an image. For example, given the prompt “Three magical wizards standing on a yellow table, each holding a sign. On the left, a wizard in black robes holds a sign that says ‘AI’; in the middle, a witch in red robes holds a sign that says ‘is’; and on the right, a wizard in blue robes holds a sign that says ‘cool’“, Flux creates this scene perfectly:

Complex scene

Hands that (mostly) look right. Hands are tough for AIs, but Flux does a pretty good job. You’ll usually get the right number of fingers in the right places. It’s not perfect, but it’s a big step up—it’s consistently better than any other open text-to-image model we’ve tried:

Cool hand

Model variants

FLUX.1 comes in three variants:

  • FLUX.1 [pro] The best of FLUX.1, state-of-the-art image generation model with top-of-the-line performance.
  • FLUX.1 [dev] An open-weight, guidance-distilled model for non-commercial applications. Directly distilled from FLUX.1 [pro], FLUX.1 [dev] has similar quality and prompt adherence capabilities, while being more efficient than a standard model of the same size.
  • FLUX.1 [schnell]: The fastest model, tailored for local development and personal use. Openly available under an Apache 2.0 license.

Pricing

FLUX.1 is priced per image:

  • FLUX.1 [pro] is $0.055 per image.
  • FLUX.1 [dev] is $0.030 per image.
  • FLUX.1 [schnell] is 0.003 per image.

Next steps

FLUX.1 is a fantastic model, so do try it out. And keep watching this space—now that the model is out in the wild, we are starting to work on features like fine-tuning for FLUX.1, so stay tuned.