NF4 version with 24GB of VRAM can run it; individual players finally don't have to just watch the excitement.

View Original
CoinNetwork
Ideogram releases its first open-source weighted image model Ideogram 4.0: 9.3B parameters, supports structured JSON prompts
Ideogram 4.0 is the first open-source weighted foundational image model, with 9.3 billion parameters. It uses a single-stream diffusion transformer architecture, with the text encoder being qwen3-vl-8b-instruct. It provides NF4 (CUDA support for 24GB of VRAM) and FP8 versions. The inference code is licensed under Apache 2.0; it is free for non-commercial/academic research, while commercial deployment requires a commercial license. The core innovation is a structured JSON prompt interface, enabling users to precisely control image layout, style, and composition through JSON strings. Benchmarks: 7bench 0.69 MIoU, X-Omni 0.97, and it ranks first among open-source models in preference blind testing.
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned