
Stability AI Releases SD3: The Most Powerful Open-Source Image Generator Is Available on Hugging Face

2024-06-12 14:10:26

Stability AI, a leading company in the field of artificial intelligence, has just released the latest generation of its open-source image generator, Stable Diffusion 3 (SD3). This model is the most powerful open-source, uncensored, customizable text-to-image generator to date.

SD3 is released under a free non-commercial license and is available via Hugging Face. It is also available through Stability AI's API and applications, including Stable Assistant and Stable Artisan. Commercial users are encouraged to contact Stability AI for licensing details.

“Stable Diffusion 3 Medium is Stability AI’s most advanced text-to-image open model yet, comprising two billion parameters,” Stability AI said in an official statement. “The smaller size of this model makes it perfect for running on consumer PCs and laptops as well as enterprise-tier GPUs. It is suitably sized to become the next standard in text-to-image models.”

Decrypt got access to the model, but the ComfyUI workflow shared by Stability required some nodes that were not yet available. The usual workflows compatible with SD1.5 and SDXL don't work with SD3. There is a post on Reddit explaining how to run it using StableSwarmUI.

The model's key features include photorealism, prompt adherence, typography, resource efficiency, and fine-tuning capabilities. It overcomes common artifacts in hands and faces, delivering high-quality images without the need for complex workflows. The model also comprehends complex prompts involving spatial relationships, compositional elements, actions, and styles. It's remarkably accomplished at generating text without artifacts or spelling errors, thanks to Stability AI's Diffusion Transformer architecture. The model is capable of absorbing nuanced details from small datasets, making it well suited to customization.

SD3 generation samples. Image: Stability AI

The model was first unveiled in February 2024, and was made available via API in April 2024.

Stability AI has collaborated with Nvidia to enhance the performance of all Stable Diffusion models. The TensorRT-optimized versions of the model will provide best-in-class performance, with past optimizations yielding up to a 50% increase in performance.

Stability AI conducted internal and external testing and implemented numerous safeguards to prevent the misuse of SD3 Medium by bad actors.

According to a spokesperson from Stability AI, the minimum hardware requirements to run SD3 range from 5GB to 16GB of GPU VRAM, depending on the specific model and its size. SD3 uses a different encoding technology from its predecessors, so it can generate better images and better understand text prompts. It is also capable of generating text, but doing so requires large amounts of computational power.

“For SD3 Medium (2 billion parameters) we recommend 16GB of GPU VRAM for higher speed, but folks with lower VRAM can still run it with a minimum of 5GB of GPU VRAM,” Stability AI told Decrypt. The firm added that “SD3 has a modular structure, allowing it to work with all 3 Text Encoders, with smaller versions of the 3 Text Encoders or with just a subset of them. Much of the VRAM is used for the text encoders. There is also the possibility of running the biggest Text Encoder, which is T5-XXL, in CPU. This means that the minimum requirements to run SD3 2B are between SD1.5 and SDXL requirements. For fine tuning that also depends on how you handle Text Encoders. Assuming you preprocess your dataset and then you unload the encoders, the requirements are around the same of SDXL using the same method.”
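To make the "subset of text encoders" idea concrete, here is a minimal Python sketch of loading SD3 Medium without T5-XXL through Hugging Face's diffusers library. It is an illustration under stated assumptions, not Stability AI's recommended setup: the repository id in MODEL_ID and the helper names pipeline_kwargs and load_sd3 are our own choices, the weights are gated behind the license on Hugging Face, and a CUDA GPU is assumed. Note that enable_model_cpu_offload keeps idle components in CPU RAM and moves them to the GPU when needed, which is related to, but not the same as, running T5-XXL on the CPU as the spokesperson describes.

```python
from typing import Any, Dict

# Assumed repo id for the diffusers-format weights (gated; the license must be accepted first).
MODEL_ID = "stabilityai/stable-diffusion-3-medium-diffusers"


def pipeline_kwargs(use_t5: bool) -> Dict[str, Any]:
    """Build extra keyword arguments for StableDiffusion3Pipeline.from_pretrained."""
    kwargs: Dict[str, Any] = {}
    if not use_t5:
        # Passing None skips loading the third text encoder (T5-XXL) and its tokenizer,
        # leaving only the two smaller CLIP text encoders.
        kwargs["text_encoder_3"] = None
        kwargs["tokenizer_3"] = None
    return kwargs


def load_sd3(use_t5: bool = False, offload: bool = True):
    """Hypothetical helper: load SD3 Medium in half precision."""
    # Imported lazily so pipeline_kwargs works without torch/diffusers installed.
    import torch
    from diffusers import StableDiffusion3Pipeline

    pipe = StableDiffusion3Pipeline.from_pretrained(
        MODEL_ID, torch_dtype=torch.float16, **pipeline_kwargs(use_t5)
    )
    if offload:
        # Keep idle components in CPU RAM; each moves to the GPU only while in use.
        pipe.enable_model_cpu_offload()
    else:
        pipe.to("cuda")
    return pipe
```

With the dependencies installed and access to the weights granted, load_sd3()("a cat holding a sign").images[0] should return a PIL image. Dropping T5-XXL trades some prompt understanding for lower memory use, so results may differ from the full three-encoder setup.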

Stability added that “there is no need for a refiner.” Dropping the refiner simplifies the generation process and enhances the overall performance of the model. SDXL introduced the refiner by releasing two models that were supposed to run one after the other: the base model generated the overall image, and the refiner added the fine details. However, the Stable Diffusion community quickly ditched the refiner and fine-tuned the base model, making it capable of generating detailed images on its own.

For some examples of what custom SDXL models are capable of generating right now without detailers, we have a guide with photorealistic generations.

Despite controversy around the company’s finances and its future, Stability made sure to let us know this likely won’t be its last rodeo. "Stability is actively iterating on improving our image models as well as focusing on our multimodal efforts across video, audio & language," the spokesperson said.

Beyond Stable Diffusion, Stability AI has released open-source models for video, text and audio. It also has other image generation technologies like Stable Cascade and DeepFloyd IF. Stability AI plans to continuously improve SD3 Medium based on user feedback.

“Our goal is to set a new standard for creativity in AI-generated art and make Stable Diffusion 3 Medium a vital tool for professionals and hobbyists alike,” Stability AI said.
