OmniSVG: A Unified Scalable Vector Graphics Generation Model (omnisvg.github.io)

submitted 1 week ago* (last edited 1 week ago) by Even_Adder@lemmy.dbzer0.com to c/stable_diffusion@lemmy.dbzer0.com

Abstract

Scalable Vector Graphics (SVG) is an important image format widely adopted in graphic design because of its resolution independence and editability. The study of generating high-quality SVG has continuously drawn attention from both designers and researchers in the AIGC community. However, existing methods either produce unstructured outputs at huge computational cost or are limited to generating monochrome icons with over-simplified structures. To produce high-quality and complex SVG, we propose OmniSVG, a unified framework that leverages pre-trained Vision-Language Models (VLMs) for end-to-end multimodal SVG generation. By parameterizing SVG commands and coordinates into discrete tokens, OmniSVG decouples structural logic from low-level geometry for efficient training while maintaining the expressiveness of complex SVG structure. To further advance the development of SVG synthesis, we introduce MMSVG-2M, a multimodal dataset with two million richly annotated SVG assets, along with a standardized evaluation protocol for conditional SVG generation tasks. Extensive experiments show that OmniSVG outperforms existing methods and demonstrates its potential for integration into professional SVG design workflows.
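
To give a rough feel for what "parameterizing SVG commands and coordinates into discrete tokens" could look like, here is a minimal Python sketch. It is not the tokenizer from the paper: the command list, the `GRID` size, the one-token-per-coordinate layout, and the function name `tokenize_path` are all assumptions made up for illustration, and it only handles simple absolute-coordinate paths.

```python
import re

# Hypothetical vocabulary, NOT taken from the OmniSVG paper or code.
COMMANDS = ["M", "L", "C", "Z"]
GRID = 200  # assumed canvas size; coordinates are clamped to integers in [0, GRID)


def tokenize_path(d: str) -> list[int]:
    """Map a simple absolute-coordinate path string to integer tokens.

    Command letters get IDs 0..len(COMMANDS)-1; each quantized coordinate
    gets the ID len(COMMANDS) + value.
    """
    tokens = []
    for part in re.findall(r"[MLCZ]|-?\d+(?:\.\d+)?", d):
        if part in COMMANDS:
            tokens.append(COMMANDS.index(part))
        else:
            # Round to the nearest grid cell and clamp to the canvas.
            value = min(max(round(float(part)), 0), GRID - 1)
            tokens.append(len(COMMANDS) + value)
    return tokens


print(tokenize_path("M 10 20 L 30 40 Z"))
```

The point of a scheme like this is that a sequence model only ever sees a small, fixed vocabulary instead of free-form numeric text.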

Paper: https://arxiv.org/abs/2504.06263

Code: https://github.com/OmniSVG/OmniSVG/

Weights: https://huggingface.co/OmniSVG/OmniSVG

Project Page: https://omnisvg.github.io/

Demo: https://huggingface.co/spaces/OmniSVG/OmniSVG-3B

[–] rizzothesmall@sh.itjust.works 5 points 1 week ago (5 children)

I am very into this if it can take a non-vector graphic as input and work to that. OpenAI's attempts at that have been complete dickfarts

[–] Even_Adder@lemmy.dbzer0.com 2 points 1 week ago (1 children)

It can do IMG to SVG. Check out the right side of this image:

[–] GenderNeutralBro@lemmy.sdf.org 5 points 1 week ago (1 children)

Hard to judge quality when what we're seeing is practically a pixel-perfect recreation. The tricky part of automated vectorization is detecting and plotting curves in such a way that it scales correctly. Bad implementations will use too many elements, or include straight lines that should be parts of curves, etc. Those errors would not be visible in those low-res rasterizations.
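
One crude way to check for the problems described above is to count the commands in a path's `d` attribute and see how many are straight segments versus curves. The sketch below is only a heuristic under stated assumptions: the helper names `path_command_counts` and `straight_ratio` are made up here, it counts explicit command letters only (implicitly repeated commands such as `L 1 1 2 2` are counted once), and a high ratio does not by itself prove a bad vectorization.

```python
import re
from collections import Counter


def path_command_counts(d: str) -> Counter:
    """Count explicit SVG path commands in a `d` string, ignoring case."""
    return Counter(c.upper() for c in re.findall(r"[MmLlHhVvCcSsQqTtAaZz]", d))


def straight_ratio(d: str) -> float:
    """Fraction of drawing commands that are straight lines (L, H, V)."""
    counts = path_command_counts(d)
    straight = counts["L"] + counts["H"] + counts["V"]
    curved = counts["C"] + counts["S"] + counts["Q"] + counts["T"] + counts["A"]
    total = straight + curved
    return straight / total if total else 0.0


example = "M0 0 L10 0 L10 10 C 10 20 20 20 20 10 Z"
print(path_command_counts(example))
print(straight_ratio(example))
```

Running this over the paths in a generated SVG and comparing against a hand-drawn version of the same image would give a rough sense of element count and curve usage that a low-res rasterization hides.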

[–] Even_Adder@lemmy.dbzer0.com 4 points 1 week ago (1 children)

The project page didn't have a link to it, but there is a demo on HF.

[–] GenderNeutralBro@lemmy.sdf.org 3 points 1 week ago

Just gave it a try. I couldn't get coherent results from img-to-svg with a few different tests of low-res pixel art and high-res cartoons. txt-to-svg also gave me incoherent blobs even with simple prompts. Something must be wrong there. Is it working for anyone else?

I might just try installing it locally when I get home.
