Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects. Existing approaches gain controllability of generative adversarial networks (GANs) via manually annotated training data or a prior 3D model, which often lack flexibility, precision, and generality. In this work, we study a powerful yet much less explored way of controlling GANs, that is, to

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Introduction

What is DragGAN?

DragGAN is a powerful tool for interactive point-based manipulation on the generative image manifold. It allows users to "drag" any points of the image to precisely reach target points in a user-interactive manner.

How does DragGAN work?

DragGAN consists of two main components: 1) a feature-based motion supervision that drives the handle point to move towards the target position, and 2) a new point tracking approach that leverages the discriminative GAN features to keep localizing the position of the handle points.

Features of DragGAN

  • Precise control over where pixels go, thus manipulating the pose, shape, expression, and layout of diverse categories such as animals, cars, humans, landscapes, etc.
  • Ability to deform an image with realistic outputs even for challenging scenarios such as hallucinating occluded content and deforming shapes that consistently follow the object's rigidity.
  • Can be used for image manipulation and point tracking tasks.

Price

The paper and code are available for free, and the images, text, and video files on the site are made freely available for non-commercial use under the Creative Commons CC BY-NC 4.0 license.

Drag - Alternative

Roboflow: Computer vision tools for developers and enterprises

Everything you need to build and deploy computer vision models, from automated annotation tools to robust, device-agnostic deployment solutions.

1.3 M
Unleash Your Creative Potential with Roughly App

Unleash your creative potential with Roughly. Explore a new level of visual expression and watch your ideas materialize with this AI-powered digital tool. Ideal for artists, designers, and creative professionals, Roughly's AI Art Assistant helps bring your sketches, doodles, and illustrations to life. Whether you're creating for Instagram or using it as a self-hosted notes app, Roughly offers a seamless experience for drawing, shaping, and exporting your work into PDFs or images for downloads. With insights into mobile app usage statistics and a global user base, Roughly is revolutionizing the way iPhone apps are used for creative endeavors.

2.4 K
RunDiffusion - Automatic1111 in the Cloud

Fully managed Automatic1111, Fooocus, and ComfyUI in the cloud on blazing fast GPUs. No code. Get a private workspace in 90 seconds. Start creating AI Generated art now!

157.0 K
Runway AI

Runway AI: Runway is a leading AI research company that is influencing the future of art, entertainment, and human creativity. Explore cutting-edge AI tools and solutions at Runway AI.

6.3 M
More Tags about: Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold