Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects. Existing approaches gain controllability of generative adversarial networks (GANs) via manually annotated training data or a prior 3D model, which often lack flexibility, precision, and generality. In this work, we study a powerful yet much less explored way of controlling GANs, that is, to

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Introduction

What is DragGAN?

DragGAN is a powerful tool for interactive point-based manipulation on the generative image manifold. It allows users to "drag" any points of the image to precisely reach target points in a user-interactive manner.

How does DragGAN work?

DragGAN consists of two main components: 1) a feature-based motion supervision that drives the handle point to move towards the target position, and 2) a new point tracking approach that leverages the discriminative GAN features to keep localizing the position of the handle points.

Features of DragGAN

  • Precise control over where pixels go, thus manipulating the pose, shape, expression, and layout of diverse categories such as animals, cars, humans, landscapes, etc.
  • Ability to deform an image with realistic outputs even for challenging scenarios such as hallucinating occluded content and deforming shapes that consistently follow the object's rigidity.
  • Can be used for image manipulation and point tracking tasks.

Price

The paper and code are available for free, and the images, text, and video files on the site are made freely available for non-commercial use under the Creative Commons CC BY-NC 4.0 license.

Drag - Alternative

AI Consistent Character Creator

AI Consistent Character Creator: Effortlessly craft uniform characters with artificial intelligence featuring an array of poses, facial expressions, and headshots. Ideal for character designers and creatives.

282
Photo, illustration and video editor AI tool: cre8tiveAI

An AI-based SaaS that solves a variety of photo and illustration editing tasks in under 10 seconds, such as automatic painting and increasing the resolution of images and videos, as well as clipping, layering, and color correction.

116.2 K
Cutout.Pro - AI Photo Editing | Visual Content Generation Platform, best for image and video design

All-in-one visual design platform containing AI photo and video editing tools. Automatic process for background remove, image restoration, graphic design, and content generation. With Cutout.Pro, it is one click away to optimize your content and transform your design ideas into special asset effectively.

14.2 M
Try Dalle 3 Free Online-Dall-E 3 AI Image

Announced by OpenAI, DALL-E 3 represents the latest iteration of its groundbreaking AI image generator, demonstrating remarkable improvements in accurately translating text prompts into highly realistic and detailed visuals. Releasing first to ChatGPT Plus and Enterprise users in October 2023, DALL-E 3 tight integration with ChatGPT streamlines creating prompts and maintaining image context. Incorporating safety measures against harmful content and giving artists control over art usage, DALL-E 3 promises to revolutionize turning ideas into precise images.

14.2 K
More Tags about: Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold