Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects. Existing approaches gain controllability of generative adversarial networks (GANs) via manually annotated training data or a prior 3D model, which often lack flexibility, precision, and generality. In this work, we study a powerful yet much less explored way of controlling GANs, that is, to

Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold

Introduction

What is DragGAN?

DragGAN is a powerful tool for interactive point-based manipulation on the generative image manifold. It allows users to "drag" any points of the image to precisely reach target points in a user-interactive manner.

How does DragGAN work?

DragGAN consists of two main components: 1) a feature-based motion supervision that drives the handle point to move towards the target position, and 2) a new point tracking approach that leverages the discriminative GAN features to keep localizing the position of the handle points.

Features of DragGAN

  • Precise control over where pixels go, thus manipulating the pose, shape, expression, and layout of diverse categories such as animals, cars, humans, landscapes, etc.
  • Ability to deform an image with realistic outputs even for challenging scenarios such as hallucinating occluded content and deforming shapes that consistently follow the object's rigidity.
  • Can be used for image manipulation and point tracking tasks.

Price

The paper and code are available for free, and the images, text, and video files on the site are made freely available for non-commercial use under the Creative Commons CC BY-NC 4.0 license.

Drag - Alternative

FlowGPT - a visual interface for ChatGPT

FlowGPT is a visual interface for ChatGPT with Multi-threaded visual conversation flow

16.2 K
ExtendImageAI - Expand your images with generative AI - Try it for free

Discover ExtendImage, the ultimate AI Image Extender. Effortlessly extend images with our AI, perfecting visuals with precision. Use ExtendImage to expand images, leveraging AI to enhance details. Ideal for professionals seeking to scale visuals across various platforms.

108.1 K
Godinabox.co:GPT-3.5 AI Chatbot on Whatsapp | God In A Box

Godinabox.co:Experience the power of ChatGPT/GPT-3 on Whatsapp with our innovative and user-friendly bot, always updated with the latest model, offering unparalleled conversational AI capabilities at an affordable price, as the pioneering paid ChatGPT on Whatsapp service.

4.2 K
This is Korewa.ai: AI-Powered Conversational Design Platform for Businesses

Korewa AI is a revolutionary chat platform that utilizes artificial intelligence to bring anime characters to life, allowing users to create, publish, and interact with scarily realistic personalities, complete with memories and emotions, for a truly immersive experience.

41.7 K
More Categories