Zero-Shot Video Generation

The intersection of artificial intelligence and multimedia continues to evolve, breaking down barriers between different forms of media. In this project, the research titled “Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators” conducted by Picsart AI Research Lab represents a significant breakthrough. This study introduces a pioneering method that directly converts textual descriptions into videos, bridging the gap between natural language processing and computer vision. This development not only caters to the growing demand for dynamic visual content but also showcases the machine’s capability to interpret and transform human language into a visual format. By addressing the challenge of text-to-video synthesis, this research sets a new standard for interdisciplinary studies in artificial intelligence. ...

November 22, 2023 · 16 min · Amey Thakur

Generative Adversarial Networks

Special thanks to Mega Satish for her meaningful contributions, support, and wisdom that helped shape this work. Deep learning’s breakthrough in the field of artificial intelligence has resulted in the creation of a slew of deep learning models. One of these is the Generative Adversarial Network, which has only recently emerged. The goal of GAN is to use unsupervised learning to analyse the distribution of data and create more accurate results.

August 27, 2021 · 34 min · Amey Thakur

Adversarial Open Domain Adaption Framework (AODA): Sketch-to-Photo Synthesis

Special thanks to Mega Satish for her meaningful contributions, support, and wisdom that helped shape this work. This paper aims to demonstrate the efficiency of the Adversarial Open Domain Adaption framework for sketch-to-photo synthesis. The unsupervised open domain adaption for generating realistic photos from a hand-drawn sketch is challenging as there is no such sketch of that class for training data.

July 28, 2021 · 20 min · Amey Thakur