Sieve Blog - Discover everything about video & audio AI

Blog

Our latest product updates and thoughts on state-of-the-art AI capabilities.

September 10, 2024

How Scenery approaches human-centric AI video understanding with Sieve

by Mokshith Voodarla • 2 min read

We discuss how Scenery uses Sieve to run human-centric video understanding workloads that power features like AI Shorts.

September 9, 2024

VEED partners with Sieve to launch VEED Clips

by Mokshith Voodarla • 3 min read

We discuss a partnership between VEED and Sieve to launch VEED Clips, a new AI-powered video clipping tool.

September 2, 2024

Exploring ways to text prompt SAM 2

by Lachlan Gray • 6 min read

SAM 2 can't natively take in text prompts. We discuss various ways to build pipelines around SAM 2 to accomplish text-prompted segmentation.

August 27, 2024

The fastest way to run Meta's SAM 2 (Segment Anything Model 2)

by Jacob Marshall • 6 min read

Learn about Meta's SAM 2 (Segment Anything Model 2) and how Sieve's optimized implementation runs 2x faster. Explore use cases, benchmarks, and how to use SAM 2.

August 20, 2024

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

by Gaurang Bharti • 4 min read

In this blog, we dive into MuseTalk, a state-of-the-art zero-shot lipsyncing model. We cover how it works, its pros and cons, and how to run it on Sieve.

July 30, 2024

Dubbing an entire Khan Academy course in 10 minutes

by Mokshith Voodarla • 5 min read

We walk through using the Sieve API to download and dub an entire Khan Academy course in under 10 minutes.

June 20, 2024

Introducing Sieve Dubbing 1.0: AI Dubbing for Developers

by Mokshith Voodarla • 4 min read

We discuss the launch of Sieve’s Dubbing API, the first AI dubbing solution purpose-built for developers.

May 3, 2024

Introducing Autocrop 1.0: Format videos into different aspect ratios with AI editing

by Mokshith Voodarla • 3 min read

We discuss the launch of Autocrop 1.0, a new API that allows you to format videos into different aspect ratios with AI editing.

April 16, 2024

Zight and Sieve: Using AI to build better video communication

by Mokshith Voodarla • 4 min read

We discuss the importance of AI in video communication and why Zight chose Sieve to power their new AI features.

April 3, 2024

Finding highlights in long-form video content automatically

by Gaurang Bharti • 4 min read

We do a deep dive into building an intricate algorithm on top of LLMs to accurately identify and extract highlights from long-form video content.

March 26, 2024

How developers are changing video creation once again with AI

by Mokshith Voodarla • 5 min read

We discuss the first time computers drastically changed video creation and how it’s changing once again because of new AI models.

March 15, 2024

Introducing Describe: Incredibly descriptive audiovisual summaries for videos

by Gaurang Bharti • 5 min read

We discuss the launch of Describe along with the challenges and approaches to generating audiovisual descriptions of videos.

March 13, 2024

Adding Sound Effects to Stock Videos with AI

by Mokshith Voodarla • 2 min read

In this post, we build an app that adds sound effects to stock videos using vision language models and audio generation models.

March 6, 2024

Introducing GPU sharing on Sieve

by Gaurav Rao • 4 min read

In this post, we discuss support for GPU sharing on Sieve and how it enables faster, more cost-effective AI models.

February 28, 2024

Fast, efficient active speaker detection on videos

by Mokshith Voodarla • 5 min read

In this post, we discuss active speaker detection as a deep learning task and how we built a solution that performs ~90% faster than other solutions.

December 11, 2023

Announcing the most cost-effective audio transcription API

by Mokshith Voodarla • 5 min read

In this post, we discuss the commoditization of audio transcription and a new Sieve offering around it that is 5x cheaper than other providers while still maintaining speed and accuracy.

November 22, 2023

Improving on open-source for fast, high-quality AI lipsyncing

by Abhinav Ayalur • 5 min read

We discuss modifying current lipsyncing solutions such as OpenRetalker’s Video Retalking to get a performant, production-ready lipsyncing solution.

October 19, 2023

State of the art audio enhancement in 5 minutes

by Abhi Upadhyay • 4 min read

Learn how we developed a quality AI audio enhancement app with open-source, rivaling the best APIs in the market. Try it for yourself!

March 7, 2023

Automatically generating video chapter titles with AI

by Abhi Upadhyay • 4 min read

In this blog post, we go through the process of generating video chapter titles with OpenAI's Whisper + GPT-3 models and an open-source text segmentation technique!

February 28, 2023

Building realistic video AI avatars in an hour from scratch

by Abhi Upadhyay • 4 min read

Learn about our process building a Twitter AI bot that can generate avatar videos and responses in minutes using Sieve.

November 14, 2022

Sieve's Video AI API Beta and ~$4M Raise

by Mokshith Voodarla • 2 min read

The explosion of rich data, the Sieve public beta, our ~$4M seed round, and how we enable developers to build amazing experiences with video + AI.