Play Ironman BGM
Play Summer BGM
Logo

Infinite-Length Video Generation with Error Recycling

PS: The background video is generated by our Stable Video Infinity

Wuyang Li, Wentao Pan, Po-Chien Luan, Yang Gao, Alexandre Alahi

Technique Report Code B Bilibili YouTube

⚠️ Note

All videos displayed on this website have been compressed for web delivery, which may result in reduced visual quality compared to the original generated content. The compression is necessary to ensure optimal loading times and bandwidth efficiency. All the videos have been sped-up from 16 FPS to 24 FPS for better visual experience.

🎵 Click the BGM buttons on the right side to enjoy background music!

Stable Video Infinity

Stable Video Infinity (SVI) is able to generate ANY-length videos with high temporal consistency, plausible scene transitions, and controllable streaming storylines in ANY domains. All the following videos are generated by SVI in an end-to-end manner using a prompt stream, e.g., the 8 min Tom and Jerry.

StreamingT2V [1]

FramePack [2]

Stable Video Infinity (SVI-Film, Ours)

Show Prompts
Tony Stark demonstrates his latest weapons technology at a military demonstration in Afghanistan. Iron Man tests his new repulsor technology. Tony Stark works frantically in his cave workshop building the first Iron Man suit. Iron Man breaks free from the terrorist cave using his makeshift armor. Tony Stark perfects his arc reactor technology in his Malibu mansion workshop. Iron Man tests his red and gold Mark II armor in his private garage. Tony Stark announces to the world that he is Iron Man at a press conference. Iron Man confronts Iron Monger in an epic battle above Stark Industries. Tony Stark discovers his father's hidden research on the Tesseract. Iron Man races through Monaco fighting Whiplash on the race track. Tony Stark creates a new element using his particle accelerator. Iron Man battles multiple Hammer drones in the Stark Expo. Tony Stark joins the Avengers Initiative after meeting Nick Fury. Iron Man flies alongside Thor and Captain America in the Battle of New York. Tony Stark guides a nuclear missile through the portal to destroy the Chitauri mothership.
Show Prompts
Iron Man stands in a futuristic city street, palm extended and glowing with energy, facing a horde of invading robots, his suit humming as he charges the repulsor beam for an imminent attack. With a surge of power, Iron Man fires a bright blue repulsor blast from his palm, striking the lead robot and causing it to explode in a shower of sparks and debris. The explosion scatters nearby robots, Iron Man leaps into the air with thrusters igniting, scanning the battlefield for the next threat as alarms blare in the background. Dodging incoming laser fire from two flanking drones, Iron Man twists mid-air, his helmet visor locking onto targets while his suit's AI provides tactical overlays. Iron Man counters with dual repulsor shots, disintegrating the drones in mid-flight, then lands gracefully on a rooftop to assess the growing robot invasion below. A larger mech emerges from the shadows, charging toward Iron Man; he activates his chest arc reactor, building energy for a unibeam attack while evading ground tremors. Iron Man unleashes the unibeam, a powerful white energy ray slicing through the mech's armor, causing it to stagger and emit smoke as civilians flee in the distance. As the mech retaliates with missile launches, Iron Man deploys countermeasures, flares exploding in the sky to divert the projectiles while he circles around for another strike. Diving low, Iron Man grabs the mech's arm with his enhanced strength, twisting it off in a metallic crunch, sparks flying as the enemy unit malfunctions. The defeated mech collapses, but more robots swarm from alleyways; Iron Man calls Jarvis for backup analysis, his suit displaying holographic enemy schematics. Iron Man takes to the skies again, weaving through skyscrapers as pursuing drones fire energy bolts, his thrusters leaving trails of blue flame. Spotting a weak point in the drone formation, Iron Man unleashes a barrage of mini-missiles from his shoulders, exploding multiple targets in a chain reaction. Debris rains down as Iron Man lands in a park, where a group of civilians cheers; he reassures them with a nod before detecting a new signal from the robot hive. Activating stealth mode, Iron Man cloaks his suit and infiltrates an abandoned warehouse, sneaking past patrolling sentries toward the control center. Inside, he hacks a terminal with his suit's interface, downloading data on the invasion's origin while overriding security protocols to disable nearby guards. Alarms trigger as he's discovered; Iron Man blasts through a wall, emerging into the street amid a firefight with reinforced robot units. Engaging in close-quarters combat, Iron Man punches through one robot's chassis, then spins to elbow another, his movements fluid and superhuman. A sniper drone takes aim from afar; Iron Man deploys his energy shield, deflecting the shot before retaliating with a precise repulsor ray. As the battle intensifies, Iron Man notices a mysterious energy signature; he flies toward it, evading anti-air fire from ground turrets. Arriving at a high-tech lab, Iron Man confronts a holographic villain projection, who taunts him about the invasion's master plan. Ignoring the taunts, Iron Man smashes through the lab's defenses, destroying computer banks that control part of the robot army. Robots breach the lab; Iron Man uses environmental hazards, toppling shelves and machinery onto them in a chaotic skirmish.
Show Prompts
Goku, filled with rage, charges forward through the air, his muscles bulging as he flies directly at the villains, the wind whipping his orange gi, energy crackling around him in a fiery aura. As Goku closes in, the villains smirk confidently, one firing a massive energy blast towards him, but Goku dodges it mid-flight, his eyes locked on the target with unyielding determination. Goku counters by gathering ki in his palms, forming a small energy sphere that grows brighter, the villains preparing their defenses as the battlefield shakes from the impending clash. Launching the energy blast, Goku sends it hurtling towards the lead villain, who blocks it with a force field, sparks flying as the two powers collide in a brilliant explosion of light. The explosion clears, revealing Goku unscathed, powering up further with a guttural yell, his hair standing on end as golden sparks begin to flicker around his body. The villains charge at Goku in unison, one swinging a massive punch, but Goku flips backward gracefully, landing on a rocky outcrop and smirking defiantly. Goku retaliates with a series of rapid punches, his fists blurring as he strikes the first villain, sending him crashing into a nearby mountain with a thunderous impact. Turning to the second villain, Goku unleashes a powerful kick, the force creating shockwaves that ripple through the air, knocking the enemy off balance. The third villain attempts a sneak attack from behind, firing dark energy beams, but Goku senses it and spins around, deflecting them with his bare hands. Building momentum, Goku channels more energy, his aura expanding into a blazing golden hue as he begins his transformation into Super Saiyan. With a mighty roar, Goku's hair turns golden and stands upright, his eyes turning teal, the ground trembling as his power level skyrockets dramatically. Now in Super Saiyan form, Goku dashes forward at supersonic speed, grabbing one villain by the collar and hurling him into the sky with effortless strength. The hurled villain recovers mid-air and dives back down, clashing fists with Goku in a high-speed aerial battle, punches echoing like thunder. Goku gains the upper hand, delivering an uppercut that sends the villain spiraling downward, crashing into the earth and creating a massive crater. The remaining villains regroup, combining their powers to form a gigantic energy wave aimed at Goku, the beam glowing with ominous purple light. Goku stands firm, cupping his hands to his side and chanting "Ka... me... ha... me...", blue energy gathering between his palms. As the villains' beam launches, Goku releases his Kamehameha Wave, the two massive energies colliding in a spectacular beam struggle that lights up the sky. Pushing harder, Goku's beam overpowers the villains', surging forward and engulfing them in a blinding explosion that shakes the entire landscape.

Creative Video Generation

Use a text prompt stream to generate short films with specific storylines, where each 5-second clip is controlled by one different text prompt

This setting targets the needs of vloggers (e.g., TikTok) for shot video creation, emphasizing moderate scene transitions.

StreamingT2V [1]

FramePack [2]

Stable Video Infinity (SVI-Film, Ours)

The image captures an aerial view of a bustling cityscape with numerous skyscrapers and high-rise buildings, set against a backdrop of water and distant greenery under a partly cloudy sky.
In the next five years, the city will experience a significant increase in renewable energy sources, reducing its carbon footprint dramatically.
A major transportation hub will be built, connecting the city to a new international airport that will facilitate increased international travel.
The urban landscape will undergo a massive transformation as part of a comprehensive green initiative, incorporating more parks and green spaces within the city limits.
High-rise buildings will start to incorporate advanced technologies for energy efficiency and sustainability, such as smart windows and solar panels.
A new cultural district will emerge, featuring world-class museums, theaters, and galleries, attracting tourists from all over the globe.
The city will implement a smart grid system, ensuring reliable power supply and reducing the risk of blackouts.
An innovative vertical farming project will be established within one of the tallest buildings, providing fresh produce directly to residents.
The city will launch a pilot program for driverless cars and autonomous public transportation systems, significantly reducing traffic congestion and emissions.
A new university campus will be built, fostering innovation and research in fields like artificial intelligence and biotechnology.
A giraffe stands in its enclosure at the zoo, its long neck reaching towards the camera.
The giraffe begins to walk slowly along the dirt path within its enclosure.
It approaches a feeding station where a caretaker is preparing to feed it.
The giraffe extends its tongue to reach for some leaves placed by the caretaker.
Its legs move gracefully as it takes steps towards the feeding area.
The giraffe's coat displays a unique pattern of brown spots on a white background.
The enclosure features a mix of trees and grassy areas providing shade and space for the animal.
The zoo's visitor center can be seen in the background with its distinctive giraffe-themed exterior.
Caretakers regularly clean the enclosure to maintain a healthy environment for the giraffe.
The giraffe's ears twitch slightly as it hears movement from nearby.
A golden retriever puppy sits on a grassy field adorned with scattered orange flowers.
The puppy's ears are perked up, attentively listening to something off-camera.
The puppy's eyes sparkle with curiosity and excitement.
A gentle breeze stirs the puppy's fur, making it feel refreshed.
The puppy begins to wag its tail vigorously in response to something it finds intriguing.
A small bird lands nearby, and the puppy watches intently without barking.
Suddenly, the puppy's owner approaches from a distance, causing the puppy to bark happily.
The puppy decides to playfully chase after a butterfly that flutters near it.
The puppy catches the butterfly and holds it gently between its front paws.
As the puppy examines the butterfly, it notices a squirrel scampering through the grass.
The image shows two elephants standing close together in a sunny, open field with a clear blue sky and distant hills in the background.
In the near future, the elephant on the left will start to lift its trunk, possibly to scratch an itch.
The elephant on the right will begin to move its ears, indicating a change in its mood or response to something off-camera.
The elephants will eventually turn their heads towards a nearby water source, seeking refreshment on this warm day.
A group of tourists will approach on foot, carrying cameras and binoculars, eager to capture the majestic creatures in their natural habitat.
The elephants will engage in a gentle push-and-pull interaction, showcasing their social bonds and playful nature.
A local conservationist will observe the elephants from a distance, noting their behavior for research purposes.
The elephants will continue their grazing, using their trunks to select fresh grass and leaves.
A small bird will fly overhead, perhaps attracted by insects stirred up by the elephants' movements.
A nearby tree will rustle slightly as the elephants' tails gently brush against it while they move.
A band is performing energetically on stage under purple and green lights.
The lead guitarist bends forward, his passion for the music evident in every movement.
Behind him, another musician strums his guitar with intensity, lost in the rhythm.
The drummer, partially obscured by the drums, maintains a steady beat.
The bassist stands at the back, his fingers dancing over the strings of his instrument.
The audience is captivated, their faces illuminated by the stage lights.
As the performance progresses, the lead singer joins the band, adding vocals to the mix.
The lighting changes, shifting from green to blue as the song transitions.
A smoke machine introduces wisps of fog that swirl around the performers, enhancing the atmosphere.
The lead guitarist's long hair flows with the movement of his body, a beautiful sight against the backdrop of the stage.
A black and white dog is giving a high-five with its front paw to a person's hand.
The dog will soon learn to high-five with both of its paws.
The person will teach the dog more tricks to further bond with them.
The dog will grow up and become an even better companion.
The dog will start to show more affection towards other pets in the neighborhood.
The person will take the dog to a park where it will meet new friends.
The dog will start to play fetch with a frisbee in the backyard.
The person will start training the dog for a dog show competition.
The dog will develop a special bond with the person through daily walks and playtime.
The dog will begin to help out around the house by fetching items.
A night-time view of an urban roundabout with multiple lanes converging at the center, illuminated by streetlights, with various cars and buses traveling in different directions.
In the near future, the roundabout will be surrounded by more modern buildings and structures.
Urban planners will propose a redesign to include additional pedestrian crossings and cycling lanes to enhance safety and accessibility.
The roundabout will see an increase in traffic as new residential and commercial areas are developed nearby.
A major infrastructure project will begin, involving the construction of an elevated walkway connecting the roundabout to nearby metro stations.
The roundabout will become a focal point for community gatherings, with temporary installations such as street art and food vendors.
The city council will approve the installation of advanced traffic monitoring systems to optimize traffic flow and reduce congestion.
The roundabout will be featured in urban design exhibitions, showcasing sustainable transportation solutions.
A new skyscraper will be built adjacent to the roundabout, providing a bird's-eye view of the intersection.
The roundabout will be renamed to honor a local civic leader who championed improved urban mobility and public transportation.
A cultural center will be established near the roundabout, hosting events and workshops on urban planning and sustainable living.
The image depicts a woman and a young girl in a kitchen setting, engaged in some form of cleaning or craft activity.
The woman is wearing a light-colored, short-sleeved top with a subtle pattern and has a tattoo visible on her left arm. She is also wearing pink rubber gloves.
The young girl, standing beside her, is dressed in a sleeveless white shirt adorned with colorful dots and stripes, complemented by a large pink bow on her head.
Both appear focused and content as they work together at a counter.
As the woman guides the girl through the activity, she picks up a spray bottle filled with a liquid, possibly a cleaning solution.
The girl reaches out to touch the counter, exploring its texture or looking for something specific.
The woman explains the next step of their task, pointing towards a set of cleaning brushes and a bucket nearby.
The girl reaches for one of the brushes, seemingly ready to assist her mother.
The woman demonstrates how to use the spray bottle effectively, showing the girl to aim it carefully at a surface.
The girl watches intently as her mother sprays the solution onto a surface, likely a countertop or a table.
A Siamese kitten rests snugly inside a straw hat, its head slightly tilted as it gazes curiously to the side.
The kitten decides to explore the room and jumps out of the hat onto the soft carpet below.
The kitten sees a feather toy and immediately pounces on it, chasing it around the living room.
The feather toy becomes stuck in a corner, causing the kitten to climb up a nearby piece of furniture to reach it.
As the kitten plays with the feather toy, it accidentally scratches the armrest of the sofa.
The owner notices the scratch mark and gently removes the kitten from the furniture, placing it back into the hat for safety.
The kitten, still playful, decides to investigate a box that has been moved to the floor.
Inside the box, the kitten discovers a small toy mouse, which it pounces on with great enthusiasm.
The kitten's excitement grows as it chases the toy mouse around the house, causing some minor chaos.
After a while, the kitten tires itself out and falls asleep on the living room floor, still clutching the toy mouse.
The drummer is seated behind a drum set with a cymbal in the foreground, ready to play.
The drummer's hand moves swiftly, striking the snare drum with precision.
A slight shimmer of light reflects off the metallic surfaces of the drums and cymbals.
The sound of the drumsticks hitting the drumheads resonates through the air.
The drummer's foot taps rhythmically on the bass drum pedal.
A small audience gathers around, their attention captivated by the performance.
The drummer switches to playing the bass drum, creating a powerful beat.
The cymbal in the foreground glints in the stage lights as it is struck.
The drummer adjusts the tension on one of the drum heads for better sound quality.
A soft, melodic tone emerges from the snare drum, adding depth to the music.
A beautifully plated dish of meat and vegetables sits elegantly on a dark gray plate against a deep blue tablecloth.
The chef carefully arranges microgreens on top of the meat, ensuring they are evenly distributed.
A diner approaches the table, admiring the vibrant colors and textures of the dish before taking their seat.
The diner begins to cut into the meat with a sharp knife, revealing a perfectly cooked interior.
The sizzling sound of the meat cooking in a pan can be heard faintly from the kitchen.
The diners are served with a side of creamy mashed potatoes, adding a comforting element to the meal.
A glass of red wine is poured, complementing the rich flavors of the meat and vegetables.
The meal is accompanied by a delicate salad, offering a refreshing contrast to the main course.
The chef presents the dish to the table, garnishing it with a sprinkle of freshly ground black pepper for added flavor.
The diner takes a bite of the meat, savoring the tender texture and savory taste.

Ultra-Long Creative Generation

Use a text prompt stream to generate long stories, where each 5-second clip is controlled by one different text prompt

This setting targets vlogger use cases (e.g., TikTok), emphasizing storytelling with plausible scene transitions and exciting contents. All methods are conditioned on the same prompts. In the compared methods, accumulated errors manifest as (1) failed text following, (2) degraded motion, and (3) visual artifacts.

Wan 2.1 [3]

FramePack [2]

Stable Video Infinity (SVI-Film, Ours)

Show Prompts
In the pre-dawn quiet, the massive highway interchange lies dormant, a concrete titan asleep. A network of empty asphalt ribbons weaves between darkened buildings and sleeping forests. The first vehicle appears, its headlights a lonely streak of white light in the gloom. The sun begins to rise, casting a soft orange glow on the highest ramps and overpasses. The city's streetlights flicker off one by one as the sky brightens. A trickle of early commuters begins, their cars spaced far apart. The interchange is the city's circulatory system, and its heart is beginning to beat. The first rays of sunlight reveal the autumn colors of the trees nestled between the ramps. The quiet hum of the first wave of traffic is the sound of the city waking up. The great daily migration is about to begin. Within minutes, the trickle of cars becomes a steady stream. Thousands of individual journeys converge on this single, complex point. The interchange channels the flow, sorting cars onto different paths with engineered precision. The morning rush hour builds, a river of steel and glass flooding the concrete banks. From above, the patterns of movement are hypnotic and complex. White headlights pour into the city center; red taillights are few. The interchange operates at peak efficiency, a testament to decades of planning. Each car is a single data point in a massive, real-time logistical puzzle. The collective sound grows from a hum to a constant, powerful roar. The city is now fully awake, inhaling its workforce from the surrounding suburbs. Buses navigate their dedicated lanes, carrying hundreds of passengers at a time. Delivery trucks rumble along, carrying the goods that keep the city running. The sun climbs higher, glinting off a million windshields and painted roofs. The interchange seems to breathe, managing the relentless pulse of traffic. Every ramp and every lane is now filled, moving in a controlled, chaotic dance. It is a marvel of modern infrastructure, a solution to the problem of movement. The surrounding trees, in their autumn finery, provide a natural contrast to the grey concrete. The system is designed for flow, for perpetual, uninterrupted motion. For a few hours, this massive structure is one of the busiest places on the planet. The morning rush finally begins to ease as commuters reach their destinations. The traffic thins slightly, moving faster now, the pressure lessened. Midday traffic is different: shoppers, tourists, and commercial vehicles take over. The pace is less frantic but still constant. The sun is directly overhead, eliminating all shadows in the interchange's canyons. The system continues its work, flawlessly directing the city's lifeblood. Then, on a crucial, high-level ramp, a plume of white smoke erupts from a car's engine. The car sputters to a halt, instantly blocking a critical artery. The driver behind brakes sharply, and the car behind them fails to stop in time. A minor collision occurs, but the blockage is now complete. The ripple effect is instantaneous and unforgiving. The flow of traffic behind the incident stops completely. A solid line of red taillights begins to snake backward down the ramp. Within minutes, the congestion spills onto the main highway below. The interchange's perfect rhythm is broken. The system, designed for motion, is now causing a massive standstill. Horns begin to honk, a symphony of frustration rising from the jam. From the air, the traffic jam looks like a clot, spreading through the arteries. Thousands of people are now trapped, their journeys indefinitely delayed. The city's inhalation has been choked off. Emergency services are dispatched, their sirens wailing in the distance. A police car and a tow truck fight their way through the congested lanes.
Show Prompts
The river continues to flow rapidly forward, churning over submerged rocks. A large wave of white water surges upwards and then crashes downwards. A piece of driftwood is carried forward by the current. The clouds in the sky drift slowly forward from right to left. A single pine needle falls downwards from a tree into the water. A fish jumps upwards from the river, then splashes back down. The camera slowly zooms in, moving forward to focus on the rapids. The camera pans to the left, revealing more of the rocky shoreline. A small squirrel runs quickly upwards along the trunk of a pine tree. The squirrel runs back downwards and disappears into the undergrowth. The water level of the river begins to rise slowly upwards. The river's current becomes stronger, moving forward with more force. A kayaker in a yellow kayak appears upstream, paddling forward. The kayaker skillfully navigates the rapids, moving left and right. The kayaker's paddle dips downwards into the water on the left side. The kayaker's paddle dips downwards into the water on the right side. The kayaker successfully passes the rapids and moves forward downstream. An eagle soars in circles high above the river. The eagle folds its wings and dives straight downwards towards the water. The eagle pulls up just above the water, a fish in its talons, and flies upwards. The eagle flies away, moving backward up the river. A large brown bear emerges from the forest on the right, moving forward. The bear walks down to the river's edge. The bear dips its head downwards to drink from the river. The bear then wades into the water, moving forward. The bear attempts to catch a fish with its paw, swiping downwards. The bear walks back out of the river and moves backward into the forest. Dark clouds begin to roll in, moving forward from the background. The sunlight fades as the clouds cover the sun. Rain begins to fall downwards, creating small splashes on the river's surface. Lightning flashes in the distance, illuminating the sky. The rain becomes heavier, falling straight down. The river's water level rises upwards more quickly. A small stream of rainwater flows downwards over the rocks on the right. The storm passes, and the clouds move away to the left. The sun reappears, and its light moves forward across the landscape. A rainbow forms, arching upwards over the river. The sun begins to set, moving downwards towards the horizon. The sky turns orange and pink, reflecting on the water's surface. The sun disappears, and the scene darkens into twilight. The moon rises upwards, casting a silvery light downwards. A bat flies erratically back and forth above the river. An owl, perched on a high branch, flies downwards from its perch. The camera viewpoint shifts, descending downwards to water level. The camera moves forward, skimming just above the surface of the rapids. The camera ascends straight upwards for a top-down view of the river. The season changes to autumn; the deciduous trees turn yellow. Yellow leaves fall downwards and are carried forward by the river. The season changes to winter; snow begins to fall downwards. The snow covers the rocks and trees in a layer of white. Ice begins to form inwards from the edges of the river.
Show Prompts
A man and a young girl ride their bikes along a sun-dappled path. The cobblestones make a gentle rumbling sound beneath their tires. Lush green trees line the path, their leaves rustling in the breeze. The man smiles, watching the girl pedal confidently beside him. She glances at him, her face full of joy for their shared adventure. They ride in a comfortable silence, enjoying the perfect spring afternoon. Sunlight filters through the leaves, creating shifting patterns on the ground. They decide to bike towards the old, historic part of the city. The man points towards a distant spire, suggesting it as their destination. The girl agrees enthusiastically, pedaling a little faster. They leave the park path and merge onto a quiet city street. Old brick buildings with colorful flower boxes line their route. They pass a bustling outdoor cafe, the air filled with the scent of coffee and pastries. They ring their bells cheerfully at a stray cat sunning itself on a wall. The man points out interesting architectural details on the buildings they pass. The girl listens, her imagination sparked by stories of the city's past. They decide to take a shortcut down a narrow, unfamiliar alleyway. The alley opens up into a small, forgotten courtyard they've never seen before. In the center of the courtyard stands a tall, old clock tower, silent and imposing.

Multimodal Conditional Generation

Audio-guided talking animation.

Wan 2.1 [3]

MultiTalk [4]

Stable Video Infinity (SVI-Talk, Ours)

This person is speaking.

Multimodal Conditional Generation

Skeleton-guided dancing animation.

Pose Video

UniAnimate-DiT [5]

Stable Video Infinity (SVI-Dance, Ours)

This person is dancing.

Generalization to Other Domains (8 min Tom & Jerry)

Let your imagination run wild and generate cartoon animations from text prompt streams.

Show Prompts
A static shot of the bright 1950s kitchen, turquoise cabinets and a chrome sink glinting; Tom cat hovers over the counter, yellow eyes narrowed, while Jerry mouse stands defiantly in a tiny milk puddle near a stack of purple plates. Close-up on Tom cat’s face: a wicked smirk creases his white muzzle; his black brows angle into a sharp V as he crooks one claw toward Jerry mouse like a menacing metronome. Jerry mouse plants his little feet, whiskers twitching; he points a thumb at his chest with comical bravado, then glances sideways at the slick milk trail leading to the sink. Tom cat springs; the countertop blurs as his shadow sails over Jerry mouse; the dive ends in a metallic CLANG against the cabinet doors, which vibrate and rattle all the bowls inside. Jerry mouse spins and sprints; he kick-starts a top plate so it skids like a flying saucer; Tom cat recovers, lunges, and belly-flops into the steel sink as the plate ricochets off the rim. A geyser of hot water smacks Tom cat in the face; steam curls around his ears; Jerry mouse swings gymnast-style from the faucet handle, lands lightly, and darts behind a chrome toaster. Tom cat wrenches a giant fork from the rack and jabs the toaster; the fork tangles, the lever drops, coils glow red; POP—two toasts shoot up and bonk the fork, which flips and claps Tom cat’s head. Slapstick slide: Tom cat steps on a rogue soap bar, windmills backward, and whooshes off the counter into a bucket that flips and helmets him; two eyeholes punch out, and bucket-legs sprint blindly. Jerry mouse rides a dishtowel down to the floor like a fireman; he threads between table legs; the bucket jams, shudders loose, trips on a taut string, and faceplants Tom cat into a pyramid of salt. Tom cat sits up, cheeks..。

Consistent Video Generation

Use one text prompt to control the motion and scene dynamics of the entire video sequence

This setting aims to generate temporally coherent videos in a homogeneous scene controlled by a single text prompt, which aligns with the previous long video objective.

StreamingT2V [1]

FramePack [2]

Stable Video Infinity (SVI-Shot, Ours)

A mother elephant and her calf stand together in a grassy field under a clear blue sky, with the mother elephant appearing protective and the calf nestled close to her side.
A person clad in dark winter attire walks through a heavy snowstorm, their figure partially obscured by swirling snowflakes, creating a stark contrast against the snowy street beneath them.
A modern Alstom Adessia train, painted in a sleek white design with green and red accents, speeds along tracks surrounded by lush greenery under a clear blue sky, with distant buildings visible in the background.
A sleek white motor yacht speeds across the turquoise blue sea, leaving a dramatic wake of white foam behind it under a clear blue sky.
A family, including a mother and her children, are feeding a giraffe at a zoo, with the children extending branches of leaves towards the giraffe through a fence.

More Creative Video Generation

Additional examples of creative video generation with diverse scenes and storylines

StreamingT2V [1]

FramePack [2]

Stable Video Infinity (SVI-Film, Ours)

A truck is traveling down a wide road with multiple lanes, set against a backdrop of an overcast sky.
In the near future, the truck will overtake a car on the left, maintaining a safe distance.
The car will merge into the right lane, allowing the truck to pass.
The truck will continue its journey, navigating through various terrain changes such as a field with wildflowers, a dense forest, and an open plain.
A wild deer will cross the road ahead, prompting the truck driver to slow down and swerve slightly to avoid it.
The truck will come to a stop at a traffic light, waiting for the signal to change.
A group of travelers will disembark from the truck, carrying backpacks and camping gear, preparing to explore a nearby hiking trail.
The truck will pick up speed again, continuing its route through increasingly unpredictable weather conditions, including light rain, heavy fog, and even hail.
A massive storm will sweep through the area, causing the truck to reduce speed and navigate carefully, with windshield wipers working furiously to clear the rain.
The truck will eventually reach its destination, a remote campsite nestled in the mountains, where the travelers will set up camp for the evening.
The serene scene captures a lone figure sitting cross-legged atop a large rock, gazing at the tranquil waters of Lake Tahoe under a vibrant sunset sky.
The person slowly stands up and carefully walks to the edge of the rock, preparing to jump into the water below.
As he jumps, the splash creates ripples that spread across the lake surface, disturbing the stillness of the water.
The ripples gradually dissipate, leaving behind a calm, undisturbed lake once again.
A gentle breeze begins to pick up, causing the rocks around him to sway slightly.
The sky starts to turn from deep oranges to softer pinks as the sun sets completely.
The person wades into the lake, feeling the cool water against his skin.
He notices a small fish swimming near the surface and decides to chase it with his hands.
Suddenly, a seagull flies overhead, startled by his sudden movement.
The fish swims away, and the person feels disappointed but content.
The car drives through a vast, barren desert landscape.
A lone figure emerges from the horizon to greet the vehicle.
The driver spots an oasis in the distance and steers towards it.
Dust clouds from the car's wheels create a dramatic effect as they travel across the terrain.
An unexpected sandstorm begins to form around the car, trapping it in its midst.
A herd of camels appears from nowhere, blocking the car's path.
The car's headlights illuminate the expansive desert, revealing hidden crevices.
A mirage appears on the horizon, giving the illusion of water nearby.
The car's engine struggles with the harsh desert heat, causing it to stall.
A group of explorers encounter the stranded car, offering help.
The two individuals are lying in bed, focused intently on a laptop screen.
The woman adjusts her position slightly to get a better view of the screen.
The man leans closer, trying to see what she is watching.
She taps the keyboard gently with her right hand, indicating she is typing or scrolling.
He leans back, catching his breath from leaning forward.
The woman turns her head to look at him, possibly discussing what they are seeing.
They both seem engrossed in whatever is displayed on the laptop.
The room's lighting remains dim, creating an intimate atmosphere.
A soft glow emanates from the laptop, illuminating their faces.
The woman's left hand rests on her chin, suggesting deep concentration.
The image captures four dancers in a dance studio, each in a unique pose, reflecting their individual styles and techniques.
The dancer in the purple leotard will soon join the others in a choreographed routine to showcase their teamwork and harmony.
As they practice, the dancer in the blue tutu might accidentally trip over her pointe shoes, causing her to lose her balance momentarily.
The dancer in the white tank top will likely adjust her position to ensure she maintains proper posture during the next rehearsal session.
The dancer in the black outfit will prepare for a solo performance by practicing her footwork with precision and grace.
The group of dancers will soon be joined by a choreographer who will offer feedback on their movements and suggest improvements to their technique.
During a break from practice, the dancers will chat about their favorite ballet pieces and share tips on mastering complex steps.
The dancers will rehearse a new piece that incorporates elements of contemporary and classical ballet to create a unique fusion of styles.
The dancer in the purple leotard will work on improving her flexibility by incorporating stretching exercises into her daily routine.
The dancer in the blue tutu will experiment with different ways to enhance her pirouettes, focusing on maintaining a steady tempo and fluid motion.
A vibrant spiral galaxy dominates the vast expanse of space, its arms reaching out like fingers of an ancient cosmic hand.
In the distant future, scientists aboard a new generation of spacecraft observe the galaxy's core, marveling at the swirling dance of stars and nebulae.
Astronauts in orbit around Earth witness a massive comet passing close to the galaxy, its tail illuminated by the galactic light.
Astronomers on Earth use advanced telescopes to capture high-resolution images of the galaxy's supermassive black hole, revealing its immense gravitational pull.
Astronauts from a far-off civilization discover the galaxy's outer edges, mapping uncharted territories and charting the galaxy's history through ancient stellar remnants.
Astrophysicists simulate the galaxy's interactions with nearby stars, predicting potential changes in its structure over the next million years.
Astronomers aboard a space station observe a rare alignment of the galaxy's arms, creating a mesmerizing pattern that captivates the scientific community.
Scientists develop new technologies to harness the energy of the galaxy's central star, potentially powering future interstellar travel.
Astronomical observatories on Earth experience unprecedented data overload as they study the galaxy's supernova activity, pushing the limits of current computational capabilities.
Astronauts from a coalition of planets explore the galaxy's inner regions, encountering diverse forms of life and civilizations.
A woman stands in a gym, lifting a barbell with weights attached.
The woman successfully completes her lift and drops the barbell to the ground.
She catches her breath after completing the lift.
Her muscles visibly tense as she prepares for the next repetition.
A fitness tracker on her wrist vibrates, signaling the end of her workout session.
She removes her weightlifting shoes, preparing to exit the gym.
The gym's lighting dims slightly, indicating the end of her workout session.
She walks over to the squat rack and places the barbell back on it.
The woman takes a moment to stretch her arms and shoulders.
She grabs a towel from the wall-mounted rack and dries off her sweat.
A baby in an orange and white striped shirt is sitting inside a cardboard box, looking curious and playful.
The baby will soon discover a toy hidden under the box.
Curiosity encourages the baby to stretch out his hands to explore every corner of the box.
The baby's eyes widen with excitement as they spot a shiny object.
The baby will attempt to climb out of the box, using its tiny hands for support.
Baby will find a piece of paper and start tearing it up.
The baby will make a mess with the torn paper, scattering it around the room.
The baby will try to stack the pieces of paper, creating a small tower.
The baby will accidentally knock over the paper tower, causing a small avalanche of paper.
Baby will giggle as the papers scatter across the floor.
A group of ballerinas in light blue tutus perform a graceful dance on stage under a large, glowing moon backdrop.
The dancers will soon begin their rehearsal for the upcoming festival.
The lead ballerina will practice her pirouettes to perfect her technique.
The choreographer will introduce a new sequence into their routine during the next practice session.
The lighting crew will adjust the stage lights to enhance the ethereal glow of the moon.
The dancers will work on their timing and synchronization as they rehearse together.
The audience will be captivated by the serene and beautiful performance on the night of the festival.
The dancers will be dressed in their finest attire for the gala event at the end of the festival.
The ballerinas will spend hours perfecting their jumps and leaps for the grand finale.
The dancers will receive constructive feedback from the judges after the competition.
Two women are celebrating a special moment inside a brightly lit bowling alley, with one woman raising her arms in joy and the other looking on with a wide smile.
Confetti will start falling from the ceiling, adding to the festive atmosphere as the women continue their celebration.
A third person will join them, carrying a cake to mark the occasion, and everyone will sing happy birthday together.
The women will take turns bowling, with each successful strike or spare drawing enthusiastic cheers from the group.
A photographer will capture candid moments of the celebration, ensuring everyone has memories to look back on.
The group will move to a nearby table for refreshments, enjoying snacks and drinks while chatting and laughing.
A slideshow of photos from the event will play on a large screen in the background, allowing everyone to relive the day's highlights.
Friends and family will share stories and well-wishes, making the birthday person feel truly special.
The party will wrap up with group photos and goodbyes, with everyone promising to stay in touch.
Social media posts will be shared later that evening, commemorating the event and the joy it brought to everyone involved.
A majestic waterfall cascades down a rugged cliffside under a vibrant blue sky with fluffy white clouds.
The waterfall's roar grows louder as it descends into a lush, verdant valley below.
A group of hikers approaches the base of the waterfall, marveling at its grandeur.
Local residents gather to celebrate the natural wonder, setting up a festive picnic near the water's edge.
Scientists conduct research nearby, studying the ecosystem around the waterfall and the effects of climate change on local flora and fauna.
The waterfall becomes a popular tourist destination, drawing visitors from all over the world.
A team of geologists explores the geological formation that supports the waterfall, uncovering secrets about the region's past.
The local community implements eco-friendly practices to protect the surrounding environment, ensuring the waterfall remains a pristine natural treasure.
A rare species of bird is spotted near the waterfall, adding to the biodiversity of the area.
The waterfall's flow temporarily decreases due to a drought, prompting concern among environmentalists.
A hand extended upward is catching raindrops on a rainy day, with the sky filled with gray clouds and rain visible in the background.
More raindrops will fall, creating a soothing pattern on the person's palm.
The person will begin to feel the cool, refreshing sensation of the rain on their skin.
A gentle breeze will blow, causing the rain to swirl around the hand, making the experience even more enjoyable.
The person will start to notice the rhythmic sound of the rain hitting the ground and their hand.
As the rain intensifies, puddles will begin to form around the person's feet, reflecting the cloudy sky above.
The person will enjoy the peaceful moment, taking in the beauty of the rain and the sense of calm it brings.
Birds will fly overhead, seeking shelter from the rain, while the person continues to stand in the open, enjoying the moment.
The rain will start to lighten, and the person will begin to lower their hand, preparing to head indoors.
The person will reflect on the experience, feeling refreshed and connected to nature through this simple act of catching raindrops.

[1] Henschel, R., Khachatryan, L., Poghosyan, H., Hayrapetyan, D., Tadevosyan, V., Wang, Z., ... & Shi, H. (2025). Streamingt2v: Consistent, dynamic, and extendable long video generation from text. In CVPR 2025.

[2] Zhang, L., & Agrawala, M. (2025). Packing input frame context in next-frame prediction models for video generation. arXiv preprint arXiv:2504.12626.

[3] Wan, T., Wang, A., Ai, B., Wen, B., Mao, C., Xie, C. W., ... & Liu, Z. (2025). Wan: Open and advanced large-scale video generative models. arXiv preprint arXiv:2503.20314.

[4] Kong, Z., Gao, F., Zhang, Y., Kang, Z., Wei, X., Cai, X., ... & Luo, W. (2025). Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation. In NeurIPS 2025.

[5] Wang, X., Zhang, S., Tang, L., Zhang, Y., Gao, C., Wang, Y., & Sang, N. (2025). Unianimate-dit: Human image animation with large-scale video diffusion transformer. arXiv preprint arXiv:2504.11289.