Kling O1

    The world's first unified multimodal video model. It accepts text, image, and video inputs simultaneously. Combine up to 7 reference elements (characters, avatars, props, outfits, scenes) for unmatched consistency and creative control.

    Try Kling O1 Now

    Create videos in 3 simple steps

    Describe your vision

    Write a prompt describing your video. Kling O1 accepts text, image, and video inputs simultaneously for maximum creative control.

    Add references

    Upload up to 7 reference elements (or up to 4 when you also include a video reference): characters, avatars, props, outfits, and scenes. Use a video as a motion reference to copy actions or camera movements, or as the base for transformations.

    Edit or share

    Preview your video with perfect visual consistency across all elements, make adjustments if needed, and download or share your creation directly from Freepik.

    Complex editing workflows

    Build advanced production pipelines combining multiple references and modifying existing videos with precision.

    Storyboarding with existing assets

    Use your own characters, scenarios, and elements to create animated storyboards while maintaining total visual consistency.

    Video re-styling

    Transform existing videos: change visual styles, modify backgrounds, and add or remove elements. Adjust shot sizes and camera movements while preserving the original motion.

    What our creators are saying

    Rachel Kim

    Motion Designer, Pixel Perfect Studio

    Kling O1 understands exactly what I want to achieve. The ability to combine 7 different references in one generation has revolutionized how we create content for brands.

    David Okonkwo

    Marketing Lead, FutureTech Startups

    Being able to edit existing videos with text prompts is incredible. We repurpose old content giving it a completely new look without re-shooting anything.

    Laura Vega

    Brand Strategist, Creativa Digital

    It's truly 'the video Nano Banana'. The consistency between characters and scenes we achieve now was impossible with other models. A game changer for our clients.

    Create AI videos with Kling O1, now on Freepik

    Discover the world's first unified multimodal video model. Generate, edit, and transform videos with complete creative control using advanced AI technology, now available on Freepik.

    Create with Kling O1

    Tools to skyrocket your creative freedom

    More tools and features are coming soon! Want to test them before anyone else? Become our AI partner.

    Explore other AI models

    Discover our collection of AI-powered generation tools

    Frequently Asked Questions

    • Kling O1 STD (720p) costs 75 cr/sec without video reference and 110 cr/sec with video reference. Kling O1 PRO (1080p) costs 100 cr/sec without video reference and 150 cr/sec with video reference.
    • Kling O1 is the first model that accepts text, image, and video inputs simultaneously. You can combine up to 7 reference elements (characters, avatars, props, outfits, scenes) without video, or up to 4 references when including a video reference.
    • Yes, you can upload a 3-10 second video and modify elements, backgrounds, or styles using text prompts. Kling O1 renders at 60fps for exceptionally smooth results. This allows you to transform existing content without starting from scratch.
    • Kling O1 is nicknamed "The Video Nano Banana" because it addresses the consistency problem in video generation, much as Nano Banana did for image consistency. It keeps characters and scenes visually coherent from shot to shot.
    • Yes, you can use any video to copy actions or camera movements into a completely new scene. This transfers movement dynamics seamlessly while creating entirely new visuals.
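
    To make the pricing and reference limits above concrete, here is a rough Python sketch that estimates the credit cost of a generation. It is not a Freepik or Kling API: the function name, parameters, and the assumption that credits scale linearly with duration (rate per second times seconds, with no rounding or minimums) are illustrative only. It also assumes the 7 and 4 limits count reference elements other than the video itself.

```python
# Illustrative only: names and the linear-billing assumption are hypothetical,
# not part of any official Freepik or Kling interface.

# Credits per second, keyed by (tier, uses_video_reference), as listed in the FAQ.
RATES = {
    ("STD", False): 75,   # 720p, no video reference
    ("STD", True): 110,   # 720p, with video reference
    ("PRO", False): 100,  # 1080p, no video reference
    ("PRO", True): 150,   # 1080p, with video reference
}

MAX_REFS_WITHOUT_VIDEO = 7
MAX_REFS_WITH_VIDEO = 4


def estimate_credits(tier, seconds, uses_video_reference=False, reference_count=0):
    """Return the estimated credit cost, assuming cost = rate * seconds."""
    key = (tier.upper(), bool(uses_video_reference))
    if key not in RATES:
        raise ValueError(f"unknown tier: {tier!r} (expected 'STD' or 'PRO')")
    if seconds <= 0:
        raise ValueError("seconds must be positive")

    # Assumed interpretation: the limit applies to non-video reference elements.
    limit = MAX_REFS_WITH_VIDEO if uses_video_reference else MAX_REFS_WITHOUT_VIDEO
    if not 0 <= reference_count <= limit:
        raise ValueError(f"reference_count must be between 0 and {limit}")

    return RATES[key] * seconds


if __name__ == "__main__":
    # 5 seconds at STD without a video reference: 75 * 5
    print(estimate_credits("STD", 5))
    # 10 seconds at PRO with a video reference and 4 elements: 150 * 10
    print(estimate_credits("PRO", 10, uses_video_reference=True, reference_count=4))
```

    Under these assumptions, a 5-second STD clip without a video reference comes to 375 credits, and a 10-second PRO clip with a video reference comes to 1,500 credits. Actual billing may differ, so treat the numbers as estimates.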

    If you need further information, please contact us