Google releases new AI creation tool to accelerate multimodal content generation.

robot
Abstract generation in progress
At the recent I/O developer conference, Google announced a series of upgrades to its AI creation tools for developers, aimed at lowering the barrier to entry and improving efficiency for multimedia content generation through the latest Gemini model family. In the field of video and multimodal creation, Google released the new Gemini Omni model. This model can understand and process text, images, audio, and video inputs, and generate coherent video content. Its most prominent feature is support for conversational editing—users simply describe modification requirements in natural language, such as changing a character, adjusting lighting, or altering a scene, and the model automatically completes the edits. (Sina Finance)
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
Add a comment
Add a comment
No comments
  • Pinned