Microsoft Copilot คืออะไร?

Microsoft Copilot คือฟีเจอร์ผู้ช่วยอัจฉริยะที่ใช้ AI เพื่อช่วยในการทำงานภายในแอปของ Microsoft 365 เช่น Word, Excel, PowerPoint, Outlook และ Teams โดยทำหน้าที่ช่วยสรุป เขียน วิเคราะห์ และจัดการข้อมูล

ต้องเชื่อมต่ออินเทอร์เน็ตหรือไม่จึงจะใช้งาน Copilot ได้?

จำเป็นต้องเชื่อมต่ออินเทอร์เน็ต เนื่องจาก Copilot ทำงานร่วมกับโมเดล AI บนคลาวด์เพื่อให้ผลลัพธ์ที่แม่นยำและอัปเดตข้อมูลล่าสุด

สามารถใช้ Copilot ช่วยเขียนเอกสารหรืออีเมลได้อย่างไร?

ผู้ใช้สามารถพิมพ์คำสั่ง เช่น “สรุปรายงานในย่อหน้าเดียว” หรือ “เขียนอีเมลตอบลูกค้าอย่างเป็นทางการ” และ Copilot จะสร้างข้อความให้ตามคำสั่ง

Copilot ปลอดภัยต่อข้อมูลส่วนบุคคลหรือไม่?

ใช่ Copilot ได้รับการออกแบบโดยยึดหลักความปลอดภัยและการปกป้องความเป็นส่วนตัว โดยข้อมูลของผู้ใช้จะไม่ถูกใช้ในการฝึกโมเดล AI และมีระบบการควบคุมสิทธิ์การเข้าถึงข้อมูลอย่างเข้มงวด

Gemini Omni: Google's new AI for intelligent video creation.

Q: Copilot ใช้งานได้กับแอปไหนบ้าง?

ปัจจุบัน Copilot รองรับ Microsoft Word, Excel, PowerPoint, Outlook, Teams, OneNote, และอื่น ๆ ในตระกูล Microsoft 365

Artificial intelligence is evolving rapidly, and Google has unveiled one of its most ambitious AI models yet, called Gemini Omni, announced at Google I/O 2026. This new family of AI models represents a significant step towards multimodal AI, integrating the ability to understand text, images, audio, and video into a single creative system.

According to a Google blog post, Gemini Omni is designed to “create everything from every input,” starting with video creation and editing. This makes the model a next-generation AI platform for creative work that could transform how digital content is created in the future.

What is Gemini Omni?

Gemini Omni is a new family of AI models from Google developed to elevate the capabilities of AI from specialized systems to a comprehensive multimodal platform capable of understanding and creating various media formats within a single system.

Unlike previous generations of AI, which often separated processing between text, images, audio, or video, Gemini Omni is designed to process multiple data types simultaneously, including text, images, audio, and video clips. The AI can then create or edit high-quality videos using natural, conversational commands.

Google describes Gemini Omni as a significant step forward in modern AI because it focuses not only on content creation but also on understanding the world, maintaining scene realism, ensuring physics-based movement, and integrating multimodal reasoning capabilities. This results in more consistent and realistic output.

The first model launched in this family was... Gemini Omni Flash It is designed to make AI-powered video creation faster, easier to use, and more accessible to users.

Why is Gemini Omni important?

Over the years, the AI industry has been moving towards systems that can integrate multiple capabilities into a single platform. However, today, most users still have to switch between various tools for writing text, creating images, editing videos, animating, or managing audio.

Gemini Omni attempts to change this concept by integrating all creative workflows into a single conversational system. Users don't need editing skills or to use complex software; they simply describe what they need in natural language, and the AI can automatically manage the content creation and editing process.

For example, users can transform ordinary photos into cinematic videos, adjust lighting and scene atmosphere, create animations, add effects, or even edit entire scenes instantly through conversation. These capabilities make Gemini Omni considered one of Google's most advanced creative AI systems today.

Key features of Gemini Omni

Create videos from multiple input formats.

One of the most important capabilities of the Gemini Omni is its ability to create multimodal videos.

Users can include:

Text command
picture
original video
Reference sound

To create a new video generated by AI.

Google states that the system can maintain continuity of scenes, characters, and movement better than previous AI generations.

Conversational Video Editing with Gemini Omni

Traditional video editing software often requires highly technical skills, but Gemini Omni changes this with its dialogue-based editing system.

Users can type simple commands such as:

"Change the light to resemble sunset."
"Add rain effect"
"Move the camera closer."
"Change the background to a futuristic city."

The AI will update the video while maintaining the continuity of all scenes.

This approach helps overcome the limitations in creating professional-quality content.

Gemini Omni and World Understanding

Another key aspect of Gemini Omni is its ability to create a realistic world-building experience.

Google explains that this model incorporates the following capabilities:

Understanding Physics
space perception
Contextual reasoning
Knowledge about the real world.

This helps the created scenes look more realistic.

For example:

The shadows in the scene are consistent.
The movement follows the principles of physics.
The characters retain their original identity.
The scene remained consistent even after multiple revisions.

This represents a significant advancement compared to previous generations of AI-generated media, which often suffered from issues of realism and scene continuity.

Gemini Omni Flash

The first version to be made public was Gemini Omni Flash.

According to Google, this model emphasizes:

Creating content quickly.
An accessible workflow.
Creativity through conversation
User-friendly design

Google also stated that Omni Flash is being gradually rolled out in:

Gemini App
Google Flow
YouTube Shorts

Platform	Usage
Gemini App	Create videos with AI
YouTube Shorts	Create short video content.
Google Flow	Creative Workflow
Google AI Tools	Developer tools
Future Workspace Tools	Multimedia Productivity Tools

In addition, Google plans to release an API for developers in the future.

Gemini Omni versus traditional AI models.

Features	Traditional AI Tools	Gemini Omni
Creating a message	Supported	Supported
visualization	Limit	Advanced
video creation	Requires multiple tools	Integrated into one system
Conversational Editing	Rarely found	Directly supported
Multimodal Input	some	Full version
Continuity of the scene	uneven	Much better

The most significant highlight of Gemini Omni is its ability to integrate multiple creative workflows into a single AI experience.

Community feedback on Gemini Omni.

The initial reception to Gemini Omni has been overwhelmingly positive, particularly among content creators and AI developers.

Many users consider this system to be:

A significant step for AI in the "create anything" concept.
Centralized creative platform
Step into a fully interactive AI environment.

In some Reddit conversations, Omni has been compared to a future where AI can create entire digital worlds, not just individual media pieces.

However, concerns remain regarding:

Fake data created by AI.
The risks of Deepfake
Content credibility
Over-reliance on AI in creative work.

Google has therefore expanded various systems, such as:

SynthID
Content Credentials
AI-powered media monitoring tools.

To increase the transparency of AI-generated content.

The Future of Multimodal AI

The launch of Gemini Omni reflects a major shift in the field of AI.

The industry is moving towards a system that can:

Understanding multiple types of media simultaneously.
Create interactive content.
Maintain continuity within the long-term context.
Works like a creative assistant.

Instead of using separate AI for text, images, and videos, future systems may become completely centralized creative tools.

And Gemini Omni seems to be Google's vision for that future.

Summary

With the launch of Gemini Omni, Google is pushing multimodal AI into a new era focused on content creation, conversational editing, and realistic understanding of the world.

By integrating text, image, audio, and video capabilities into a single system, Gemini Omni has the potential to transform the way creators, businesses, educators, and developers create digital content.

As AI-powered media becomes more advanced, tools like Gemini Omni could transform video production from a technical workflow into a fully conversational creative experience in the future.

Interested in Microsoft products and services? Send us a message here.

Explore our digital tools

If you are interested in implementing a knowledge management system in your organization, contact SeedKM for more information on enterprise knowledge management systems, or explore other products such as Jarviz for online timekeeping, OPTIMISTIC for workforce management. HRM-Payroll, Veracity for digital document signing, and CloudAccount for online accounting.

Read more articles about knowledge management systems and other management tools at Fusionsol Blog, IP Phone Blog, Chat Framework Blog, and OpenAI Blog.

New Gemini Tools For Educators: Empowering Teaching with AI

Digital Signature

E Signature

E Learning

Online Learning

If you want to stay up-to-date with the latest technology and AI news, check out this website It's updated daily!

What Is Agentic AI? Understanding the Next Leap in Autonomous Intelligence

Fusionsol Blog in Vietnamese

Frequently Asked Questions (FAQ)

What is Microsoft Copilot?

Microsoft Copilot is an AI-powered assistant feature that helps you work within Microsoft 365 apps like Word, Excel, PowerPoint, Outlook, and Teams by summarizing, writing, analyzing, and organizing information.

Which apps does Copilot work with?

Copilot currently supports Microsoft Word, Excel, PowerPoint, Outlook, Teams, OneNote, and others in the Microsoft 365 family.

Do I need an internet connection to use Copilot?

An internet connection is required as Copilot works with cloud-based AI models to provide accurate and up-to-date results.

How can I use Copilot to help me write documents or emails?

Users can type commands like “summarize report in one paragraph” or “write formal email response to client” and Copilot will generate the message accordingly.

Is Copilot safe for personal data?

Yes, Copilot is designed with security and privacy in mind. User data is never used to train AI models, and access rights are strictly controlled.

Table of Contents

Gemini Omni: Google's new AI for intelligent video creation.