Company patents

Zoom Video Communications, Inc.

Zoom Video Communications, Inc. surprisingly showed a significant increase in patenting activity in emerging AI-related fields in 2024, with Natural Language Processing growing by +92.0% YoY and Computer Vision by +69.7% YoY, alongside a +105.0% YoY surge in Messaging & Email. However, patent filings across almost all categories, including its core Streaming & Real-Time Media (33.7% of portfolio) and Pictorial / Video Communications (30.4% of portfolio), have seen a dramatic decline in 2025 and so far in 2026, with categories like Routing, Switching & QoS and Business Methods & Fintech showing 0 patents filed so far in 2026, indicating a potential shift in its overall IP strategy.

Patent Trend by Technology Area

Yearly patent publications since 2023

Product themes

Product-level themes inferred from filings since 2023, with category chips showing where each theme appears. Select a theme to filter the patents below.

1,074 US filings (since 2023) · 12 categories · 32 themes

Real-time Multimedia Conferencing

Technologies enabling synchronous, interactive multimedia communication sessions, including user interfaces, content sharing, and underlying session management for multiple participants.

Streaming & Real-Time Media

Who else files here? →

516since 2023

-76.5%YoY

Communication Network Session Management

Systems and methods for establishing, maintaining, modifying, and terminating communication sessions across various network architectures, including service discovery, resource allocation, and resilience mechanisms.

Streaming & Real-Time Media

Who else files here? →

242since 2023

-87.2%YoY

Collaborative User Experiencesfiltered

User interface designs and systems that enable multiple users to interact with shared content, provide feedback, or coordinate activities, often across different devices or locations.

Input/Output & User Interfaces

Who else files here? →

195since 2023

-70.9%YoY

Video Enhancement & Object Tracking

Methods and systems for improving the quality of video streams, generating intermediate frames, or continuously locating and following objects within a sequence of images, even under occlusion.

Image Processing

Who else files here? →

168since 2023

-70.1%YoY

Automated Call Handling & Routing

Systems and methods for automatically managing telephone calls, including intelligent routing based on various criteria, scheduling callbacks, and processing emergency calls.

Telephone Equipment

Who else files here? →

157since 2023

-86.9%YoY

Dynamic Content & Engagement Features

Features within messaging platforms that enhance user interaction and content consumption through intelligent suggestions, content persistence mechanisms, engagement analytics, and adaptive presentation of conversational media.

Messaging & Email

Who else files here? →

134since 2023

-79.4%YoY

Video Quality & Encoding Optimization

Methods and apparatus for improving the visual fidelity, resolution, or compression efficiency of video signals, often through advanced processing, up-scaling, or neural network-based filters.

Pictorial / Video CommunicationsComputer Vision

Who else files here? →

123since 2023

-69.8%YoY

Secure & Context-Aware Messaging

Methods and systems for enhancing the security and privacy of electronic messages, often by integrating contextual data such as location, social network graphs, or user authentication levels to control access, filter content, or enable specific group interactions.

Messaging & Email

Who else files here? →

106since 2023

-78.9%YoY

Advanced Audio Signal Processing

Techniques for enhancing, encoding, decoding, or separating speech and audio signals, often involving multi-microphone arrays, acoustic echo cancellation, beamforming, or advanced audio compression for improved clarity and quality.

Speech Processing

Who else files here? →

88since 2023

-75.0%YoY

Interactive & Generative Display Systems

Technologies that create dynamic and interactive visual content for displays, including virtual/wearable systems, by generating overlays, replacing input streams, or merging real-time user actions with digital environments.

Pictorial / Video Communications

Who else files here? →

80since 2023

-61.1%YoY

Voice & Audio Quality Enhancement

Techniques for improving the perceived quality, synchronization, and moderation of audio and voice streams, often involving codec management, transcoding, and content analysis.

Streaming & Real-Time Media

Who else files here? →

70since 2023

-57.6%YoY

Adaptive Speech Synthesis & Messaging

Technologies for generating artificial speech that is personalized, context-aware, or adaptable to specific virtual agents or messaging campaigns, often utilizing text-to-speech (TTS) and audio caching for efficient delivery.

Speech Processing

Who else files here? →

49since 2023

-33.3%YoY

AR/VR User Interfaces

Techniques for rendering, interacting with, and managing content within augmented or virtual reality environments, including spatial tracking, gaze interaction, and dynamic multi-application display management.

Input/Output & User Interfaces

Who else files here? →

44since 2023

-47.1%YoY

Context-Aware Conversational AI

AI systems designed to engage in natural language dialogue, maintain conversation state, understand user intent, and generate relevant responses, often across multiple communication channels or modalities.

Speech ProcessingTelephone EquipmentNatural Language ProcessingMessaging & Email

Who else files here? →

23since 2023

-60.0%YoY

Adaptive Media Delivery & Caching

Methods and systems for efficiently distributing and delivering media content, including techniques for multi-source streaming, content caching, and optimizing delivery based on network conditions or device capabilities.

Streaming & Real-Time Media

Who else files here? →

19since 2023

-80.0%YoY

Wearable & Mobile Interaction

Designing user interfaces and interaction methods specifically for mobile or wearable devices, enabling control of external systems, monitoring user states, or facilitating real-world transactions.

Input/Output & User Interfaces

Who else files here? →

13since 2023

-50.0%YoY

Document & Information Extraction

Methods and systems for identifying, extracting, and structuring specific entities, relationships, or insights from text-based documents, often involving techniques like named entity recognition, relation extraction, or summarization.

Natural Language Processing

Who else files here? →

13since 2023

-50.0%YoY

Network Security & Access Control

Methods and systems for protecting network resources and data from unauthorized access, misuse, or attack, encompassing authentication, authorization, encryption, and traffic filtering mechanisms. This includes securing communication channels and validating network access.

Routing, Switching & QoS

Who else files here? →

12since 2023

n/a

Large Model Text Generation

Techniques for generating human-like text or other content using large pre-trained models, often involving prompt engineering, speculative decoding, or multi-modal inputs for content creation.

Natural Language ProcessingMachine Learning & AI

Who else files here? →

11since 2023

+350.0%YoY

Message System Management & Delivery

Core infrastructure and operational techniques for efficient and reliable message handling, including server-side logic for managing subscriptions, aggregating messages, optimizing network connections, and ensuring data consistency across distributed messaging services.

Messaging & Email

Who else files here? →

9since 2023

-33.3%YoY

Deepfake Voice Detection

Methods and systems for identifying synthetic or manipulated speech (deepfake audio) using forensic analysis of audio features, such as breath patterns, vocoder signatures, or machine learning models to determine authenticity.

Speech Processing

Who else files here? →

8since 2023

0.0%YoY

ASR Accuracy & Contextualization

Techniques to improve the accuracy and robustness of Automatic Speech Recognition (ASR) systems by incorporating contextual information, dynamic hint words, or customized machine learning models for specific domains or users.

Speech Processing

Who else files here? →

7since 2023

-33.3%YoY

Physical Layer & Interface Optimization

Enhancements to the physical and data link layers of network communication, focusing on hardware components, signal integrity, power efficiency, and efficient data transfer mechanisms for specific interfaces and buses.

Routing, Switching & QoS

Who else files here? →

7since 2023

n/a

Location-Aware Mobile Services

Mobile applications and systems leveraging wireless communication and location data (e.g., GPS, RFID, geo-fencing) to provide context-specific services, transactions, or user interactions.

Telephone Equipment

Who else files here? →

6since 2023

n/a

Personalized Recommendations

Systems that use user data, preferences, and machine learning to generate tailored advice, product recommendations, goal-setting plans, or contextual information for individuals across different domains.

Business Methods & Fintech

Who else files here? →

5since 2023

n/a

Multimodal Data Fusion

Techniques for combining and analyzing information from multiple distinct data modalities (e.g., text, image, video, audio, sensor data) to derive richer insights or improve system performance and decision-making.

Machine Learning & AI

Who else files here? →

4since 2023

new

MLOps & Model Deployment

Systems and methods for automating the lifecycle of machine learning models, including pipeline deployment, model management, versioning, and configuring for different inference environments.

Machine Learning & AI

Who else files here? →

3since 2023

new

Multi-Sensor Imaging & Synthesis

Systems that combine data from multiple camera sensors or capture multiple images from different perspectives or qualities, often involving image processing techniques like synthesis to create enhanced or comprehensive views.

Pictorial / Video Communications

Who else files here? →

3since 2023

n/a

Speech-based Health & Accessibility

Applications of speech processing and artificial intelligence for medical diagnosis, therapeutic interventions, or accessibility solutions, particularly for conditions affecting speech production or hearing.

Speech Processing

Who else files here? →

3since 2023

n/a

Real-time Graphics Rendering

Techniques and hardware architectures designed to efficiently generate and display complex 3D graphics, particularly for interactive applications like virtual reality, focusing on speed and visual quality.

Image Processing

Who else files here? →

2since 2023

n/a

Automated Transaction Systems

Systems designed to streamline and automate various commercial transactions, including mobile-enhanced processes, secure online checkouts, customer service interactions, and privilege issuance, often leveraging digital authentication.

Business Methods & Fintech

Who else files here? →

2since 2023

n/a

Flexible/Foldable Device Structures

Engineering solutions for creating electronic devices with bendable, foldable, or stretchable form factors, often involving hinges, flexible displays, and sliding mechanisms to enable dynamic physical configurations.

Telephone Equipment

Who else files here? →

1since 2023

n/a

Patents

Showing 1-10 of 219

Collaborative User Experiences

Page 1 of 22

US 20260122118 A1APPLICATION

H04L65/403

Active Speaker Proxy Presentation for Sign Language Interpreters

Filed:2024-12-30Pub:2026-04-30

Applicant:Zoom Video Communications, Inc.

Methods and systems provide for an active server proxy presentation for sign language interpreters within a video communication session. In one embodiment, a method presents a user interface for each of a number of client devices connected to a communication session, with each UI including one or more video feeds associated with participants of the communication session. The method receives an indication that a first participant is designating a second participant as a sign language interpreter who will perform voicing for the first participant. The method then determines that the second participant is performing voicing for the first participant, then presents, within the UIs of at least a subset of the client devices, a video feed associated with the first participant in a highlighted fashion concurrently to the second participant performing the voicing for the first participant.

US 20250310395 A1APPLICATION

H04L67/02

COLLABORATIVE WEB BROWSING DURING VIDEO CONFERENCES

Filed:2024-03-29Pub:2025-10-02

Applicant:Zoom Video Communications, Inc.

Techniques for collaborative web browsing during video conferences are disclosed. In an example method, a client device joins a video conference including a number of client devices. The client device executes a command to start a collaborative web browsing session using a web browser. The client device receives a first indication of a first web browsing action associated with a first web page by another client device. The client device outputs a first representation of the first action on the first page. The client device receives a second indication of a second web browsing action associated with a second web page by the other client device, in which the second action modifies the first representation of the first action. The client device outputs a second representation of the second action on the second page based on the modification to the first representation of the first action.

US 20250310387 A1APPLICATION

H04L65/401

SHARED APPLICATION CONTEXTS DURING VIDEO CONFERENCES

Filed:2024-03-29Pub:2025-10-02

Applicant:Zoom Video Communications, Inc.

Techniques for implementing shared application contexts during video conferences are disclosed. In an example method, a first client device joins a video conference hosted by a video conference provider. The first client device receives, from a second client device, a network address. The first client device receives, from the video conference provider, information about the network address based on the network address. The first client device renders and displays the information about the network address, in which the rendering includes one or more controls, each control associated with an action to take in response to selecting the control. The first client device receives an indication of a selection of a first control of the one or more controls. The first client device outputs a command to execute the action associated with the first control.

US 20250298791 A1APPLICATION

G06F16/242

CHAT-BASED QUERYING OF MULTIPLE DATA SOURCES USING A MULTI-AGENT INFRASTRUCTURE

Filed:2024-07-26Pub:2025-09-25

Applicant:Zoom Video Communications, Inc.

Systems and methods for implementing chat-based querying of multiple data sources using a multi-agent infrastructure are provided. In an example method, a computing system receives, from a client device, a query. The computing system determines, using an orchestrator agent, one or more relevant contexts based on the query. The computing system receives, from a storage system, context information based on the one or more relevant contexts. The computing system generates a modified query based on the query and the context information. The computing system outputs, to the orchestrator agent, the modified query and the context information. The computing system receives, from the orchestrator agent, a response to the modified query. The computing system outputs the response to the client device.

US 20250254141 A1APPLICATION

H04L51/224

AI-ASSISTED NOTIFICATIONS OF RELEVANT CONTENT DURING A VIRTUAL CONFERENCE

Filed:2024-02-06Pub:2025-08-07

Applicant:Zoom Video Communications, Inc.

An example method for AI-assisted notifications of relevant content during a virtual conference includes joining, by a client device, a virtual conference attended by one or more participants using a client application. For example, the client device may receive, a transcript of the virtual conference from a server during the virtual conference and determine if the transcript includes a phrase associated with the participant. In response to determining that the transcript includes the phrase associated with the participant, outputting a notification to the participant on a graphical user interface (GUI) provided by the client application.

US 20250227003 A1APPLICATION

H04L12/18

MULTI-MEETING MODE FOR VIRTUAL MEETINGS

Filed:2025-01-06Pub:2025-07-10

Applicant:Zoom Video Communications, Inc.

Systems and methods for providing multi-meeting modes for virtual meetings are provided. In aspects, a system including a non-transitory computer-readable medium, a communications interface, and a processor is provided. The processor may be configured to execute instructions to establish a first virtual meeting and establish a second virtual meeting. The second virtual meeting may run concurrent with the first virtual meeting. The instructions may further cause the processor to receive, from a first client device, a first request to join the first virtual meeting and a second request to join the second virtual meeting, and transmit to the first client device: a first set of multimedia streams and a second set of multimedia streams. The instructions may further cause the processor to determine a primary virtual meeting and modify one of the first set of multimedia streams or the second set of multimedia streams based on the primary virtual meeting.

US 20250158844 A1APPLICATION

H04L12/18

USER INITIATED NOTIFICATIONS FOR CHAT SECTIONS

Filed:2025-01-15Pub:2025-05-15

Applicant:Zoom Video Communications, Inc.

Example methods and systems for providing notifications of unread messages is provided. A client device detects an input action proximate to a displayed grouping of one or more chat sessions associated with a recipient user account. At least one chat session of the one or more chat sessions comprises at least one unread message. The client device provides a notification associated with the at least one unread message. The notification includes at least one chat identity corresponding to the at least one chat session of the one or more chat sessions. The client device displays the at least one unread message in the at least one chat session based on a triggering action associated with the notification.

US 20250119510 A1APPLICATION

G06F3/16

MEDIATING PARTICIPANT INTERACTIONS DURING A VIDEO WEBINAR MEETING

Filed:2024-12-18Pub:2025-04-10

Applicant:Zoom Video Communications, Inc.

One example method for mediating participant interactions during a video webinar meeting includes establishing a video webinar meeting; admitting a host and a plurality of participants to the video webinar meeting; not distributing audio streams from the plurality of participants to other participants in the video webinar meeting; receiving, from a first participant, a first submission to be posed during the video webinar meeting; determining a priority for the first submission based on one or more parameters; and distributing an audio stream associated with the first participant to the host and the remaining plurality of participants based on the priority to enable the first participant to pose the first submission.

US 20250104570 A1APPLICATION

G09B7/00

AUTOMATIC GENERATION OF INTERACTION TOOLS

Filed:2023-09-26Pub:2025-03-27

Applicant:Zoom Video Communications, Inc.

Example methods and systems for automatic generation of interaction tools. A communication platform receives a request to generate an interaction tool associated with a virtual communication session and accesses virtual communication data associated with the virtual communication session. The communication platform identifies a set of keypoint data from the virtual communication data based on the request using a machine learning model. The communication platform generates a list of questions based on the set of keypoint data and the request using a first generative artificial intelligence (AI) model. The communication platform provides the interaction tool based on the list of questions.

US 20250103821 A1APPLICATION

G06F40/35

INTERACTIVE QUERY FACILITATION

Filed:2023-09-25Pub:2025-03-27

Applicant:Zoom Video Communications, Inc.

Example methods and systems for facilitating queries about a virtual communication session are provided. A communication platform receives an initial query about the virtual communication session from a user. The communication platform accesses virtual communication data associated with a virtual communication session. The communication platform generates an initial response to the initial query based on the virtual communication data using a first pre-trained generative artificial intelligence (AI) model. The communication platform generates a first set of follow-up queries based on the initial response using a second pre-trained generative AI model. The communication platform receives a selection of a first follow-up query out of the first set of follow-up queries. The communication platform provides a first response to the first follow-up query using the first pre-trained generative AI model.

1 2 3 4 5…22