Company patents
Zoom Communications, Inc.
Zoom Communications, Inc. surprisingly shows a broad patent portfolio beyond its core video conferencing, with significant filings in 'Streaming & Real-Time Media' (28.6% of portfolio) and 'Pictorial / Video Communications' (27.0%). However, patenting activity across almost all categories, including 'Computer Vision' with a -71.9% YoY decline and 'Streaming & Real-Time Media' with a -47.4% YoY decline, indicates a significant shift in patenting priorities, though 2026 data is partial.
Patent Trend by Technology Area
Yearly patent publications since 2023
Product themes
Product-level themes inferred from filings since 2023, with category chips showing where each theme appears. Select a theme to filter the patents below.
608 US filings (since 2023) · 12 categories · 36 themes
Technologies enabling synchronous, interactive multimedia communication sessions, including user interfaces, content sharing, and underlying session management for multiple participants.
Systems and methods for establishing, maintaining, modifying, and terminating communication sessions across various network architectures, including service discovery, resource allocation, and resilience mechanisms.
User interface designs and systems that enable multiple users to interact with shared content, provide feedback, or coordinate activities, often across different devices or locations.
Methods and systems for improving the quality of video streams, generating intermediate frames, or continuously locating and following objects within a sequence of images, even under occlusion.
Systems and methods for automatically managing telephone calls, including intelligent routing based on various criteria, scheduling callbacks, and processing emergency calls.
Features within messaging platforms that enhance user interaction and content consumption through intelligent suggestions, content persistence mechanisms, engagement analytics, and adaptive presentation of conversational media.
Methods and apparatus for improving the visual fidelity, resolution, or compression efficiency of video signals, often through advanced processing, up-scaling, or neural network-based filters.
Methods and systems for enhancing the security and privacy of electronic messages, often by integrating contextual data such as location, social network graphs, or user authentication levels to control access, filter content, or enable specific group interactions.
Techniques for enhancing, encoding, decoding, or separating speech and audio signals, often involving multi-microphone arrays, acoustic echo cancellation, beamforming, or advanced audio compression for improved clarity and quality.
Technologies that create dynamic and interactive visual content for displays, including virtual/wearable systems, by generating overlays, replacing input streams, or merging real-time user actions with digital environments.
Techniques for improving the perceived quality, synchronization, and moderation of audio and voice streams, often involving codec management, transcoding, and content analysis.
Techniques for rendering, interacting with, and managing content within augmented or virtual reality environments, including spatial tracking, gaze interaction, and dynamic multi-application display management.
Technologies for generating artificial speech that is personalized, context-aware, or adaptable to specific virtual agents or messaging campaigns, often utilizing text-to-speech (TTS) and audio caching for efficient delivery.
Systems and methods for authenticating users, devices, or applications, authorizing their access to resources based on policies, and managing digital identities across various platforms.
AI systems designed to engage in natural language dialogue, maintain conversation state, understand user intent, and generate relevant responses, often across multiple communication channels or modalities.
Systems and methods utilizing artificial intelligence, particularly large language models and neural networks, to extract, summarize, generate, or categorize information from unstructured or semi-structured data sources.
Methods and systems for efficiently distributing and delivering media content, including techniques for multi-source streaming, content caching, and optimizing delivery based on network conditions or device capabilities.
Techniques for generating human-like text or other content using large pre-trained models, often involving prompt engineering, speculative decoding, or multi-modal inputs for content creation.
Core infrastructure and operational techniques for efficient and reliable message handling, including server-side logic for managing subscriptions, aggregating messages, optimizing network connections, and ensuring data consistency across distributed messaging services.
Methods and systems for identifying, extracting, and structuring specific entities, relationships, or insights from text-based documents, often involving techniques like named entity recognition, relation extraction, or summarization.
Methods and systems for protecting network resources and data from unauthorized access, misuse, or attack, encompassing authentication, authorization, encryption, and traffic filtering mechanisms. This includes securing communication channels and validating network access.
Designing user interfaces and interaction methods specifically for mobile or wearable devices, enabling control of external systems, monitoring user states, or facilitating real-world transactions.
Systems that combine data from multiple camera sensors or capture multiple images from different perspectives or qualities, often involving image processing techniques like synthesis to create enhanced or comprehensive views.
Systems that process data to provide personalized recommendations, predict events, or automate decision-making processes based on learned patterns, user behavior, or environmental factors.
Techniques and hardware architectures designed to efficiently generate and display complex 3D graphics, particularly for interactive applications like virtual reality, focusing on speed and visual quality.
Techniques to improve the accuracy and robustness of Automatic Speech Recognition (ASR) systems by incorporating contextual information, dynamic hint words, or customized machine learning models for specific domains or users.
Methods and systems for integrating, transforming, and managing complex or domain-specific data from disparate sources into a unified structure, often for specific applications like social networks, genomics, or business forms.
Techniques for combining and analyzing information from multiple distinct data modalities (e.g., text, image, video, audio, sensor data) to derive richer insights or improve system performance and decision-making.
Systems that employ imaging and image processing to automatically detect defects, verify states, or ensure quality control in manufactured goods, printed materials, or industrial processes.
Methods and systems for displaying complex data in three-dimensional graphical formats, allowing users to manipulate, explore, and derive insights from the data through interactive controls.
Methods and systems for identifying synthetic or manipulated speech (deepfake audio) using forensic analysis of audio features, such as breath patterns, vocoder signatures, or machine learning models to determine authenticity.
Technologies for establishing and maintaining secure communication channels between devices or networks, often employing encryption, secure protocols, or virtual private networks (VPNs).
Engineering solutions for creating electronic devices with bendable, foldable, or stretchable form factors, often involving hinges, flexible displays, and sliding mechanisms to enable dynamic physical configurations.
Mobile applications and systems leveraging wireless communication and location data (e.g., GPS, RFID, geo-fencing) to provide context-specific services, transactions, or user interactions.
Techniques utilizing deep learning models like Generative Adversarial Networks (GANs) or diffusion models to create new images, modify existing ones, or generate synthetic data based on various inputs or conditions.
Techniques for combining data from disparate sensor types (e.g., cameras, radar, mobile device signals) to achieve a more robust and comprehensive understanding of an environment or subject, often leveraging machine learning for interpretation and correlation.
Patents
Showing 1-10 of 608