Company patents
Beijing Baidu Netcom Science Technology Co., Ltd.
Beijing Baidu Netcom Science Technology Co., Ltd. shows a shift in its patent strategy, with significant year-over-year (YoY) fluctuations across its core computing categories. Machine Learning & AI and Computer Vision remain dominant, each at 22.4% of the portfolio. Natural Language Processing grew 187.1% YoY in 2025, which suggests an emerging focus, although filings in that area have declined so far in 2026. Together, these swings point to a dynamic and potentially reactive approach to innovation.
Patent Trend by Technology Area
Yearly patent publications since 2023
Product themes
Product-level themes inferred from filings since 2023.
1,243 US filings (since 2023) · 12 categories · 49 themes
Techniques for generating human-like text or other content using large pre-trained models, often involving prompt engineering, speculative decoding, or multi-modal inputs for content creation.
Systems and methods utilizing artificial intelligence, particularly large language models and neural networks, to extract, summarize, generate, or categorize information from unstructured or semi-structured data sources.
Methods and apparatus for improving the visual fidelity, resolution, or compression efficiency of video signals, often through advanced processing, up-scaling, or neural network-based filters.
Methods and systems for improving the quality of video streams, generating intermediate frames, or continuously locating and following objects within a sequence of images, even under occlusion.
Utilizing machine learning, particularly deep learning, to analyze medical data such as images, sensor readings, or physiological signals for disease prediction, diagnosis, or treatment assessment.
Techniques for generating, updating, and utilizing highly detailed digital maps that include lane-specific information, and for precisely determining a vehicle's position within these lanes, often using sensor data.
Methods and apparatus for detecting objects and determining their three-dimensional position and orientation (pose) using imagery or point cloud data, often for navigation, surveying, or environmental understanding.
Technologies for generating artificial speech that is personalized, context-aware, or adaptable to specific virtual agents or messaging campaigns, often utilizing text-to-speech (TTS) and audio caching for efficient delivery.
Techniques utilizing deep learning models like Generative Adversarial Networks (GANs) or diffusion models to create new images, modify existing ones, or generate synthetic data based on various inputs or conditions.
Methods and systems for identifying, extracting, and structuring specific entities, relationships, or insights from text-based documents, often involving techniques like named entity recognition, relation extraction, or summarization.
Methods and systems for identifying synthetic or manipulated speech (deepfake audio) using forensic analysis of audio features, such as breath patterns, vocoder signatures, or machine learning models to determine authenticity.
Algorithms and hardware optimizations for rapidly identifying and characterizing relevant visual features (e.g., objects, motion, gradients) from images or video streams, often integrating machine learning for feature representation and recognition, with a focus on real-time performance and reduced computational cost.
Specialized hardware, architectural designs, and computational methods to improve the speed, efficiency, and security of artificial intelligence and machine learning model execution, particularly for inference and data processing.
Systems and methods for enhancing the safety of vulnerable road users (pedestrians, cyclists) by improving their detection, prediction, and precise localization relative to the vehicle, often leveraging communication technologies and specialized markers.
Techniques for combining and analyzing information from multiple distinct data modalities (e.g., text, image, video, audio, sensor data) to derive richer insights or improve system performance and decision-making.
AI systems designed to engage in natural language dialogue, maintain conversation state, understand user intent, and generate relevant responses, often across multiple communication channels or modalities.
Methods for training machine learning models across multiple decentralized devices or servers while keeping data localized, often involving aggregation of model parameters and secure communication.
Techniques for combining data from disparate sensor types (e.g., cameras, radar, mobile device signals) to achieve a more robust and comprehensive understanding of an environment or subject, often leveraging machine learning for interpretation and correlation.
Techniques for enhancing, encoding, decoding, or separating speech and audio signals, often involving multi-microphone arrays, acoustic echo cancellation, beamforming, or advanced audio compression for improved clarity and quality.
Developing and applying machine learning algorithms that leverage quantum computing principles, such as quantum circuits or autoencoders, for tasks like simulation or data processing.
Algorithms and systems for generating, optimizing, and executing trajectories for autonomous vehicles or robots to move through an environment, often involving obstacle avoidance, route validation, and goal reaching.
Methods and systems for efficiently allocating computing resources, balancing workloads, and managing power states to improve performance, reduce energy consumption, or enhance reliability in computing platforms.
Systems that employ imaging and image processing to automatically detect defects, verify states, or ensure quality control in manufactured goods, printed materials, or industrial processes.
Technologies that create dynamic and interactive visual content for displays, including virtual/wearable systems, by generating overlays, replacing input streams, or merging real-time user actions with digital environments.
Algorithms and systems for planning and executing complex vehicle maneuvers, often involving cooperation with other vehicles or infrastructure, to optimize traffic flow, avoid collisions, or navigate challenging scenarios such as lane changes, cut-ins, and traffic congestion.
Applications of speech processing and artificial intelligence for medical diagnosis, therapeutic interventions, or accessibility solutions, particularly for conditions affecting speech production or hearing.
Techniques and hardware for autonomous systems to gather and interpret data about their surroundings, including obstacle detection, object recognition, and depth estimation, to inform control decisions.
Integrated systems for managing parking facilities, guiding vehicles to available spots, and providing notifications, often leveraging sensors, communication, and remote control.
Technologies enabling the creation and management of virtual computing environments, including virtual machines and virtual desktops, with an emphasis on secure and efficient remote access, updates, and performance.
Systems leveraging artificial intelligence and machine learning to dynamically adjust educational content, learning paths, goals, or feedback based on individual user performance, progress, or physiological data. This includes generating personalized exercises, recommendations, and adaptive sequencing of knowledge points.
Systems and methods that leverage location data, IoT sensors, and predictive analytics to optimize urban services such as traffic flow, emergency response, parking, and waste collection.
Methods and systems that identify unusual or suspicious patterns in data streams, often leveraging machine learning models trained on normal behavior, to detect threats, faults, or significant events as they occur.
Systems that process data to provide personalized recommendations, predict events, or automate decision-making processes based on learned patterns, user behavior, or environmental factors.
Techniques for monitoring system components and behaviors to anticipate failures, performance degradation, or anomalies, often leveraging machine learning for pattern recognition and forecasting.
Systems and methods for automating the lifecycle of machine learning models, including pipeline deployment, model management, versioning, and configuring for different inference environments.
Processes for creating or manipulating three-dimensional digital representations of objects or environments, including mesh generation, surface fitting, and depth estimation from multiple views.
Technologies for generating and utilizing detailed spatial maps within confined or structured environments like care facilities, construction sites, or industrial warehouses, often for robot navigation or asset tracking.
Integration and processing of data from diverse sensors (e.g., magnetometers, odometers, IMUs, vision sensors) to achieve robust and accurate positioning, especially in environments where GPS is unreliable or unavailable.
Techniques to improve the accuracy and robustness of Automatic Speech Recognition (ASR) systems by incorporating contextual information, dynamic hint words, or customized machine learning models for specific domains or users.
Application of machine learning models to process complex data and generate actionable insights, predictions, or classifications that inform or automate decision-making processes in various domains like healthcare, business, or industrial control.
Methods and systems for integrating, transforming, and managing complex or domain-specific data from disparate sources into a unified structure, often for specific applications like social networks, genomics, or business forms.
Development and optimization of novel neural network layers or architectures specifically designed to improve performance or efficiency for computer vision tasks.
Automated systems using image processing and artificial intelligence to identify, classify, and assess the extent of damage to structures or objects, supporting maintenance or insurance claims.
Systems that monitor a vehicle operator's physiological state, attentiveness, or behavior using in-cabin sensors and machine learning to enhance safety or personalize vehicle functions.
Technologies for deploying, managing, and governing applications and services in cloud environments, particularly focusing on containerization, microservice architectures, API gateways, and distributed data management.
Technologies that process, analyze, and leverage geographic information system (GIS) data, location data, and spatial analytics for applications such as monitoring, navigation, and environmental assessment.
Techniques and hardware architectures designed to efficiently generate and display complex 3D graphics, particularly for interactive applications like virtual reality, focusing on speed and visual quality.
Systems designed to automatically detect errors or failures and initiate predefined or intelligent corrective actions, recovery procedures, or notifications to minimize downtime and manual intervention.
Systems that combine data from multiple camera sensors or capture multiple images from different perspectives or qualities, often involving image processing techniques like synthesis to create enhanced or comprehensive views.
Patents
Vision-Based Object & Pose Estimation