Databricks, Inc.

États‑Unis d’Amérique

1-100 de 188 pour Databricks, Inc.

Trier par

Recheche Texte


Affiner par
Type PI
Brevet	164
Marque	24

Juridiction
États-Unis	170
International	11
Canada	4
Europe	3

Date
Nouveautés (dernières 4 semaines)	9
2026 avril (MACJ)	2
2026 mars	5
2026 février	5
2026 janvier	5
2025 décembre	2
2026 (AACJ)	20
2025	62
2024	35
2023	16
2022	13
2021	10
Avant 2021	32
Voir plus Voir moins
Classe IPC
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage	42
G06F 16/2453 - Optimisation des requêtes	27
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet	26
G06F 16/2455 - Exécution des requêtes	25
G06F 16/23 - Mise à jour	20
G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès	19
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie	18
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données	15
G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet	14
G06F 16/14 - Détails de la recherche de fichiers basée sur les métadonnées des fichiers	10
G06F 16/21 - Conception, administration ou maintenance des bases de données	10
G06F 9/50 - Allocation de ressources, p. ex. de l'unité centrale de traitement [UCT]	10
G06N 20/00 - Apprentissage automatique	10
G06F 11/07 - Réaction à l'apparition d'un défaut, p. ex. tolérance de certains défauts	9
G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption	9
G06F 16/16 - Opérations sur les fichiers ou les dossiers, p. ex. détails des interfaces utilisateur spécialement adaptées aux systèmes de fichiers	7
G06F 17/30 - Recherche documentaire; Structures de bases de données à cet effet	7
G06F 9/445 - Chargement ou démarrage de programme	7
G06F 9/54 - Communication interprogramme	7
G06F 11/30 - Surveillance du fonctionnement	6
G06F 16/24 - Requêtes	5
G06F 16/245 - Traitement des requêtes	5
G06F 16/2458 - Types spéciaux de requêtes, p. ex. requêtes statistiques, requêtes floues ou requêtes distribuées	5
G06F 16/174 - Élimination de redondances par le système de fichiers	4
G06F 16/215 - Amélioration de la qualité des donnéesNettoyage des données, p. ex. déduplication, suppression des entrées non valides ou correction des erreurs typographiques	4
G06F 16/242 - Formulation des requêtes	4
G06F 9/455 - ÉmulationInterprétationSimulation de logiciel, p. ex. virtualisation ou émulation des moteurs d’exécution d’applications ou de systèmes d’exploitation	4
G06N 5/022 - Ingénierie de la connaissanceAcquisition de la connaissance	4
G06F 16/172 - Mise en cache, pré-extraction ou accumulation de fichiers	3
G06F 16/248 - Présentation des résultats de requêtes	3
Voir plus Voir moins
Classe NICE
42 - Services scientifiques, technologiques et industriels, recherche et conception	23
09 - Appareils et instruments scientifiques et électriques	20
35 - Publicité; Affaires commerciales	10
41 - Éducation, divertissements, activités sportives et culturelles	6

Statut
En Instance	53
Enregistré / En vigueur	135

1 2 Prochaine page

1. FASTER AND ACCURATE FORWARD GEOHASHING FOR GEOSPATIAL DATA

Numéro d'application	18925393
Statut	En instance
Date de dépôt	2024-10-24
Date de la première publication	2026-04-02
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Karavelas, Menelaos Boric, Nemanja

Abrégé

A data processing service executes a new forward and reverse geohashing process that is correct up to a threshold geohash precision. The forward and reverse geohashing processes described are correct for precisions up to 18, where a precision corresponds to 5 geohash bits. The forward geohashing process gives correct results for precisions up to 19, and the reverse geohashing process gives correct results for precisions up to 20. The geohashing methods described herein is configured to perform a relatively small number of floating point and integer operations to avoid the iterative nature of existing geohashing processes. Moreover, the transformations are completed in an accurate way while saving more computational power compared to existing geohash algorithms.

Classes IPC ?

G06F 5/01 - Procédés ou dispositions pour la conversion de données, sans modification de l'ordre ou du contenu des données maniées pour le décalage, p. ex. la justification, le changement d'échelle, la normalisation
G06F 7/487 - MultiplicationDivision

2. FASTER AND ACCURATE REVERSE GEOHASHING FOR GEOSPATIAL DATA

Numéro d'application	18925413
Statut	En instance
Date de dépôt	2024-10-24
Date de la première publication	2026-04-02
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Karavelas, Menelaos Boric, Nemanja

Abrégé

Classes IPC ?

G06F 7/487 - MultiplicationDivision
G06F 5/01 - Procédés ou dispositions pour la conversion de données, sans modification de l'ordre ou du contenu des données maniées pour le décalage, p. ex. la justification, le changement d'échelle, la normalisation

3. DATABRICKS LAKEBASE

Numéro de série	99736988
Statut	En instance
Date de dépôt	2026-03-31
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Downloadable software for use in data management, data integration, data warehousing, data mining, data sharing, data collection, data interpretation, data storage, data processing, data recovery, data queries and data analytics; downloadable software for use in integrating databases with data storage repositories; downloadable software for use in integrating relational database management systems (RDBMS) with data storage repositories; downloadable software development tools; downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology Database management; data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; business data analysis Providing online non-downloadable software for use in data management, data integration, data warehousing, data mining, data sharing, data collection, data interpretation, data storage, data processing, data recovery, data queries and data analytics; providing online non-downloadable software for use in integrating databases with data storage repositories; providing online non-downloadable software for integrating relational database management systems (RDBMS) with data storage repositories; providing online non-downloadable software development tools; providing online non-downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for use in data management, data integration, data warehousing, data mining, data sharing, data collection, data interpretation, data storage, data processing, data recovery, data queries and data analytics; software as a service (SAAS) services featuring software for use in integrating databases with data storage repositories; software as a service (SAAS) services featuring software for integrating relational database management systems (RDBMS) with data storage repositories; software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology

4. LAKEBASE

Numéro de série	99736987
Statut	En instance
Date de dépôt	2026-03-31
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

5. Data lineage tracking

Numéro d'application	17862158
Numéro de brevet	12591556
Statut	Délivré - en vigueur
Date de dépôt	2022-07-11
Date de la première publication	2026-03-31
Date d'octroi	2026-03-31
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Feng, Tao Sun, Menglei Wang, Zhuoying

Abrégé

The present application discloses a method, system, and computer system for managing lineage data for data entities. The method includes generating lineage data, wherein generating the lineage data, and storing and indexing, in a data structure, the lineage data in association with the selected data entity. The generating the lineage data includes selecting a selected data entity, obtaining a query tree that was used to generate the selected data entity, and determining lineage data for the selected data entity based at least in part on the query tree.

Classes IPC ?

G06F 16/215 - Amélioration de la qualité des donnéesNettoyage des données, p. ex. déduplication, suppression des entrées non valides ou correction des erreurs typographiques
G06F 11/07 - Réaction à l'apparition d'un défaut, p. ex. tolérance de certains défauts
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/23 - Mise à jour

6. Data Accessing with a Virtual Sandbox Database

Numéro d'application	19404167
Statut	En instance
Date de dépôt	2025-12-01
Date de la première publication	2026-03-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Khurana, Amandeep Li, Nong

Abrégé

Various embodiments of the present technology generally relate to management of big data storage and data access control systems. In some embodiments, a data access system for use in multiple application service and multiple storage service environments comprises a sandbox database for users, wherein the sandbox database is a virtual database environment via which a user may access datasets according to one or more access policies. In some embodiments, the data access system receives a user request to access a dataset stored in a database into the sandbox environment, wherein the database is associated with the data access system. In response to the request, the data access system may retrieve the corresponding data from the database, determine any associated sandbox access policies, and generate an anonymized data table in the sandbox environment.

Classes IPC ?

G06F 21/53 - Contrôle des utilisateurs, des programmes ou des dispositifs de préservation de l’intégrité des plates-formes, p. ex. des processeurs, des micrologiciels ou des systèmes d’exploitation au stade de l’exécution du programme, p. ex. intégrité de la pile, débordement de tampon ou prévention d'effacement involontaire de données par exécution dans un environnement restreint, p. ex. "boîte à sable" ou machine virtuelle sécurisée
G06F 16/248 - Présentation des résultats de requêtes
G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

7. RUNTIME ERROR ATTRIBUTION FOR DATABASE QUERIES SPECIFIED USING A DECLARATIVE DATABASE QUERY LANGUAGE

Numéro d'application	19344282
Statut	En instance
Date de dépôt	2025-09-29
Date de la première publication	2026-03-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Wang, Gengliang Fan, Wenchen Rielau, Serge Shen, Entong

Abrégé

A device may receive, from a user interface of a client device, a database query specified using a declarative database query language. A device may generate code based on the database query, wherein the generated code represents sets of instructions for executing the database query. A device may generate an error attribution mapping that maps sets of instructions to origins of the sets of instructions, wherein the error attribution mapping is filtered by eliminating one or more origins from the error attribution mapping. A device may determine a runtime error caused by executing the generated code. A device may identify one or more origins of the runtime error in the database query based on the error attribution mapping. A device may display information describing the one or more origins of the runtime error on the user interface of the client device.

Classes IPC ?

G06F 11/3604 - Analyse de logiciel pour vérifier les propriétés des programmes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/901 - IndexationStructures de données à cet effetStructures de stockage

8. SYNCHRONIZING LIBRARY VIRTUAL ENVIRONMENTS IN A SERVERLESS SETTING

Numéro d'application	US2025042679
Numéro de publication	2026/059701
Statut	Délivré - en vigueur
Date de dépôt	2025-08-20
Date de publication	2026-03-19
Propriétaire	DATABRICKS, INC. (USA)
Inventeur(s)	Falaki, Mohammad, Hossein Singh, Jaipreet Yuan, Jove, Omega Akbar, Deka, Auliya Wan, Ran

Abrégé

A data processing service receives a request from a user to execute code in a notebook. The code includes pre-defined functions. The service initializes a VM for executing the code in the notebook which is configured with a virtual environment. The service accesses a computing cluster for execution of the pre-defined functions. The computing cluster includes a driver node and one or more worker nodes. The service stores the virtual environment in a data store and provides metadata to the driver node. The metadata specifies a storage location of the virtual environment in the data store. The driver node downloads the stored virtual environment using the received metadata and initializes an environment at the one or more worker nodes using the downloaded virtual environment. The worker nodes execute the pre-defined functions in the initialized environment.

Classes IPC ?

G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

9. VIRTUAL ENVIRONMENT CACHING FOR SERVERLESS WORKLOADS

Numéro d'application	18885197
Statut	En instance
Date de dépôt	2024-09-13
Date de la première publication	2026-03-19
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Akbar, Deka Auliya Falaki, Mohammad Hossein Singh, Jaipreet Wan, Ran Yuan, Jove Omega

Abrégé

A data processing service receives a request from a user to execute code in a notebook. The service initiates a first VM for execution of the code in the notebook based on the request. The first VM may be set up with a virtual environment with configurations for executing the code. The service automatically caches the virtual environment with the configurations in a data store and automatically caches metadata associated with the virtual environment in a metadata store. The metadata may include a location identifier for identifying a caching location of the virtual environment in the data store. The metadata may include an expiration condition. When the virtual environment meets the expiration condition, the virtual environment will be invalidated in the data store. The service executes the code in the notebook in the virtual environment.

Classes IPC ?

G06F 9/455 - ÉmulationInterprétationSimulation de logiciel, p. ex. virtualisation ou émulation des moteurs d’exécution d’applications ou de systèmes d’exploitation
G06F 9/445 - Chargement ou démarrage de programme

10. Scaling delta table optimize command

Numéro d'application	18787819
Numéro de brevet	12566731
Statut	Délivré - en vigueur
Date de dépôt	2024-07-29
Date de la première publication	2026-03-03
Date d'octroi	2026-03-03
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Das, Tathagata Mahadev, Rahul Shivu Yavuz, Burak

Abrégé

The interface is to receive an indication to execute an optimize command. The processor is to receive a file name; determine whether adding a file of the file name to a current bin causes the current bin to exceed a threshold; associate the file with the current bin in response to determining that adding the file does not cause the current bin to exceed the bin threshold; in response to determining that adding the file to the current bin causes the current bin to exceed the bin threshold: associate the file with a next bin, indicate that the current bin is closed, and add the current bin to a batch of bins; determine whether a measure of the batch of bins exceeds a batch threshold; and in response to determining that the measure exceeds the batch threshold, provide the batch of bins for processing.

Classes IPC ?

G06F 16/172 - Mise en cache, pré-extraction ou accumulation de fichiers
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

11. LAKEBASE

Numéro d'application	245797000
Statut	En instance
Date de dépôt	2026-02-24
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

(1) Software; downloadable software; downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology; downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence (1) Database management; database management consultancy; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science (2) Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services; software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology; platform as a services (PAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science

12. LAKEBASE

Numéro d'application	019321243
Statut	En instance
Date de dépôt	2026-02-24
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Software; downloadable software; downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology; downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence. Database management; database management consultancy; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science. Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, , predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services; software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology; platform as a services (PAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; pr platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science.

13. Background Dataset Maintenance

Numéro d'application	19340571
Statut	En instance
Date de dépôt	2025-09-25
Date de la première publication	2026-02-19
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Khurana, Amandeep Li, Nong

Abrégé

Various embodiments of the present technology generally relate to management of big data storage and the physical removal of data via data access systems for large data processing environments having multiple application services and multiple storage services. In some embodiments, a method of physically removing data from a storage system provides for identifying one or more files needing data removal treatment, determining that a file needing data removal treatment should be queued, and populating a queue with the file. Determining that a file should be queued is based, at least in part, on a staleness tolerance. The method further provides for treating the file and replacing a previous version of the file in storage with the updated file. In some implementations, treating the file includes removing data from the file to create an updated file and may further include additional changes to the file.

Classes IPC ?

G06F 16/16 - Opérations sur les fichiers ou les dossiers, p. ex. détails des interfaces utilisateur spécialement adaptées aux systèmes de fichiers
G06F 9/54 - Communication interprogramme
G06F 16/17 - Détails d’autres fonctions de systèmes de fichiers
G06F 16/23 - Mise à jour
G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

14. Efficiently Vectorized Implementation of Operations in a Global Grid Indexing Library

Numéro d'application	19343642
Statut	En instance
Date de dépôt	2025-09-29
Date de la première publication	2026-02-05
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Cheong Zhi Xi, Desmond Karavelas, Menelaos

Abrégé

A data processing service generates for iteratively applying a geospatial function to geospatial data. The generated code includes at least a first iterative loop and a second iterative loop. The data processing service compiles the generated code to generate compiled code that vectorized at least the second iterative loop. The data processing service receives a request from a client device to perform one or more data processing operations including applying the geospatial function to a data table of geospatial cell indices. The data processing service compiles the request into one or more tasks including at least a vectorized operation based on the compiled code and executes the one or more tasks by at least invoking the vectorized operation on the set of worker nodes.

Classes IPC ?

G06F 8/41 - Compilation

15. Automatic vector index generation with machine-learned large language model

Numéro d'application	18755593
Numéro de brevet	12541493
Statut	Délivré - en vigueur
Date de dépôt	2024-06-26
Date de la première publication	2026-02-03
Date d'octroi	2026-02-03
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gupta, Akhil Peter, Eric Christopher Qu, Zhidong Raji Cherian, Kevin Tsarev, Sergei Sergeevich Vij, Ankit

Abrégé

A data processing system accesses a dataset from a data source and generates a set of embedding vectors representing the dataset in a latent space. The system splits the dataset into a set of data chunks and generates the embedding vectors. Each embedding vector represents a data chunk. The system may store the generated set of embedding vectors in a vector database that includes a plurality of embedding vectors. The system updates the embedding vectors by detecting a change to a first dataset that is represented by a first set of embedding vectors in the vector database, determining that the change to the first dataset is related to a first data chunk of the first set of data chunks included in the first dataset, updating a first embedding vector representing the first data chunk with the detected change; and storing the updated first embedding vector in the vector database.

Classes IPC ?

G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

16. SYNCHRONIZATION OF ACCESS CONTROL POLICIES WITH EXTERNAL DATA PLATFORMS

Numéro d'application	19343892
Statut	En instance
Date de dépôt	2025-09-29
Date de la première publication	2026-01-29
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Li, Nong Neeman, Itay Alfred

Abrégé

A system manages access control policies for accessing data stored in a plurality of data platforms. The system receives access control policy specification describing an access control policy that controls access to a set of datasets by a set of users. The set of datasets is defined based on a condition based on the data tags representing attributes of the datasets. The system compiles the access control policy specification to generate a platform independent access control representation of the access control policy. The platform independent access control representation comprises a set of tuples. Each tuple identifies a particular set of users, a particular set of datasets, and a particular action. The system further generates data platform specific instructions for each data platform of the plurality of data platforms.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

17. Deterministic cross-platform circuit design emulation

Numéro d'application	17953192
Numéro de brevet	12536354
Statut	Délivré - en vigueur
Date de dépôt	2022-09-26
Date de la première publication	2026-01-27
Date d'octroi	2026-01-27
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gommershtadt, Boris Zucker, Yan Szwarcfiter, Yuval

Abrégé

A configuration/method/system for cross-platform capture/replay of an emulation of a circuit design. The system receives, during an emulation of a circuit design performed on a first platform, time-annotated ready bits messages (TARBMs), and generates a first set of TARBM files, each corresponding to a particular FPGA in the first set of FPGAs. The system also generates a first set of bitmap file for the first platform, and a second set of bitmap file for a second platform. Each bitmap file records a bit index of each ready bit of an FPGA in a corresponding platform. The system then generates a second set of TARBM files based in part on the first set of TARBM files and the two sets of bitmap files. The emulation of the circuit design can then be replayed on the second platform based in part on the second set of TARBM files.

Classes IPC ?

G06F 30/327 - Synthèse logiqueSynthèse de comportement, p. ex. logique de correspondance, langage de description de matériel [HDL] à liste d’interconnections [Netlist], langage de haut niveau à langage de transfert entre registres [RTL] ou liste d’interconnections [Netlist]
G06F 30/3312 - Analyse temporelle

18. Notebook snapshot restore in a serverless setting

Numéro d'application	18999384
Numéro de brevet	12530267
Statut	Délivré - en vigueur
Date de dépôt	2024-12-23
Date de la première publication	2026-01-20
Date d'octroi	2026-01-20
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Singh, Jaipreet Rokkam, Rohith

Abrégé

A data processing service monitors an activity status of a first set of computing devices that executes code in a notebook. The service may determine that the activity status of the first set of computing devices meets a termination condition. The service may generate, prior to termination of the first set of computing devices, a snapshot recording a current code execution progress for the notebook. To generate the snapshot, the service may determine serialized variables included in the current code execution progress and store the generated snapshot in a data store. The generated snapshot may include the determined serialized variables. The service may terminate the first set of computer devices from executing the code in the notebook.

Classes IPC ?

G06F 11/14 - Détection ou correction d'erreur dans les données par redondance dans les opérations, p. ex. en utilisant différentes séquences d'opérations aboutissant au même résultat

19. Multiple pass sort

Numéro d'application	18816131
Numéro de brevet	12530334
Statut	Délivré - en vigueur
Date de dépôt	2024-08-27
Date de la première publication	2026-01-20
Date d'octroi	2026-01-20
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armstrong, Timothy Guliyev, Khayyam Krishnan, Arvind Sai

Abrégé

A system for multipass sort includes a communication interface and a processor. The communication interface is configured to receive from a client device a request to sort a dataset that includes a plurality of rows. The processor is configured to perform a first sort pass on the dataset in part by: extracting prefixes associated with a first schema element associated with the dataset for the plurality of rows; and sorting the extracted prefixes utilizing an integer sort algorithm based on a sort order included in the request to sort the dataset, where sorting the extracted prefixes includes utilizing NULL values to resolve a tied range that includes at least two rows of the plurality of rows having a same extracted prefix.

Classes IPC ?

G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/2455 - Exécution des requêtes

20. PIPELINED EXECUTION OF DATABASE QUERIES PROCESSING STREAMING DATA

Numéro d'application	19321518
Statut	En instance
Date de dépôt	2025-09-08
Date de la première publication	2026-01-01
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Balikov, Alexander Peng, Boyang

Abrégé

A database system performs pipelined execution of queries that process batches of streaming data. The database system compiles a database query to generate an execution plan and determines a set of stages based on the execution plan. The database query processes streaming data comprising batches. A scheduler schedules pipelined execution stages of the database query. Accordingly, the database system performs execution of a particular stage processing a batch of the streaming data in parallel with subsequent stages of the database query processing previous batches of the streaming data. The system further maintains watermarks for different stages of the database query.

Classes IPC ?

G06F 16/2455 - Exécution des requêtes
G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption
G06F 16/2453 - Optimisation des requêtes

21. LAKEFLOW

Numéro de série	99563163
Statut	En instance
Date de dépôt	2025-12-23
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; downloadable software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in operating data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in performing data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence Database management; database management consultancy for business purposes; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; business data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science Conducting conferences, seminars, classes workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable publications and blogs in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; providing online non-downloadable desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; providing online non-downloadable software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in operating data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in operating data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in operating data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, and data science

22. DATABRICKS

Numéro d'application	1892527
Statut	Enregistrée
Date de dépôt	2025-05-20
Date d'enregistrement	2025-05-20
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Software; downloadable software; downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology; downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence. Database management; database management consultancy; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data management consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; data management and processing, namely, business data analytics and interpretation, compiling and systemization of data into computer databases, data collection, management of data lakes, and administrative support services in the fields of data wrangling, data visualization, data governance, data science. Educational services; conducting conferences, seminars, classes, workshops, courses, and webinars; training services; providing training for certification; educational services featuring podcasts; conducting conferences, seminars, classes, workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing non-downloadable electronic publications and on-line publication of journals or diaries [blog services] in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; educational services featuring podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing non-downloadable electronic publications and on-line publication of journals or diaries [blog services]. Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services; software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology; platform as a services (PAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing technological information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence via a website; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes; data migration; data mining; data warehousing; development and design of data lakes.

23. Generating Minor Compactions to Capture Aggregated Actions for Commit Ranges to Data Files

Numéro d'application	19290300
Statut	En instance
Date de dépôt	2025-08-04
Date de la première publication	2025-11-27
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Johnson, Frederick Ryan Jain, Prakhar

Abrégé

A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.

Classes IPC ?

G06F 16/23 - Mise à jour
G06F 16/174 - Élimination de redondances par le système de fichiers

24. FETCHING QUERY RESULTS THROUGH CLOUD OBJECT STORES

Numéro d'application	19275107
Statut	En instance
Date de dépôt	2025-07-21
Date de la première publication	2025-11-13
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ghit, Bogdan Ionut Sompolski, Juliusz Xin, Shi Samwel, Bart

Abrégé

A cloud computation system configured to 1) receive a first request to read a first set of query results stored in a cloud based data storage; 2) transmit a first subset of the first set of query results in response to the first request; 3) transmit a second subset of the first set of query results in response to the first request; 4) receive a second request to read a second set of query results stored in the cloud based data storage; 5) transmit a first subset of the second set of query results in response to the second request. 6) transmit a second subset of the second set of query results in response to the second request.

Classes IPC ?

G06F 16/2458 - Types spéciaux de requêtes, p. ex. requêtes statistiques, requêtes floues ou requêtes distribuées
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/242 - Formulation des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données

25. GENERATING SYNTHETIC CAPTIONS FOR TRAINING TEXT-TO-IMAGE GENERATIVE MODELS

Numéro d'application	18653469
Statut	En instance
Date de dépôt	2024-05-02
Date de la première publication	2025-11-06
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gokaslan, Aaron Kerem Stephenson, Cory Ryan

Abrégé

A data processing service generates synthetic captions for uncaptioned images of a set of training data. The data processing service applies a pre-trained I2T model to the uncaptioned images, generating synthetic captions as output. The data processing service uses the training data to train a T2I model to produce images from text.

Classes IPC ?

G06V 20/70 - Étiquetage du contenu de scène, p. ex. en tirant des représentations syntaxiques ou sémantiques
G06T 3/40 - Changement d'échelle d’images complètes ou de parties d’image, p. ex. agrandissement ou rétrécissement
G06V 10/774 - Génération d'ensembles de motifs de formationTraitement des caractéristiques d’images ou de vidéos dans les espaces de caractéristiquesDispositions pour la reconnaissance ou la compréhension d’images ou de vidéos utilisant la reconnaissance de formes ou l’apprentissage automatique utilisant l’intégration et la réduction de données, p. ex. analyse en composantes principales [PCA] ou analyse en composantes indépendantes [ ICA] ou cartes auto-organisatrices [SOM]Séparation aveugle de source méthodes de Bootstrap, p. ex. "bagging” ou “boosting”

26. CUSTOMIZED CODE CONFIGURATIONS FOR A MULTIPLE APPLICATION SERVICE ENVIRONMENT

Numéro d'application	19236670
Statut	En instance
Date de dépôt	2025-06-12
Date de la première publication	2025-10-30
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Khurana, Amandeep Li, Nong

Abrégé

Disclosed herein provides enhancements for operating a data access system for large data processing environments. In one implementation, a method provides for maintaining a data structure comprising a plurality of customized code configurations each associated with a data request rule for each of the multiple application services. A code configuration query from a user is then received indicating a data request rule. The code configuration query requests code configurations for data retrieval from at least one of the multiple storage services over the data access system. The data structure is queried for one or more customized code configurations for each of the multiple application services associated with the indicated data request rule. The user is then provided with the one or more customized code configurations for each of the multiple application services associated with the indicated data request rule.

Classes IPC ?

G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 8/70 - Maintenance ou gestion de logiciel
G06F 9/54 - Communication interprogramme
G06F 16/2455 - Exécution des requêtes
G06F 16/2457 - Traitement des requêtes avec adaptation aux besoins de l’utilisateur
G06F 16/248 - Présentation des résultats de requêtes
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

27. MULTI-CLUSTER QUERY RESULT CACHING

Numéro d'application	19249957
Statut	En instance
Date de dépôt	2025-06-25
Date de la première publication	2025-10-23
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ghit, Bogdan Ionut Garg, Saksham Stuart, Christian Stevens, Christopher

Abrégé

A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

28. AGENTBRICKS

Numéro de série	99459958
Statut	En instance
Date de dépôt	2025-10-23
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; downloadable software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in operating data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in performing data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence Database management; database management consultancy for business purposes; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; business data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes Conducting conferences, seminars, classes workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable publications and blogs in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; providing online non-downloadable desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; providing online non-downloadable software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in operating data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in operating data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms featuring software for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in operating data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes

29. Data maintenance transaction rollbacks

Numéro d'application	18806557
Numéro de brevet	12430294
Statut	Délivré - en vigueur
Date de dépôt	2024-08-15
Date de la première publication	2025-09-30
Date d'octroi	2025-09-30
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jain, Prakhar Samwel, Bart Yavuz, Burak

Abrégé

The present application discloses a method, system, and computer system for managing a data in a storage system. The method includes receiving a first transaction that modifies or deletes first data stored in a storage system, determining that the first data is subject to an intervening re-arrangement transaction, and in response to determining that the first data is subject to the intervening re-arrangement transaction, rolling back the re-arrangement transaction at least with respect to the first data and committing the first transaction.

Classes IPC ?

G06F 16/174 - Élimination de redondances par le système de fichiers

30. TRAINING MACHINE LEARNING TRANSFORMER ARCHITECTURES WITH HYBRID MOVING AVERAGE

Numéro d'application	18610072
Statut	En instance
Date de dépôt	2024-03-19
Date de la première publication	2025-09-25
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jacobson, Austin Joshua Patel, Mihir Vipul Seguin, Landan Joseph Stephenson, Cory Ryan

Abrégé

A data processing service trains a transformer model in two stages. In a first stage, for a first number of iterations, the data processing service trains the model without computing moving average parameters. In a second stage, for a second number of iterations, the data processing service trains the model using parameters that follow a moving average of the training parameters. In the second stage, the data processing service obtains moving average parameters for a current iteration and generates training parameters for the current iteration. The data processing service computes moving average parameters for a next iteration by combining the training parameters for the current iteration and the moving average parameters for the current iteration. The data processing service updates the moving average parameters for the next iteration as the moving average parameters for the current iteration.

Classes IPC ?

G06N 20/00 - Apprentissage automatique

31. MECHANISM FOR AUTOMATED DETERMINATION AND EXCHANGE OF TRUST CREDENTIALS FOR COMPUTATIONAL DECISION SYSTEMS

Numéro d'application	18612603
Statut	En instance
Date de dépôt	2024-03-21
Date de la première publication	2025-09-25
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Arikapudi, Abhiram Khawaja, Omar Farooq Pamulapati, Arun Veuve, David Lebrocq

Abrégé

A method for generating a trust credential for an AI-driven application is presented. The method includes identifying one or more risk factors, receiving a request to generate a trust credential for an AI-driven application, and receiving the AI-driven application and associated data, wherein the AI-driven application has one or more subcomponents. The method includes applying a risk determination function to each of the one or more subcomponents of the AI-driven application and the associated data to generate a risk score for each of the one or more subcomponents. The method further includes applying a weighting function to the risk score of each subcomponent to generate a trust score for each of the one or more subcomponents, and generating the trust credential for the AI-driven application based on the trust scores of each of the one or more subcomponents.

Classes IPC ?

G06Q 10/0635 - Analyse des risques liés aux activités d’entreprises ou d’organisations

32. Feature Store with Integrated Tracking

Numéro d'application	19228554
Statut	En instance
Date de dépôt	2025-06-04
Date de la première publication	2025-09-18
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Parkhe, Mani Mewald, Clemens Zaharia, Matei Singh, Avesh

Abrégé

The present application discloses a method, system, and computer system for managing a plurality of features and storing lineage information pertaining to the features. The method includes obtaining one or more datasets, determining a first feature, wherein the first feature is determined based at least in part on the one or more datasets, and storing the first feature in a feature store. The first feature is stored in association with a dataset indication of the one or more datasets from which the first feature is determined. The feature store comprises a plurality of features.

Classes IPC ?

G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet
G06F 30/27 - Optimisation, vérification ou simulation de l’objet conçu utilisant l’apprentissage automatique, p. ex. l’intelligence artificielle, les réseaux neuronaux, les machines à support de vecteur [MSV] ou l’apprentissage d’un modèle

33. ACCESS CONTROL OF MANAGED CLUSTERS PERFORMING DATA PROCESSING ON CLOUD PLATFORMS

Numéro d'application	18593794
Statut	En instance
Date de dépôt	2024-03-01
Date de la première publication	2025-09-04
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Li, Nong

Abrégé

A system manages access control for database queries in accordance with access control policies. The techniques may be used for access control of managed clusters in cloud platforms used for data processing, for example, for MapReduce operations. According to an embodiment, the access control policies are fine grained access control policies that allow a user to access a subset of datasets including data of a dataset. The system receives and compiles a database query to generate a query plan for processing the database query. The query plan includes: one or more data access operators, one or more data processing operators, and one or more access control filters. The system executes the query plan using an executor process and a helper process. The helper process executes a data access operator and a corresponding data access filter and the executor process executes one or more data processing operators.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès
G06F 16/2453 - Optimisation des requêtes

34. Constructing batches with dataloader workers with disjoint shard downloads

Numéro d'application	18753859
Numéro de brevet	12399865
Statut	Délivré - en vigueur
Date de dépôt	2024-06-25
Date de la première publication	2025-08-26
Date d'octroi	2025-08-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Knighton, James Douglas Jariwala, Karan Kishorkumar Narayan, Saaketh Ram-Rachakonda Shah, Bandish Bimal Venigalla, Abhinav Sai

Abrégé

A data processing service accesses data files from data streams, each data file including samples to be processed for training a machine-learning model. The service converts the data files to discrete shard files, each shard file comprising a subset of samples. Each sample ID is mapped to a shard index of a respective shard file that includes the sample. The service may generate partition tensors that partition sample index spaces into a number of physical nodes, devices, workers, and batches. The service may shuffle the shard files and divide the sample IDs into a number of logical nodes and shuffle the sample IDs. The service generates shuffled sample ID arrays that map the sample indices to the sample IDs. During training, workers download disjoint shard files and map the assigned sample indices to corresponding batches of sample IDs based on the shuffled sample ID arrays.

Classes IPC ?

G06F 16/16 - Opérations sur les fichiers ou les dossiers, p. ex. détails des interfaces utilisateur spécialement adaptées aux systèmes de fichiers
G06F 16/11 - Administration des systèmes de fichiers, p. ex. détails de l’archivage ou d’instantanés
G06F 16/176 - Support d’accès partagé aux fichiersSupport de partage de fichiers

35. Integrated workspace file system

Numéro d'application	17515013
Numéro de brevet	12400011
Statut	Délivré - en vigueur
Date de dépôt	2021-10-29
Date de la première publication	2025-08-26
Date d'octroi	2025-08-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Davidson, Aaron Daniel Rex, Anders Leif Christian Kim, Jason Yongjoon Cheung, Ka-Hing Suram, Sai Teja Pratap Reddy Gui, Xinmei

Abrégé

The present application discloses a method, system, and computer system for providing an integrated workspace file system. The method includes receive a receiving a request to access a first file, determining a first user associated with the request to access the first file, determining whether the first user is authorized to access the first file, and in response to determining that the first user is authorized to access the first file: generating a uniform resource identifier (URI) associated with the first file, wherein the URI comprises a credential for accessing the first file, wherein the credential is based on a user credential and a first file access authorization, and providing the URI to a user system.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

36. LEARNING RATE SCHEDULE FOR TRAINING MACHINE LEARNING BASED LANGUAGE MODELS

Numéro d'application	18424774
Statut	En instance
Date de dépôt	2024-01-27
Date de la première publication	2025-07-31
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Paul, Mansheej

Abrégé

A system trains a machine learning model, such as a language model for a set of iterations using a learning rate that is a piecewise function comprising: (1) a first range of inputs for which the learning rate is linearly increasing in value with the number of iterations, (2) a second range of inputs after the first range of inputs for which the learning rate comprises: a first term that varies as inverse square root of the number of iterations, and a second term that has a constant value with respect to the number of iterations, and (3) a third range of inputs for which the learning rate is linearly decreasing in value with the number of iterations. The system evaluates the trained language model and determines based on the evaluation, whether the trained language model should be deployed.

Classes IPC ?

G06N 20/00 - Apprentissage automatique

37. BATCH SELECTION FOR TRAINING MACHINE-LEARNED LARGE LANGUAGE MODELS

Numéro d'application	18425893
Statut	En instance
Date de dépôt	2024-01-29
Date de la première publication	2025-07-31
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Anker, Zachary Paul, Mansheej

Abrégé

A system performs a batch selection process on a small model and uses the results of batch selection to train a large language model (LLM). The system receives training examples and splits the training examples into a holdout set and an evaluation set. Each training example corresponds to a label. The system uses trains a small model using the training examples of the holdout set. The system evaluates the small model on the training examples of the evaluation set, generating a prediction for each training example and computing a loss between the prediction and the training example's label. The system generates an LLM training set by selecting a set of training examples from the evaluation set with the highest loss. The system trains the LLM using the LLM training set.

Classes IPC ?

G06N 3/0455 - Réseaux auto-encodeursRéseaux encodeurs-décodeurs
G06N 3/084 - Rétropropagation, p. ex. suivant l’algorithme du gradient

38. Synchronization of access control policies with external data platforms

Numéro d'application	18417396
Numéro de brevet	12450389
Statut	Délivré - en vigueur
Date de dépôt	2024-01-19
Date de la première publication	2025-07-24
Date d'octroi	2025-10-21
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Li, Nong Neeman, Itay Alfred

Abrégé

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

39. TRAINING MACHINE-LEARNED TRANSFORMER ARCHITECTURES BY CLIPPING QUERIES, KEYS, AND VALUES

Numéro d'application	18414270
Statut	En instance
Date de dépôt	2024-01-16
Date de la première publication	2025-07-17
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Chiley, Vitaliy A.

Abrégé

A data processing service performs a training process to train a transformer architecture including a set of decoders coupled to receive a set of inputs and generate a set of outputs. At least one decoder or encoder includes an attention block coupled to receive a query, a key, and a value and generate an attention output. For one or more iterations, the data processing service obtains a batch of training instances for a current iteration. The parameters of the transformer architecture for the current iteration are applied to a set of inputs obtained from the batch of training instances to generate a set of estimated outputs. The applying includes obtaining a query, a key, and a value from the set of inputs, and applying a clipping function to values of the query, the key, the value.

Classes IPC ?

G06N 3/084 - Rétropropagation, p. ex. suivant l’algorithme du gradient

40. Generating minor compactions to capture aggregated actions for commit ranges to data files

Numéro d'application	18415396
Numéro de brevet	12405943
Statut	Délivré - en vigueur
Date de dépôt	2024-01-17
Date de la première publication	2025-07-17
Date d'octroi	2025-09-02
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Johnson, Frederick Ryan Jain, Prakhar

Abrégé

Classes IPC ?

G06F 16/23 - Mise à jour
G06F 16/174 - Élimination de redondances par le système de fichiers

41. CHECKPOINT AND RESTORE BASED STARTUP OF EXECUTOR NODES OF A DISTRIBUTED COMPUTING ENGINE FOR PROCESSING QUERIES

Numéro d'application	19039504
Statut	En instance
Date de dépôt	2025-01-28
Date de la première publication	2025-07-17
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ge, Xinyang Ao, Lixiang Jing, Haonan Davidson, Aaron Daniel

Abrégé

A system performs efficient startup of executors of a distributed computing engine used for processing queries, for example, database queries. The system starts an executor node and processes a set of queries using the executor node to warm up the executor node. The system performs a checkpoint of the warmed-up executor node to create an image. The image is restored in the target executor nodes. The system may store a checkpoint image for each configuration of an executor node. The configuration is determined based on various factors including the hardware of the executor node, memory allocation of the processes, and so on. The user or restore based on checkpoint images improves efficiency of execution of the startup of executor nodes.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes

42. Priority for autoscaling of streaming workloads

Numéro d'application	17728383
Numéro de brevet	12360806
Statut	Délivré - en vigueur
Date de dépôt	2022-04-25
Date de la première publication	2025-07-15
Date d'octroi	2025-07-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Neumann, Andreas Kianfar, Kiavash Zhang, Li Das, Tathagata

Abrégé

The present application discloses a method, system, and computer system for automatically scaling task-processing capacity. The method includes obtaining, at a data layer, a current measure of queued tasks and/or a task-processing capacity, obtaining, by one or more processors, a cost-prioritized criterion or a latency-prioritized criterion, determining a set of tasks to process using the task-processing capacity based at least in part on a set of an input data to process, and automatically scaling the task-processing capacity based at least in part on the current measure of queued tasks and/or the task-processing capacity and either (i) the cost-prioritized criterion or (ii) the latency-prioritized criterion.

Classes IPC ?

G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption
G06F 9/355 - Adressage indexé
G06F 9/445 - Chargement ou démarrage de programme

43. Incremental execution of extract, transform, load process using microtechniques architecture

Numéro d'application	18608776
Numéro de brevet	12386833
Statut	Délivré - en vigueur
Date de dépôt	2024-03-18
Date de la première publication	2025-07-03
Date d'octroi	2025-08-12
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Ercegovac, Vuk Lappas, Paul Liang, Xi Murthy, Mukul Papakonstantinou, Yannis Sharma, Nitin Sismanis, John Torres, Joseph Yang, Min

Abrégé

A system receives ETL specification for processing stream data, including a transform operation represented using a database query specification. The system generates a dataflow graph of a sequence of database queries by decomposing the database query into a first database query that generates an intermediate results table, and a second database query that receives as input the intermediate results table and outputs data used for performing the transform operation. The system executes the sequence of database queries for performing the transform operation on stream data received from the source. When receiving an incremental data set, the system determines an output change set based on the received incremental data set by traversing an execution plan and processing each operator in the execution plan, and computing a change set of a particular operator from the change sets output by the one or more other operators based on the incremental data set.

Classes IPC ?

G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/2455 - Exécution des requêtes

44. Compile time processing of extract, transform, load process

Numéro d'application	18608779
Numéro de brevet	12517905
Statut	Délivré - en vigueur
Date de dépôt	2024-03-18
Date de la première publication	2025-07-03
Date d'octroi	2026-01-06
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Ercegovac, Vuk Lappas, Paul Liang, Xi Murthy, Mukul Papakonstantinou, Yannis Sharma, Nitin Sismanis, John Torres, Joseph Yang, Min

Abrégé

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/2453 - Optimisation des requêtes
G06F 16/2455 - Exécution des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données

45. Reducing cluster start up time

Numéro d'application	18162546
Numéro de brevet	12340256
Statut	Délivré - en vigueur
Date de dépôt	2023-01-31
Date de la première publication	2025-06-24
Date d'octroi	2025-06-24
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Mao, Yandong Davidson, Aaron Daniel

Abrégé

The present application discloses a method, system, and computer system for starting up and maintaining a cluster in a warmed up state, and/or allocating clusters from a warmed up state. The method includes instantiating a set of virtual machines, wherein instantiating the set of virtual machines includes setting a temporary security credential for each virtual machine of the set of virtual machines, receiving a virtual machine allocation request associated with a workspace, a customer, or a tenant, in response to the virtual machine allocation request: allocating a virtual machine, wherein allocating the virtual machine comprises replacing the temporary security credential with a security credential associated with the workspace, the customer, or the tenant.

Classes IPC ?

G06F 9/50 - Allocation de ressources, p. ex. de l'unité centrale de traitement [UCT]
G06F 21/45 - Structures ou outils d’administration de l’authentification

46. Managed Metastore

Numéro d'application	19072814
Statut	En instance
Date de dépôt	2025-03-06
Date de la première publication	2025-06-19
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Zaharia, Matei Lewis, David Lian, Cheng Huo, Yuchen Ghodsi, Ali

Abrégé

The present application discloses a method, system, and computer system for providing access to information stored on system for data storage. The method includes receiving a data request from a user, determining data corresponding to the data request, determining whether the user has requisite permissions to access the data, and in response to determining that the user has requisite permissions to access the data: determining a manner by which to provide access to the data, wherein the data comprises a filtered subset of stored data, and generating a token based at least in part on the user and the manner by which access to the data is to be provided.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès
G06F 3/06 - Entrée numérique à partir de, ou sortie numérique vers des supports d'enregistrement

47. Nested array batch processing

Numéro d'application	17884099
Numéro de brevet	12332875
Statut	Délivré - en vigueur
Date de dépôt	2022-08-09
Date de la première publication	2025-06-17
Date d'octroi	2025-06-17
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Palkar, Shoumik Behm, Alexander Cashman, David

Abrégé

The present application discloses a method, system, and computer system for processing data. The method includes obtaining a query plan for processing input data in response to a query, obtaining the input data, selecting a batch of the input data, creating a metadata structure for the batch, allocating one or more contiguous parts of a memory for processing the batch, processing the batch in accordance with the metadata structure to generate resulting data, and storing each array of the resulting data for the batch in one of the one or more contiguous parts of the memory.

Classes IPC ?

G06F 16/23 - Mise à jour
G06F 16/2453 - Optimisation des requêtes
G06F 16/2455 - Exécution des requêtes

48. SELECTING OPTIMAL HARDWARE CONFIGURATIONS

Numéro d'application	18527111
Statut	En instance
Date de dépôt	2023-12-01
Date de la première publication	2025-06-05
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Bilal, Ahmed Chen, Steven Yikun Fontaine, Bruce Laurent Khudia, Daya Shanker Li, Chenran Mathur, Ankit

Abrégé

A data processing service builds a container for a customer to run a trained large language model (LLM). The data processing service receives a trained LLM and a desired configuration from a user of a client device. Based on the desired configuration, the data processing service selects a hardware configuration and structures weights of the trained LLM based on the hardware configuration. The data processing service generates a container image reflecting the hardware configuration, registers the container image to a container registry, and generates a container from the container image as well as an application programming interface (API) endpoint for the container. The data processing service deploys the trained LLM in the API endpoint using the container such that the trained LLM is accessible through API calls.

Classes IPC ?

G06F 9/445 - Chargement ou démarrage de programme

49. Clean Room Generation for Data Collaboration and Executing Clean Room Task in Data Processing Pipeline

Numéro d'application	19050371
Statut	En instance
Date de dépôt	2025-02-11
Date de la première publication	2025-06-05
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Chau, William Chakankar, Abhijit Mahoney, Stephen Michael Morris, Daniel Seth Weiss, Itai Shlomo

Abrégé

A data processing service facilitates the creation and processing of data processing pipelines that process data processing jobs defined with respect to a set of tasks in a sequence and with data dependencies associated with each separate task such that the output from one task is used as input for a subsequent task. In various embodiments, the set of tasks include at least one cleanroom task that is executed in a cleanroom station and at least one non-cleanroom task executed in an execution environment of a user where each task is configured to read one or more input datasets and transform the one or more input datasets into one or more output datasets.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

50. Pipelined execution of database queries processing streaming data

Numéro d'application	18511902
Numéro de brevet	12430339
Statut	Délivré - en vigueur
Date de dépôt	2023-11-16
Date de la première publication	2025-05-22
Date d'octroi	2025-09-30
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Balikov, Alexander Peng, Boyang

Abrégé

Classes IPC ?

G06F 16/2455 - Exécution des requêtes
G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption
G06F 16/2453 - Optimisation des requêtes

51. Query Watchdog

Numéro d'application	19030032
Statut	En instance
Date de dépôt	2025-01-17
Date de la première publication	2025-05-22
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Luszczak, Alicja Shankar, Srinath Xin, Shi

Abrégé

A system for monitoring job execution includes an interface and a processor. The interface is configured to receive an indication to start a cluster processing job. The processor is configured to determine whether processing a data instance associated with the cluster processing job satisfies a watchdog criterion; and in the event that processing the data instance satisfies the watchdog criterion, cause the processing of the data instance to be killed.

Classes IPC ?

G06F 11/07 - Réaction à l'apparition d'un défaut, p. ex. tolérance de certains défauts
G06F 11/30 - Surveillance du fonctionnement
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie

52. DATABRICKS

Numéro d'application	244431500
Statut	En instance
Date de dépôt	2025-05-20
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

(1) Software; downloadable software; downloadable software for big data processing and analytics; downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology; downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence. (1) Database management; database management consultancy; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data management consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; data management and processing, namely, business data analytics and interpretation, compiling and systemization of data into computer databases, data collection, management of data lakes, and administrative support services in the fields of data wrangling, data visualization, data governance, data science. (2) Educational services; conducting conferences, seminars, classes, workshops, courses, and webinars; training services; providing training for certification; educational services featuring podcasts; conducting conferences, seminars, classes, workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing non-downloadable electronic publications and on-line publication of journals or diaries [blog services] in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; educational services featuring podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing non-downloadable electronic publications and on-line publication of journals or diaries [blog services]. (3) Providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing online non-downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services; software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology; platform as a services (PAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing technological information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence via a website; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes; data migration; data mining; data warehousing; development and design of data lakes.

53. DATABRICKS

Numéro de série	99192548
Statut	En instance
Date de dépôt	2025-05-19
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Downloadable software for big data processing and analysis; downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; downloadable software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in operating data centers and data storage; downloadable software for use as an application programming interface (API); downloadable computer software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable software for use in performing data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence Database management; database management consultancy for business purposes; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; business data analysis; business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models; business data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes Conducting conferences, seminars, classes workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable publications and blogs in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; non-downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence providing online non-downloadable software for big data processing and analysis; providing online non-downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence data; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, , predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and electronic data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analysis; platform as a services (PAAS) services, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology for deep learning, high performance computing, distributed computing, virtualization, natural language generation, statistical learning, supervised learning, un-supervised learning, data mining, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for development and implementation of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; custom design and development of computer software; software engineering services for data processing; development and creation of computer programs for data processing and analysis; providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, un-supervised learning, predictive analytics and business intelligence; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance; • providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes

54. SHORT QUERY PRIORITIZATION FOR DATA PROCESSING SERVICE

Numéro d'application	18991083
Statut	En instance
Date de dépôt	2024-12-20
Date de la première publication	2025-05-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gudesa, Venkata Sai Akhil Van Hövell Tot Westerflier, Herman Rudolf Petrus Catharina Nakandala, Supun Chathuranga

Abrégé

A cluster computing system maintains a first set of queues for short queries and a set second set for longer queries. The first set is allocated a majority of the cluster's processing resources and processes queries on a first in first out basis. The second set is allocated a minority of the cluster's processing resources which are shared among queries in the second set. Accordingly, the system assigns each query to the first set of queues for a fixed amount of resource time. While a query is processing, the system monitors the query's resource time and reassigns the query to the second set of queues if the query has not completed within the allotted amount of resource time. Thus, short queries receive the necessary resources to complete quickly without getting stuck behind longer queries while ensuring that longer queries continue to make progress.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

55. RETRIEVAL AND CACHING OF OBJECT METADATA ACROSS DATA SOURCES AND STORAGE SYSTEMS

Numéro d'application	18983280
Statut	En instance
Date de dépôt	2024-12-16
Date de la première publication	2025-05-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Li, Zhaoxing Singh, Rayman Preet Efeoglu, Fuat Can Tenedorio, Daniel Cai, Sarah

Abrégé

A system for retrieving and caching metadata from a remote data source is described. The system may receive a request from a client device. The request is to perform a query operation on a set of data objects stored in the remote data source. The system may access a metadata cache storing metadata information on one or more data objects of the remote data source and identify metadata corresponding to the set of data objects for the query operation in the metadata cache. The system may determine whether the identified metadata for the set of data objects meets an update condition. In response to the identified metadata meeting the update condition, the system may fetch updated metadata for at least the set of data objects from the remote data source, and store the updated metadata in the metadata cache.

Classes IPC ?

G06F 16/23 - Mise à jour
G06F 16/2455 - Exécution des requêtes

56. UPDATE AND QUERY OF A LARGE COLLECTION OF FILES THAT REPRESENT A SINGLE DATASET STORED ON A BLOB STORE

Numéro d'application	18985397
Statut	En instance
Date de dépôt	2024-12-18
Date de la première publication	2025-05-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Zhu, Shixiong Yavuz, Burak

Abrégé

A system includes an interface and a processor. The interface is configured to receive a table indication of a data table and to receive a transaction indication to perform a transaction. The processor is configured to determine a current position N in a transaction log; determine a current state of the metadata; determine a read set associated with a transaction; attempt to write an update to the transaction log associated with a next position N+1; in response to a transaction determination that a simultaneous transaction associated with the next position N+1 already exists, determine a set of updated files; and in response to a determination that there is not an overlap between the read set associated with the current transaction and the set of updated files associated with the simultaneous transaction, attempt to write the update to the transaction to the transaction log associated with a further position N+2.

Classes IPC ?

G06F 16/23 - Mise à jour
G06F 16/14 - Détails de la recherche de fichiers basée sur les métadonnées des fichiers
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

57. EVALUATING EXPRESSIONS OVER DICTIONARY DATA

Numéro d'application	19000466
Statut	En instance
Date de dépôt	2024-12-23
Date de la première publication	2025-05-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Agarwal, Utkarsh Palkar, Shoumik Behm, Alexander Krishnamurthy, Sriram

Abrégé

Disclosed herein is a method, system, or non-transitory computer readable medium for evaluating a query on a columnar dataset comprising one or more dictionaries associated with columns in the dataset. The method includes receiving a request to perform a query comprising at least an operator for a columnar dataset on cloud storage. At least one column in the dataset is based on a dictionary, and the dictionary maps one or more values for a column to one or more respective identifiers. The method evaluates the operator on one or more values of the dictionary to generate an updated dictionary comprising updated values. The method may decode the updated dictionary into an updated column comprising updated data values.

Classes IPC ?

G06F 16/2455 - Exécution des requêtes
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

58. Clustering Key Selection Based on Machine-Learned Key Selection Models for Data Processing Service

Numéro d'application	19022884
Statut	En instance
Date de dépôt	2025-01-15
Date de la première publication	2025-05-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Kim, Terry Ma, Lin Mahadev, Rahul Shivu Potharaju, Rahul

Abrégé

The disclosed configurations provide a method (and/or a computer-readable medium or system) for determining, from a table schema describing keys of a data table, one or more clustering keys that can be used to cluster data files of a data table. The method includes generating features for the data table, generating tokens from the features, generating a prediction for each token by applying to the token a machine-learned transformer model trained to predict a likelihood that the key associated with the token is a clustering key for the data table, determining clustering keys based on the predictions, and clustering data records of the data table into data files based on key-values for the clustering keys.

Classes IPC ?

G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet
G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

59. Multiple pass sort with subset splitting

Numéro d'application	17875180
Numéro de brevet	12298952
Statut	Délivré - en vigueur
Date de dépôt	2022-07-27
Date de la première publication	2025-05-13
Date d'octroi	2025-05-13
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armstrong, Timothy Krishnan, Arvind Sai Guliyev, Khayyam

Abrégé

A system for multipass sort with subsplitting includes a communication interface and a processor. The communication interface is configured to receive from a client device a request to sort a dataset that includes a plurality of rows, where the size of the dataset is greater than a threshold size. The processor is configured to: subdivide the dataset into a plurality of data subsets; sort each of the plurality of data subsets; merge the plurality of sorted data subsets utilizing a binary merge tree to generate a sorted dataset; and provide the sorted dataset to the client device.

Classes IPC ?

G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/2453 - Optimisation des requêtes

60. Data asset sharing between accounts at a data processing service using cloud tokens

Numéro d'application	18491500
Numéro de brevet	12481735
Statut	Délivré - en vigueur
Date de dépôt	2023-10-20
Date de la première publication	2025-04-24
Date d'octroi	2025-11-25
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Sun, Xiaotong Chakankar, Abhijit Chandra, Ramesh

Abrégé

A data processing service receives indication that a recipient will request access to data assets of a provider and provides a request for credentials from a recipient governance module. The recipient governance module stores a recipient metastore including an object for a provider metastore. In response to determining that the assets are associated with the provider metastore, the service provides a request for credentials to a provider governance module. The provider governance module stores the provider metastore describing data assets of the provider and permissions for accessing data assets. The provider metastore includes a recipient object attached to the data assets with an identifier for the recipient metastore. In response to verifying that the recipient was provided access to the data assets, the service provides a token to the recipient governance module. The service then provides the token to a computing resource to provide access to the data assets.

Classes IPC ?

G06F 21/31 - Authentification de l’utilisateur

61. DATA SHARING FOR NETWORK CONNECTED SYSTEMS

Numéro d'application	18958728
Statut	En instance
Date de dépôt	2024-11-25
Date de la première publication	2025-04-24
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Zaharia, Matei Zhu, Shixiong Sun, Xiaotong Chandra, Ramesh Armbrust, Michael Paul Ghodsi, Ali

Abrégé

The present application discloses a method, system, and computer system for providing access to data. The method includes receiving, by a data manager service from a data requesting service, a request using an identifier for a high-level data object to access a set of data associated with the high-level data object, determining, by the data manager service, low-level data object(s) corresponding to the set of data based on the identifier for the high-level data object, determining whether a user associated with the request has permission to access at least a subset of the low-level data object(s), and in response to determining that the user associated has permission to access the at least the subset of the low-level data object(s), generating, by the data manager service, a uniform resource locator (URL) via which the at least the subset of the one or more low-level data objects is accessible by the user.

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès
G06F 21/60 - Protection de données

62. AUTO MAINTENANCE FOR DATA TABLES IN CLOUD STORAGE

Numéro d'application	18986345
Statut	En instance
Date de dépôt	2024-12-18
Date de la première publication	2025-04-24
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Prabhakaran, Vijayan Raja, Himanshu Potharaju, Rahul Bhanoori, Naga Raju Ma, Lin Parangi Sharabhalingappa, Rajesh Liang, Jintian Schuermann, Zachary Vaughn Ting, Kam Cheung

Abrégé

Disclosed is a configuration for managing the organization of data tables in cloud-based storage. The configuration receives metrics for data processing operations on the data table. Metrics include at least one of a size of the data table, a size of each file in the data table, and metadata describing the data table. The configuration automatically executes a cost-benefit analysis based on the one or more metrics for each candidate maintenance operation in a plurality of candidate maintenance operations. The configuration automatically selects a maintenance operation from the candidate maintenance operations to automate based on the cost-benefit analysis of the one or more candidate maintenance operations. The selected maintenance operation is automated and scheduled on the data table.

Classes IPC ?

G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

63. DATABRICKS

Numéro d'application	019176233
Statut	Enregistrée
Date de dépôt	2025-04-22
Date d'enregistrement	2025-11-20
Propriétaire	Databricks, Inc. (USA)
Classes de Nice ?	09 - Appareils et instruments scientifiques et électriques 35 - Publicité; Affaires commerciales 41 - Éducation, divertissements, activités sportives et culturelles 42 - Services scientifiques, technologiques et industriels, recherche et conception

Produits et services

Software; downloadable software; downloadable software for big data processing and analytics; downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; downloadable software for application database integration; downloadable software for use in ETL (extract, transform, load) data processing; downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; downloadable software for use in data governance, namely, software featuring artificial intelligence that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; downloadable software featuring libraries for data science training; downloadable software featuring libraries for data analytics; downloadable software featuring libraries for machine learning training; downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; downloadable software featuring artificial intelligence and machine learning technology; downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; downloadable software for retrieval augmented generation (RAG); downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; downloadable software for processing images, graphics, audio, video, and text; downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; downloadable software for use in data centers and data storage; downloadable software for use as an application programming interface (API); downloadable software development tools; downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; downloadable software development kits (SDKs); downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence. Database management; Database management consultancy; data management and processing, namely, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, and building, design and management of data lakes; business consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; Business data analysis; Business data consultancy; providing an online marketplace featuring downloadable software applications, data sets, data notebooks, artificial intelligence models, machine learning models. Educational services; Conducting conferences, seminars, classes workshops, courses, and webinars; Conducting conferences, seminars, classes workshops, courses, and webinars in field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; Training services; Training services in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; Providing training for certification; Providing training for certification in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; Non-downloadable publications and blogs; Non-downloadable publications and blogs in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; Non-downloadable podcasts; Non-downloadable podcasts in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science, artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence. providing online non-downloadable software; providing online non-downloadable software for big data processing and analytics; providing of online non-downloadable software for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; providing online non-downloadable software for application database integration; providing online non-downloadable software for use in ETL (extract, transform, load) data processing; providing online non-downloadable software for compiling, organizing, visualizing, sharing and analyzing business intelligence; providing online non-downloadable software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; providing online non-downloadable desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; providing online non-downloadable software featuring libraries for data science training; providing online non-downloadable software featuring libraries for data analytics; providing online non-downloadable software featuring libraries for machine learning training; providing online non-downloadable software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; providing online non-downloadable software featuring artificial intelligence and machine learning technology; providing online non-downloadable software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; providing online non-downloadable software for retrieval augmented generation (RAG); providing online non-downloadable software for retrieval augmented generation (RAG) for use in generative AI applications; providing online non-downloadable software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; providing online non-downloadable software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; providing online non-downloadable software for modifying and enabling transmission of images, audio, audio visual and video content and data; providing online non-downloadable software for processing images, graphics, audio, video, and text; providing online non-downloadable software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; providing online non-downloadable software for use in data centers and data storage; providing online non-downloadable software for use as an application programming interface (API); providing online non-downloadable software development tools; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; providing online non-downloadable software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; providing online non-downloadable software development kits (SDKs); software as a service (SAAS) services; software as a service (SAAS) services featuring software for big data processing and analytics; software as a service (SAAS) services, namely, featuring software use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; software as a service (SAAS) services featuring software for application database integration; software as a service (SAAS) services featuring software for use in ETL (extract, transform, load) data processing; software as a service (SAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence for use in data governance, namely, software that allows users to define and manage policies for gathering, querying, storing, processing, sharing, accessing and disposing of data, create data clean rooms, and track data lineage; software as a service (SAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; software as a service (SAAS) services featuring software featuring libraries for data science training; software as a service (SAAS) services featuring software featuring libraries for data analytics; software as a service (SAAS) services featuring software featuring libraries for machine learning training; software as a service (SAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, , predictive analytics and business intelligence; software as a service (SAAS) services featuring software featuring artificial intelligence and machine learning technology; software as a service (SAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software for retrieval augmented generation (RAG); software as a service (SAAS) services featuring software for retrieval augmented generation (RAG) for use in generative AI applications; software as a service (SAAS) services featuring software for retrieval augmented generation (RAG) for use in data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; software as a service (SAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; software as a service (SAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; software as a service (SAAS) services featuring software for processing images, graphics, audio, video, and text; software as a service (SAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; software as a service (SAAS) services featuring software for use in data centers and data storage; software as a service (SAAS) services featuring software for use as an application programming interface (API); software as a service (SAAS) services featuring software development tools; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; software as a service (SAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; software as a service (SAAS) services featuring software development kits (SDKs); platform as a services (PAAS) services; platform as a services (PAAS) services featuring software for big data processing and analytics; platform as a services (PAAS) services, namely, featuring software for data analytics, data migration, data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data governance, data science; platform as a services (PAAS) services, namely, featuring software for accessing, managing and connecting to data lakes, data warehouses, data assets, data files, data sources; platform as a services (PAAS) services featuring software for use in ETL (extract, transform, load) data processing; platform as a services (PAAS) services featuring software for compiling, organizing, visualizing, sharing and analyzing business intelligence; platform as a services (PAAS) services featuring desktop and mobile computing and operating platforms consisting of data transceivers, wireless networks and gateways, for collection, analysis, sharing, interpretation and management of data; platform as a services (PAAS) services featuring software featuring libraries for data science training; platform as a services (PAAS) services featuring software featuring libraries for data analytics; platform as a services (PAAS) services featuring software featuring libraries for machine learning training; platform as a services (PAAS) services featuring software featuring libraries for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, , predictive analytics and business intelligence; platform as a services (PAAS) services featuring software featuring artificial intelligence and machine learning technology; platform as a services (PAAS) services featuring software for artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software for collecting, managing, editing, organizing, modifying, transmitting, sharing, and storing of data; platform as a services (PAAS) services featuring software for modifying and enabling transmission of images, audio, audio visual and video content and data; platform as a services (PAAS) services featuring software for processing images, graphics, audio, video, and text; platform as a services (PAAS) services featuring software for transmitting, sharing, receiving, downloading, displaying, interacting with and transferring content, text, visual works, audio works, audiovisual works, literary works, data, files, documents and electronic works; platform as a services (PAAS) services featuring software for use in data centers and data storage; platform as a services (PAAS) services featuring software for use as an application programming interface (API); platform as a services (PAAS) services featuring software development tools; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications for use in data integration, data warehousing, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, and data analytics; platform as a services (PAAS) services featuring software development tools for building, designing, deploying, and monitoring applications featuring artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence; platform as a services (PAAS) services featuring software development kits (SDKs); Data mining; computer services, namely, hosting of search platforms on the Internet to allow users to index, integrate, warehouse, mine, process, share, collect, interpret, research, query, visualize, and analyze data; computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs); computer services, namely, hosting of artificial intelligence models, machine learning models, and large language models (LLMs) to allow users to perform search queries and develop inference; application service provider services, namely, hosting, managing, developing and maintaining applications, and software of others in the fields of big data processing and analytics, data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science; custom design and development of computer software; Software engineering services for data processing; development and creation of computer programs for data processing and analysis; Providing a website featuring information in the fields of technology, computers, computer software, computer networks, web services, mobile computing and artificial intelligence; Technology consulting in the field of data analytics, data migration, data importing, data wrangling, data mining, data processing, data sharing, data collection, data interpretation, data queries, data visualization, data integration, data warehousing, data processing, data governance, data science and building, design and management of data lakes; Technology consulting in the field of artificial intelligence, machine learning, deep learning, large language models (LLMs), natural language generation, statistical learning, supervised learning, unsupervised learning, predictive analytics and business intelligence.

64. Using LLM functions to evaluate and compare large text outputs of LLMs

Numéro d'application	18518155
Numéro de brevet	12579378
Statut	Délivré - en vigueur
Date de dépôt	2023-11-22
Date de la première publication	2025-04-17
Date d'octroi	2026-03-17
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gupta, Ridhima Kannan, Prithvi Sheth, Sunish Sohil Uhlenhuth, Kasey Zub, Hubert Zumar, Corey

Abrégé

A method for evaluating textual output of one or more machine-learned language models is presented. The method includes receiving, from a user of a client device, a first prompt for input to one or more machine-learned language models, providing the first prompt to the one or more models for execution, and receiving a set of generated responses to the first prompt from the one or more models. The method further includes generating a user interface (UI) on the client device displaying the first prompt and generated responses as a table user interface element. The method applies a selected evaluation function to the generated response to evaluate the response with respect to an evaluation objective and identifies words that influence the evaluation. The method generates one or more UI elements on the UI to display the results of the evaluation for the generated responses.

Classes IPC ?

G06F 40/40 - Traitement ou traduction du langage naturel
G06F 40/103 - Mise en forme, c.-à-d. modification de l’apparence des documents
G06F 40/30 - Analyse sémantique

65. Concurrent optimistic transactions for tables with deletion vectors

Numéro d'application	18928982
Numéro de brevet	12596700
Statut	Délivré - en vigueur
Date de dépôt	2024-10-28
Date de la première publication	2025-03-27
Date d'octroi	2026-04-07
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Samwel, Bart Stavrakakis, Christos

Abrégé

A disclosed configuration receives a first indication that a first transaction is committed to update a first subset of records in a data table at a first version to generate a second version of the data table and receiving a second indication to commit a second transaction to update a second subset of records in a data file of the data table at the first version. The configuration determines a logical prerequisite based on whether the first subset of records changes content of one or more records in the second subset of records and determining a physical prerequisite on whether the second subset of records corresponds to respective data records in data files of the second version of the data table. The configuration commits the second transaction to generate a third version of the data table by updating elements of the deletion vector if the prerequisites are satisfied.

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/23 - Mise à jour

66. Clean room generation for data collaboration and executing clean room task in data processing pipeline

Numéro d'application	18474708
Numéro de brevet	12260003
Statut	Délivré - en vigueur
Date de dépôt	2023-09-26
Date de la première publication	2025-03-25
Date d'octroi	2025-03-25
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Chau, William Chakankar, Abhijit Mahoney, Stephen Michael Morris, Daniel Seth Weiss, Itai Shlomo

Abrégé

Classes IPC ?

G06F 21/00 - Dispositions de sécurité pour protéger les calculateurs, leurs composants, les programmes ou les données contre une activité non autorisée
G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès

67. RESOURCE MANAGEMENT WITH INTERMEDIARY NODE IN KUBERNETES ENVIRONMENT

Numéro d'application	18368919
Statut	En instance
Date de dépôt	2023-09-15
Date de la première publication	2025-03-20
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Davidson, Aaron Daniel Garnier, Thomas Guo, Lin He, Zhe Li, Manlin Liu, Yang Wang, Feng Zhang, Hong Zhu, Weirong

Abrégé

A resource management configuration may receive an API request from an API server. The API request specifies task information from a plurality of tenants. The configuration transmits status information of a plurality of VMs to the API server to assign tasks to one or more VMs based on the task information and the status information. Tasks assigned to a VM of the plurality of VMs are for one tenant of the plurality of tenants. The configuration configures on an untrusted network, network security groups for managing communications of tenants such that a network security group configured for a tenant permits communications between VMs assigned to the same tenant but prevents communications between VMs assigned to different tenants. The configuration pins each assigned VM of the one or more assigned VMs to perform the task based on the task information of the corresponding tenant.

Classes IPC ?

G06F 9/455 - ÉmulationInterprétationSimulation de logiciel, p. ex. virtualisation ou émulation des moteurs d’exécution d’applications ou de systèmes d’exploitation
G06F 9/54 - Communication interprogramme

68. STRUCTURED CLUSTER EXECUTION FOR DATA STREAMS

Numéro d'application	18745847
Statut	En instance
Date de dépôt	2024-06-17
Date de la première publication	2025-03-13
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armbrust, Michael Paul Das, Tathagata Xin, Shi Zaharia, Matei

Abrégé

A system for executing a streaming query includes an interface and a processor. The interface is configured to receive a logical query plan. The processor is configured to determine a physical query plan based at least in part on the logical query plan. The physical query plan comprises an ordered set of operators. Each operator of the ordered set of operators comprises an operator input mode and an operator output mode. The processor is further configured to execute the physical query plan using the operator input mode and the operator output mode for each operator of the query.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 16/2455 - Exécution des requêtes

69. K-D tree balanced splitting

Numéro d'application	18772758
Numéro de brevet	12561303
Statut	Délivré - en vigueur
Date de dépôt	2024-07-15
Date de la première publication	2025-03-13
Date d'octroi	2026-02-24
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Samwel, Bart Jain, Prakhar

Abrégé

A system for clustering data into corresponding files comprises one or more processors and a memory. The one or more processors is/are configured to: 1) determine to cluster a set of data into a set of files; 2) determine a set of split points in a corresponding set of dimensions of the set of data to determine the set of files, wherein each file of the set of files has an approximate target size; and 3) store one or more items of the set of data into a corresponding file of the set of files based at least in part on the set of split points. The memory is coupled to the one or more processors and configured to provide the processor with instructions.

Classes IPC ?

G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/27 - Réplication, distribution ou synchronisation de données entre bases de données ou dans un système de bases de données distribuéesArchitectures de systèmes de bases de données distribuées à cet effet
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

70. Reducing cluster start up time

Numéro d'application	17514988
Numéro de brevet	12248818
Statut	Délivré - en vigueur
Date de dépôt	2021-10-29
Date de la première publication	2025-03-11
Date d'octroi	2025-03-11
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Mao, Yandong Davidson, Aaron Daniel

Abrégé

Classes IPC ?

G06F 9/50 - Allocation de ressources, p. ex. de l'unité centrale de traitement [UCT]
G06F 21/45 - Structures ou outils d’administration de l’authentification

71. Data lineage tracking

Numéro d'application	18162562
Numéro de brevet	12242441
Statut	Délivré - en vigueur
Date de dépôt	2023-01-31
Date de la première publication	2025-03-04
Date d'octroi	2025-03-04
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Feng, Tao Sun, Menglei Wang, Zhuoying

Abrégé

Classes IPC ?

G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet
G06F 11/07 - Réaction à l'apparition d'un défaut, p. ex. tolérance de certains défauts
G06F 16/215 - Amélioration de la qualité des donnéesNettoyage des données, p. ex. déduplication, suppression des entrées non valides ou correction des erreurs typographiques
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/23 - Mise à jour
G06F 16/906 - GroupementClassement
G06F 17/18 - Opérations mathématiques complexes pour l'évaluation de données statistiques

72. Automated Processing of Multiple Prediction Generation Including Model Tuning

Numéro d'application	18738025
Statut	En instance
Date de dépôt	2024-06-09
Date de la première publication	2025-02-20
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Wilson, Benjamin Thomas Zumar, Corey

Abrégé

The present application discloses a method, system, and computer system for building a model associated with a dataset. The method includes receiving a data set, the dataset comprising a plurality of keys and a plurality of key-value relationships, determining a plurality of models to build based at least in part on the dataset, wherein determining the plurality of models to build comprises using the dataset format information to identify the plurality of models, building the plurality of models, and optimizing at least one of the plurality of models.

Classes IPC ?

G06N 20/00 - Apprentissage automatique
G06F 18/20 - Analyse
G06F 18/2132 - Extraction de caractéristiques, p. ex. en transformant l'espace des caractéristiquesSynthétisationsMappages, p. ex. procédés de sous-espace basée sur des critères de discrimination, p. ex. l'analyse discriminante

73. STATE REBALANCING IN STRUCTURED STREAMING

Numéro d'application	18822023
Statut	En instance
Date de dépôt	2024-08-30
Date de la première publication	2025-02-20
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Balikov, Alexander Das, Tathagata Ramasamy, Karthikeyan

Abrégé

A data processing service performs a rebalancing process for rebalancing stateful tasks on a cluster computing system. In one instance, the method for rebalancing stateful tasks is performed such that the per-operator partitions are spread across available executors of a cluster of the cluster computing system with respect to one or more statistics of the tasks. In one instance, the method for rebalancing stateful tasks is also performed such that the total number of stateful tasks are balanced per executor as long as this rebalancing does not imbalance the per-operator placements. In this way, the processing of stateful tasks can be spread across multiple executors in a relatively uniform manner, even though there may be an upfront cost of breaking the local caching on an executor.

Classes IPC ?

G06F 16/27 - Réplication, distribution ou synchronisation de données entre bases de données ou dans un système de bases de données distribuéesArchitectures de systèmes de bases de données distribuées à cet effet
G06F 16/2455 - Exécution des requêtes

74. Checkpoint and restore based startup of executor nodes of a distributed computing engine for processing queries

Numéro d'application	18412438
Numéro de brevet	12229137
Statut	Délivré - en vigueur
Date de dépôt	2024-01-12
Date de la première publication	2025-02-18
Date d'octroi	2025-02-18
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ge, Xinyang Ao, Lixiang Jing, Haonan Davidson, Aaron Daniel

Abrégé

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes

75. Clustering key selection based on machine-learned key selection models for data processing service

Numéro d'application	18501830
Numéro de brevet	12229169
Statut	Délivré - en vigueur
Date de dépôt	2023-11-03
Date de la première publication	2025-02-18
Date d'octroi	2025-02-18
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Kim, Terry Ma, Lin Mahadev, Rahul Shivu Potharaju, Rahul

Abrégé

Classes IPC ?

G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet
G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

76. MESSAGING DEDPULICATION IN PUBLISH / SUBSCRIBE SYSTEM

Numéro d'application	18224981
Statut	En instance
Date de dépôt	2023-07-21
Date de la première publication	2025-01-23
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Anand, Pranav Gattu, Praveen Shrigondekar, Anish Wang, Huanli

Abrégé

A device for using message identifiers for Publish/subscribe messaging deduplication is described. The system may fetch one or more sets of data records from a data source, and each data record is associated with a message identifier. The system may store the one or more sets of data records in a data file, which is associated with a metadata comprising the message identifier, a file path and a row number for each data record. The system may determine whether one or more of the data records are duplicated based on the associated message identifiers. In response to determining that the one or more data records are duplicated, the system may generate a second metadata comprising the file paths and row numbers associated with the duplicated data records.

Classes IPC ?

G06F 16/174 - Élimination de redondances par le système de fichiers
G06F 16/14 - Détails de la recherche de fichiers basée sur les métadonnées des fichiers
G06F 16/16 - Opérations sur les fichiers ou les dossiers, p. ex. détails des interfaces utilisateur spécialement adaptées aux systèmes de fichiers

77. Model ML registry and model serving

Numéro d'application	18885322
Numéro de brevet	12541491
Statut	Délivré - en vigueur
Date de dépôt	2024-09-13
Date de la première publication	2025-01-16
Date d'octroi	2026-02-03
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Davidson, Aaron Daniel Mewald, Clemens Nykodym, Tomas

Abrégé

A system includes an interface, a processor, and a memory. The interface is configured to receive a version of a model from a model registry. The processor is configured to store the version of the model, start a process running the version of the model, and update a proxy with version information associated with the version of the model, wherein the updated proxy indicates to redirect an indication to invoke the version of the model to the process. The memory is coupled to the processor and configured to provide the processor with instructions.

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 16/955 - Recherche dans le Web utilisant des identifiants d’information, p. ex. des localisateurs uniformisés de ressources [uniform resource locators - URL]
G06N 5/022 - Ingénierie de la connaissanceAcquisition de la connaissance

78. Clean room generation for data collaboration

Numéro d'application	18473992
Numéro de brevet	12197400
Statut	Délivré - en vigueur
Date de dépôt	2023-09-25
Date de la première publication	2025-01-14
Date d'octroi	2025-01-14
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Chau, William Chakankar, Abhijit Mahoney, Stephen Michael Morris, Daniel Seth Weiss, Itai Shlomo

Abrégé

A data processing service receives a request from a first collaborator to create a clean room for data sharing collaboration with at least a second collaborator. In response, the data processing service creates an execution environment separate from the data environment of the first collaborator and the second collaborator. The first and second collaborators can then add content into the clean room in the form of data tables and executable notebooks. Approval from each collaborator is required before a notebook can be executed using any data table shared into the clean room. Upon receiving notebook approval from each collaborator, the data processing service creates a notebook job to execute the notebook on one or more cluster computing resources of the data processing service to generate an output.

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/21 - Conception, administration ou maintenance des bases de données

79. Efficient Merging of Tabular Data with Post-Processing Compaction

Numéro d'application	18769269
Statut	En instance
Date de dépôt	2024-07-10
Date de la première publication	2025-01-09
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Samwel, Bart Das, Tathagata Kroll, Lars Cui, Yijia Sompolski, Juliusz Van Bussel, Tom Jain, Prakhar

Abrégé

A method, system, and computer system for performing an operation with respect to a target table are disclosed. The method includes performing first and second jobs, obtaining one or more other resulting files based at least in part on unmatched rows, and obtaining a set of processed files based at least in part on performing a post-processing operation with respect to the set of resulting files. The set of processed files has less files than the set of resulting files. Performing the first job includes determining a set of matching target table files and storing target table information indicating for each of the set of matching target table files, a particular set of rows having matching rows. Performing the second job includes performing a matching action based on matched rows and obtaining the second job resulting file(s).

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

80. Data file clustering with KD-classifier trees

Numéro d'application	18218410
Numéro de brevet	12405920
Statut	Délivré - en vigueur
Date de dépôt	2023-07-05
Date de la première publication	2025-01-09
Date d'octroi	2025-09-02
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jain, Prakhar Johnson, Frederick Ryan Kim, Terry Prabhakaran, Vijayan Samwel, Bart

Abrégé

A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.

Classes IPC ?

G06F 16/10 - Systèmes de fichiersServeurs de fichiers
G06F 16/13 - Structures d’accès aux fichiers, p. ex. indices distribués
G06F 16/16 - Opérations sur les fichiers ou les dossiers, p. ex. détails des interfaces utilisateur spécialement adaptées aux systèmes de fichiers
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

81. Data file clustering with KD-epsilon trees

Numéro d'application	18218766
Numéro de brevet	12332862
Statut	Délivré - en vigueur
Date de dépôt	2023-07-06
Date de la première publication	2025-01-09
Date d'octroi	2025-06-17
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jain, Prakhar Johnson, Frederick Ryan Samwel, Bart

Abrégé

A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.

Classes IPC ?

G06F 16/20 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet de données structurées, p. ex. de données relationnelles
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/23 - Mise à jour
G06F 16/245 - Traitement des requêtes
G06F 16/2453 - Optimisation des requêtes
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

82. Data Retrieval Using Distributed Workers in a Large-Scale Data Access System

Numéro d'application	18771892
Statut	En instance
Date de dépôt	2024-07-12
Date de la première publication	2025-01-02
Propriétaire	DATABRICKS, INC. (USA)
Inventeur(s)	Khurana, Amandeep Li, Nong

Abrégé

Disclosed herein provides enhancements for operating a data access application service executing on a data access server system and an external computing system. In the data access server system, a request is received from a client device executing at least one of multiple application services for a dataset from one or more of multiple storage systems. In the data access server system, a data retrieval instruction is generated for the client device to access the dataset from the one or more of the multiple storage systems. The data retrieval instruction comprises task descriptions and a temporary credential. The data retrieval instruction is transferred to the external computing system via the client device and the requested dataset is retrieved and deployed based on the task descriptions and the temporary credential from the one or more of the multiple storage systems.

Classes IPC ?

G06F 16/27 - Réplication, distribution ou synchronisation de données entre bases de données ou dans un système de bases de données distribuéesArchitectures de systèmes de bases de données distribuées à cet effet
G06F 9/54 - Communication interprogramme
G06F 16/2455 - Exécution des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

83. Data sharing for network connected systems

Numéro d'application	18162353
Numéro de brevet	12182292
Statut	Délivré - en vigueur
Date de dépôt	2023-01-31
Date de la première publication	2024-12-31
Date d'octroi	2024-12-31
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Zaharia, Matei Zhu, Shixiong Sun, Xiaotong Chandra, Ramesh Armbrust, Michael Paul Ghodsi, Ali

Abrégé

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès
G06F 21/00 - Dispositions de sécurité pour protéger les calculateurs, leurs composants, les programmes ou les données contre une activité non autorisée
G06F 21/60 - Protection de données

84. FEATURE FUNCTION BASED COMPUTATION OF ON-DEMAND FEATURES OF MACHINE LEARNING MODELS

Numéro d'application	18206460
Statut	En instance
Date de dépôt	2023-06-06
Date de la première publication	2024-12-12
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Zaharia, Matei Singh, Avesh Parkhe, Mani Lukiyanov, Maxim Meng, Xiangrui Talati, Aakrati Liang, Chenen Uhlenhuth, Kasey

Abrégé

A system performs training and execution of machine learning models that use on-demand features using feature functions. The system receives commands for registering metadata associated with a machine learning model. The machine learning model may process a set of features including on-demand features as well as other features such as batch features. The system executes the command by storing an association between the machine learning model and the feature functions associated with any on-demand features processed by the machine learning model. The feature functions are executed using an end point of a data asset service. The use of the data asset service for invoking the feature functions ensures that the same set of instructions is executed during model training and model inferencing, thereby avoiding model skew.

Classes IPC ?

G06N 20/00 - Apprentissage automatique

85. Fetching query results through cloud object stores

Numéro d'application	18614380
Numéro de brevet	12399901
Statut	Délivré - en vigueur
Date de dépôt	2024-03-22
Date de la première publication	2024-11-28
Date d'octroi	2025-08-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ghit, Bogdan Ionut Sompolski, Juliusz Xin, Shi Samwel, Bart

Abrégé

The system is configured to: 1) receive a client request; 2) determine executor(s) to generate a response to the user request; 3) provide each of the executor(s) with an indication; 4) receive for each indication a response including an output of either a cloud output or an in-line output to generate a group of in-line outputs and a group of cloud outputs; 5) determine whether the group of in-line outputs comprises all outputs; and 6) in response to the group of in-line outputs not comprising all the outputs for the client request: a) convert the group of in-line outputs to a converted group of cloud outputs; b) generate metadata for the converted group of cloud outputs and the group of cloud outputs; and c) provide response to the client request including the metadata for the converted group of cloud outputs and the group of cloud outputs.

Classes IPC ?

G06F 16/2458 - Types spéciaux de requêtes, p. ex. requêtes statistiques, requêtes floues ou requêtes distribuées
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/242 - Formulation des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données

86. Hash based rollup with passthrough

Numéro d'application	18162093
Numéro de brevet	12153558
Statut	Délivré - en vigueur
Date de dépôt	2023-01-31
Date de la première publication	2024-11-26
Date d'octroi	2024-11-26
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Behm, Alexander Dave, Ankur

Abrégé

A system includes a plurality of computing units. A first computing unit of the plurality of computing units comprises: a communication interface configured to receive an indication to roll up data in a data table; and a processor coupled to the communication interface and configured to: build a preaggregation hash table based at least in part on a set of columns and the data table by aggregating input rows of the data table; for each preaggregated hash table entry of the preaggregated hash table: provide the preaggregated hash table entry to a second computing unit of the plurality of computing units based at least in part on a distribution hash value; receive a set of received entries from computing units of the plurality of computing units; and build an aggregation hash table based at least in part on the set of received entries by aggregating the set of received entries.

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/13 - Structures d’accès aux fichiers, p. ex. indices distribués
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/242 - Formulation des requêtes
G06F 16/2455 - Exécution des requêtes
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

87. Data sharing for network connected systems

Numéro d'application	17733485
Numéro de brevet	12147555
Statut	Délivré - en vigueur
Date de dépôt	2022-04-29
Date de la première publication	2024-11-19
Date d'octroi	2024-11-19
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Zaharia, Matei Zhu, Shixiong Sun, Xiaotong Chandra, Ramesh Armbrust, Michael Paul Ghodsi, Ali

Abrégé

Classes IPC ?

G06F 21/62 - Protection de l’accès à des données via une plate-forme, p. ex. par clés ou règles de contrôle de l’accès
G06F 21/00 - Dispositions de sécurité pour protéger les calculateurs, leurs composants, les programmes ou les données contre une activité non autorisée
G06F 21/60 - Protection de données

88. Auto maintenance for data tables in cloud storage

Numéro d'application	18144647
Numéro de brevet	12204510
Statut	Délivré - en vigueur
Date de dépôt	2023-05-08
Date de la première publication	2024-11-14
Date d'octroi	2025-01-21
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Prabhakaran, Vijayan Raja, Himanshu Potharaju, Rahul Bhanoori, Naga Raju Ma, Lin Parangi Sharabhalingappa, Rajesh Liang, Jintian Schuermann, Zachary Vaughn Ting, Kam Cheung

Abrégé

Classes IPC ?

G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

89. Short query prioritization for data processing service

Numéro d'application	18140323
Numéro de brevet	12210521
Statut	Délivré - en vigueur
Date de dépôt	2023-04-27
Date de la première publication	2024-10-31
Date d'octroi	2025-01-28
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Gudesa, Venkata Sai Akhil Van Hövell Tot Westerflier, Herman Rudolf Petrus Catharina Nakandala, Supun Chathuranga

Abrégé

Classes IPC ?

G06F 16/24 - Requêtes
G06F 9/48 - Lancement de programmes Commutation de programmes, p. ex. par interruption
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/2453 - Optimisation des requêtes
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

90. Retrieval and caching of object metadata across data sources and storage systems

Numéro d'application	18135078
Numéro de brevet	12204523
Statut	Délivré - en vigueur
Date de dépôt	2023-04-14
Date de la première publication	2024-10-17
Date d'octroi	2025-01-21
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Li, Zhaoxing Singh, Rayman Preet Efeoglu, Fuat Can Tenedorio, Daniel Cai, Sarah

Abrégé

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/23 - Mise à jour
G06F 16/2455 - Exécution des requêtes

91. Multiple pass sort

Numéro d'application	17875176
Numéro de brevet	12105690
Statut	Délivré - en vigueur
Date de dépôt	2022-07-27
Date de la première publication	2024-10-01
Date d'octroi	2024-10-01
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Armstrong, Timothy Krishnan, Arvind Sai Guliyev, Khayyam

Abrégé

Classes IPC ?

G06F 16/00 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/2455 - Exécution des requêtes

92. Scaling delta table optimize command

Numéro d'application	18093916
Numéro de brevet	12079167
Statut	Délivré - en vigueur
Date de dépôt	2023-01-06
Date de la première publication	2024-09-03
Date d'octroi	2024-09-03
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Mahadev, Rahul Shivu Yavuz, Burak Das, Tathagata

Abrégé

Classes IPC ?

G06F 16/172 - Mise en cache, pré-extraction ou accumulation de fichiers
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

93. Data ingestion using data file clustering with KD-epsilon trees

Numéro d'application	18218400
Numéro de brevet	12072863
Statut	Délivré - en vigueur
Date de dépôt	2023-07-05
Date de la première publication	2024-08-27
Date d'octroi	2024-08-27
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jain, Prakhar Johnson, Frederick Ryan Samwel, Bart

Abrégé

Classes IPC ?

G06F 16/20 - Recherche d’informationsStructures de bases de données à cet effetStructures de systèmes de fichiers à cet effet de données structurées, p. ex. de données relationnelles
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage
G06F 16/23 - Mise à jour
G06F 16/245 - Traitement des requêtes
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

94. Data maintenance transaction rollbacks

Numéro d'application	17580475
Numéro de brevet	12072843
Statut	Délivré - en vigueur
Date de dépôt	2022-01-20
Date de la première publication	2024-08-27
Date d'octroi	2024-08-27
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Jain, Prakhar Samwel, Bart Yavuz, Burak

Abrégé

Classes IPC ?

G06F 16/174 - Élimination de redondances par le système de fichiers

95. Multi-cluster query result caching

Numéro d'application	18221735
Numéro de brevet	12360995
Statut	Délivré - en vigueur
Date de dépôt	2023-07-13
Date de la première publication	2024-08-08
Date d'octroi	2025-07-15
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ghit, Bogdan Ionut Garg, Saksham Stuart, Christian Stevens, Christopher

Abrégé

A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The control layer may be configured to receive and process requests from the client devices and manage resources in the data layer. The data layer may be configured to include instances of clusters of computing resources for executing jobs. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.

Classes IPC ?

G06F 16/24 - Requêtes
G06F 16/2453 - Optimisation des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

96. Multi-cluster query result caching

Numéro d'application	18222343
Numéro de brevet	12189625
Statut	Délivré - en vigueur
Date de dépôt	2023-07-14
Date de la première publication	2024-08-08
Date d'octroi	2025-01-07
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Ghit, Bogdan Ionut Garg, Saksham Stuart, Christian Stevens, Christopher

Abrégé

A multi-cluster computing system which includes a query result caching system is presented. The multi-cluster computing system may include a data processing service and client devices communicatively coupled over a network. The data processing service may include a control layer and a data layer. The control layer may be configured to receive and process requests from the client devices and manage resources in the data layer. The data layer may be configured to include instances of clusters of computing resources for executing jobs. The data layer may include a data storage system, which further includes a remote query result cache Store. The query result cache store may include a cloud storage query result cache which stores data associated with results of previously executed requests. As such, when a cluster encounters a previously executed request, the cluster may efficiently retrieve the cached result of the request from the in-memory query result cache or the cloud storage query result cache.

Classes IPC ?

G06F 16/24 - Requêtes
G06F 16/2453 - Optimisation des requêtes
G06F 16/25 - Systèmes d’intégration ou d’interfaçage impliquant les systèmes de gestion de bases de données
G06F 16/28 - Bases de données caractérisées par leurs modèles, p. ex. des modèles relationnels ou objet

97. RUNTIME ERROR ATTRIBUTION FOR DATABASE QUERIES SPECIFIED USING A DECLARATIVE DATABASE QUERY LANGUAGE

Numéro d'application	CN2023073691
Numéro de publication	2024/156113
Statut	Délivré - en vigueur
Date de dépôt	2023-01-29
Date de publication	2024-08-02
Propriétaire	DATABRICKS , INC. (USA)
Inventeur(s)	Fan, Wenchen Rielau, Serge Shen, Entong

Abrégé

A system executes database queries specified using a declarative database query language such as the structured query language (SQL). The system determines whether a runtime error is encountered during execution of a query, for example, a division by zero error, resource usage errors such as out of memory error, time out error, and so on. The system reports such runtime errors encountered during execution of a database query. The system identifies one or more origins of the runtime error in the database query. The origin identifies a portion of the database query that represents a cause of the runtime error. Reporting the origin of a runtime error in the database query simplifies the task of development and testing of database queries.

Classes IPC ?

G06F 16/21 - Conception, administration ou maintenance des bases de données
G06F 16/24 - Requêtes

98. STATIC APPROACH TO LAZY MATERIALIZATION IN DATABASE SCANS USING PUSHED FILTERS

Numéro d'application	18160850
Statut	En instance
Date de dépôt	2023-01-27
Date de la première publication	2024-08-01
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Palkar, Shoumik Behm, Alexander Mokhtar, Mostafa Krishnamurthy, Sriram

Abrégé

Disclosed herein is a method for determining whether to apply a lazy materialization technique to a query run. The method includes receiving a request to perform a new query in a columnar database containing a plurality of columns. A step in the method includes accessing a set of data in a column of the plurality of columns based on the query. The method includes generating an input to a machine-learned model comprising characteristics of the set of data in the column. From the machine-learned model, the method includes generating a likelihood value indicative of whether a filter of a first portion of the set of data in the column has greater efficiency than a download followed by a filter of the set of data in the column. The method further includes comparing the likelihood value to a threshold value. Based on the comparison, the method includes filtering the first portion of the set of data before downloading the set of data if the likelihood value is equal to or above the threshold value.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

99. Adaptive approach to lazy materialization in database scans using pushed filters

Numéro d'application	18160861
Numéro de brevet	12124450
Statut	Délivré - en vigueur
Date de dépôt	2023-01-27
Date de la première publication	2024-08-01
Date d'octroi	2024-10-22
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Palkar, Shoumik Behm, Alexander Mokhtar, Mostafa Krishnamurthy, Sriram

Abrégé

Disclosed herein is a method for determining whether to apply a lazy materialization technique to a query run. A data processing service receives a request to perform a query identifying a filter column and a non-filter column in a columnar database. The data processing service accesses a first task of contiguous rows in the filter column from a cloud-based object storage. The data processing service applies a filter defined by the query to the first task. The data processing service generates filter results for the first task that may include a percentage of the first task discarded and a run-time. The data processing service determines, based on the filter results for the first task, a likelihood value that indicates a likelihood of gaining a performance benefit by applying the lazy materialization technique to a second task of the query.

Classes IPC ?

G06F 16/2453 - Optimisation des requêtes
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

100. Evaluating expressions over dictionary data

Numéro d'application	18162607
Numéro de brevet	12210528
Statut	Délivré - en vigueur
Date de dépôt	2023-01-31
Date de la première publication	2024-08-01
Date d'octroi	2025-01-28
Propriétaire	Databricks, Inc. (USA)
Inventeur(s)	Agarwal, Utkarsh Palkar, Shoumik Behm, Alexander Krishnamurthy, Sriram

Abrégé

Classes IPC ?

G06F 16/2455 - Exécution des requêtes
G06F 11/34 - Enregistrement ou évaluation statistique de l'activité du calculateur, p. ex. des interruptions ou des opérations d'entrée–sortie
G06F 16/22 - IndexationStructures de données à cet effetStructures de stockage

1 2 Prochaine page