Ab Initio Technology LLC

United States of America

Back to Profile

1-100 of 810 for Ab Initio Technology LLC Sort by
Query
Aggregations
Jurisdiction
        United States 355
        Canada 237
        World 218
Date
2026 May 2
2026 April 1
2026 (YTD) 7
2025 40
2024 48
See more
IPC Class
G06F 17/30 - Information retrieval; Database structures therefor 131
G06F 9/44 - Arrangements for executing specific programs 91
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU] 76
G06F 8/34 - Graphical or visual programming 72
G06F 16/901 - IndexingData structures thereforStorage structures 56
See more
Status
Pending 89
Registered / In Force 721
Found results for  patents
  1     2     3     ...     9        Next Page

1.

AUTOMATED MODIFICATION OF COMPUTER PROGRAMS

      
Application Number 19440179
Status Pending
Filing Date 2026-01-05
First Publication Date 2026-05-07
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Bach, Edward Alan

Abstract

A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.

IPC Classes  ?

  • G06F 8/65 - Updates
  • G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database systemDistributed database system architectures therefor
  • H04L 67/00 - Network arrangements or protocols for supporting network services or applications
  • H04L 67/06 - Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]

2.

DATA PROCESSING SYSTEM FOR AUTOMATIC PROCESSING OF CONTINUOUS FLOWS OR BATCH DATA

      
Application Number US2025053064
Publication Number 2026/096598
Status In Force
Filing Date 2025-10-29
Publication Date 2026-05-07
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
Inventor Wholey, Joseph, Skeffington

Abstract

Techniques for executing a data processing application in an environment in which there can be a plurality of data sources including continuous data sources and batch data sources. A data processing application may be representable as a plurality of input nodes and a plurality of processing nodes. The techniques include: for a node of the plurality of processing nodes having a first input configured at the time of execution of the application to receive batch data and a second input configured to receive continuous data: computing first data by executing data processing operations of the data processing application between the first input of the node and one or more data sources of the plurality of data sources on data from the one or more data sources; and storing the first data; and configuring the data processing system to, when executing the data processing application, use the stored first data as the first input to the node.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
  • G06F 8/34 - Graphical or visual programming
  • G06F 8/35 - Creation or generation of source code model driven
  • G06F 8/41 - Compilation
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms

3.

DATA PROCESSING SYSTEM FOR AUTOMATIC PROCESSING OF CONTINUOUS FLOWS OR BATCH DATA

      
Application Number 19373041
Status Pending
Filing Date 2025-10-29
First Publication Date 2026-04-30
Owner Ab Initio Technology LLC (USA)
Inventor Wholey, Iii, Joseph Skeffington

Abstract

Techniques for executing a data processing application in an environment in which there can be a plurality of data sources including continuous data sources and batch data sources. A data processing application may be representable as a plurality of input nodes and a plurality of processing nodes. The techniques include: for a node of the plurality of processing nodes having a first input configured at the time of execution of the application to receive batch data and a second input configured to receive continuous data: computing first data by executing data processing operations of the data processing application between the first input of the node and one or more data sources of the plurality of data sources on data from the one or more data sources; and storing the first data; and configuring the data processing system to, when executing the data processing application, use the stored first data as the first input to the node.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

4.

GENERATING, ACCESSING, AND DISPLAYING LINEAGE METADATA

      
Application Number 19314968
Status Pending
Filing Date 2025-08-29
First Publication Date 2026-04-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Clemens, David
  • Radivojevic, Dusan
  • Galarneau, Neil

Abstract

Among other things, we describe a method of receiving a portion of metadata from a data source, the portion of metadata describing nodes and edges; generating instances of a data structure representing the portion of metadata, at least one instance of the data structure including an identification value that identifies a corresponding node, one or more property values representing respective properties of the corresponding node, and one or more pointers to respective identification values, each pointer representing an edge associated with a node identified by the corresponding respective identification value; storing the instances of the data structure in random access memory; receiving a query that includes an identification of at least one particular element of data; and using at least one instance of the data structure to cause a display of a computer system to display a representation of lineage of the particular element of data.

IPC Classes  ?

5.

DATASET MULTIPLEXER FOR DATA PROCESSING SYSTEM

      
Application Number 19204217
Status Pending
Filing Date 2025-05-09
First Publication Date 2026-01-29
Owner Ab Initio Technology LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory Christopher James
  • Becker, Matthew Douglas
  • Schechter, Ian Robert
  • Bach, Edward Alan
  • Parks, Robert

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.

IPC Classes  ?

  • G06F 16/23 - Updating
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

6.

Identity Management in a Heterogeneous Cloud Computing System

      
Application Number 19280488
Status Pending
Filing Date 2025-07-25
First Publication Date 2026-01-29
Owner
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Yeracaris, Anthony M.
  • Roman, John

Abstract

A method for managing credentials in a heterogeneous cloud computing system, includes receiving, at a local computing system, a request to access a cloud resource on behalf of an end user, the request including a unique identifier associated with the end user and a resource identifier associated with the cloud resource, identifying, using the resource identifier, a predefined procedure for obtaining credentials for accessing the cloud resource, performing the predefined procedure to obtain the credentials for accessing the cloud resource, and accessing the cloud resource on behalf of the end user using the credentials.

IPC Classes  ?

7.

IDENTITY MANAGEMENT IN A HETEROGENEOUS CLOUD COMPUTING SYSTEM

      
Application Number US2025039216
Publication Number 2026/025011
Status In Force
Filing Date 2025-07-25
Publication Date 2026-01-29
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Yeracaris, Anthony M.
  • Roman, John

Abstract

A method for managing credentials in a heterogeneous cloud computing system, includes receiving, at a local computing system, a request to access a cloud resource on behalf of an end user, the request including a unique identifier associated with the end user and a resource identifier associated with the cloud resource, identifying, using the resource identifier, a predefined procedure for obtaining credentials for accessing the cloud resource, performing the predefined procedure to obtain the credentials for accessing the cloud resource, and accessing the cloud resource on behalf of the end user using the credentials.

IPC Classes  ?

  • G06F 21/41 - User authentication where a single sign-on provides access to a plurality of computers
  • H04L 9/40 - Network security protocols

8.

METADATA DRIVEN INGESTION AND DATA PROCESSING

      
Application Number 19241987
Status Pending
Filing Date 2025-06-18
First Publication Date 2025-12-11
Owner Ab Initio Technology LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

Abstract

A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

9.

Metadata change triggers

      
Application Number 18885179
Grant Number 12524413
Status In Force
Filing Date 2024-09-13
First Publication Date 2025-10-30
Grant Date 2026-01-13
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Gylfason, Halldor

Abstract

A method implemented by a data processing system for automatically detecting a change in entities of a first entity class, and, in response to the detecting, generating a notification specifying that a change relating to a second entity class has been detected, including: storing an entity model for entity classes; determining that a notification is to be generated upon detection of a change to entities of a first entity class of the entity model, the notification specifying that the detected change relates to a second entity class; detecting a change to an entity of the first entity class; responsive to the change, traversing edges of the entity model to identify a node representing an entity of the second entity class related to another node representing the entity of the first entity class; generating the notification for the entity of the second entity class; and storing the notification for transmission.

IPC Classes  ?

10.

METADATA CHANGE TRIGGERS

      
Application Number US2025026211
Publication Number 2025/226952
Status In Force
Filing Date 2025-04-24
Publication Date 2025-10-30
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Gylfason, Halldor

Abstract

A method implemented by a data processing system for automatically detecting a change in entities of a first entity class, and, in response to the detecting, generating a notification specifying that a change relating to a second entity class has been detected, including: storing an entity model for entity classes; determining that a notification is to be generated upon detection of a change to entities of a first entity class of the entity model, the notification specifying that the detected change relates to a second entity class; detecting a change to an entity of the first entity class; responsive to the change, traversing edges of the entity model to identify a node representing an entity of the second entity class related to another node representing the entity of the first entity class; generating the notification for the entity of the second entity class; and storing the notification for transmission.

IPC Classes  ?

11.

GUIDING A MACHINE LEARNING MODEL IN GENERATING RULES FOR DATA PROCESSING

      
Application Number 19174721
Status Pending
Filing Date 2025-04-09
First Publication Date 2025-10-16
Owner Ab Initio Technology LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Parks, Robert
  • Gracely, Fred
  • Polstra, Drew
  • Wilkins, Sam
  • Elmaliki, Nour

Abstract

A method implemented by a data processing system for dynamically and automatically guiding a machine learning model in generating a rule from natural language content by controlling the machine learning model to select from candidates that will enable the rule to operate efficiently includes: receiving, by a data processing system, natural language content specifying one or more criteria, identifying candidates for generating a rule representing at least one of the criteria specified by the natural language content, providing the identified candidates and at least a portion of the natural language content to a machine learning model, receiving an indication of at least one of the candidates selected by the machine learning model, generating the rule using the at least one of the candidates selected by the machine learning model, and storing, in a data store, the generated rule.

IPC Classes  ?

12.

MIGRATION OF DATASETS AMONG FEDERATED DATABASE SYSTEMS

      
Application Number 19253294
Status Pending
Filing Date 2025-06-27
First Publication Date 2025-10-16
Owner Ab Initio Technology LLC (USA)
Inventor
  • Bach, Edward Alan
  • Eads, Matthew

Abstract

In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.

IPC Classes  ?

  • G06F 16/11 - File system administration, e.g. details of archiving or snapshots
  • G06F 16/13 - File access structures, e.g. distributed indices
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

13.

GUIDING A MACHINE LEARNING MODEL IN GENERATING RULES FOR DATA PROCESSING

      
Application Number US2025024008
Publication Number 2025/217363
Status In Force
Filing Date 2025-04-10
Publication Date 2025-10-16
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Parks, Robert
  • Gracely, Fred
  • Polstra, Drew
  • Wilkins, Sam
  • Elmaliki, Nour

Abstract

A method implemented by a data processing system for dynamically and automatically guiding a machine learning model in generating a rule from natural language content by controlling the machine learning model to select from candidates that will enable the rule to operate efficiently includes: receiving, by a data processing system, natural language content specifying one or more criteria, identifying candidates for generating a rule representing at least one of the criteria specified by the natural language content, providing the identified candidates and at least a portion of the natural language content to a machine learning model, receiving an indication of at least one of the candidates selected by the machine learning model, generating the rule using the at least one of the candidates selected by the machine learning model, and storing, in a data store, the generated rule.

IPC Classes  ?

14.

SENDING CONTEXTUAL DATA ON DATAFLOWS OF DATAFLOW GRAPHS

      
Application Number US2025021313
Publication Number 2025/212318
Status In Force
Filing Date 2025-03-25
Publication Date 2025-10-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Epstein, Aaron
  • Lynch, Frank
  • Schmidt, Stephen

Abstract

A method implemented by a data processing system for processing records with a dataflow graph that includes transmitting the records over a dataflow of the dataflow graph and transmitting, among the records, information, associated with the processing of the records, over the same dataflow without modifying the records. The information can include trace data describing how the records are being processed.

IPC Classes  ?

15.

Sending Contextual Data on Dataflows of Dataflow Graphs

      
Application Number 18796730
Status Pending
Filing Date 2024-08-07
First Publication Date 2025-10-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Epstein, Aaron
  • Lynch, Frank
  • Schmidt, Stephen

Abstract

A method implemented by a data processing system for processing records with a dataflow graph that includes transmitting the records over a dataflow of the dataflow graph and transmitting, among the records, information, associated with the processing of the records, over the same dataflow without modifying the records. The information can include trace data describing how the records are being processed.

IPC Classes  ?

  • G06F 16/22 - IndexingData structures thereforStorage structures

16.

DYNAMIC DISTRIBUTION OF CONTAINER IMAGES

      
Application Number 19196812
Status Pending
Filing Date 2025-05-02
First Publication Date 2025-10-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Lynch, Frank
  • Wakeling, Tim

Abstract

A method implemented by a data processing system including: accessing the container image that includes the first application and a second application; determining, by the data processing system, the number of parallel executions of the given module of the first application; for the given module, generating a plurality of instances of the container image in accordance with the number of parallel executions determined, for each instance, configuring that instance to execute the given module of the first application; causing each of the plurality of configured instances to execute on one or more of the host systems; and for at least one of the plurality of configured instances, causing, by the second application of that configured instance, communication between the data processing system and the one or more of the host systems executing that configured instance.

IPC Classes  ?

  • G06F 9/445 - Program loading or initiating
  • G06F 8/61 - Installation
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 9/455 - EmulationInterpretationSoftware simulation, e.g. virtualisation or emulation of application or operating system execution engines
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

17.

CONTROLLING QUALITY OF DYNAMICALLY IDENTIFIED METADATA TRANSFERRED AMONG ENVIRONMENTS

      
Application Number US2025017654
Publication Number 2025/184381
Status In Force
Filing Date 2025-02-27
Publication Date 2025-09-04
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Rossi, Larry, Paul
  • Gylfason, Halldor
  • Brooks, Nathaniel
  • Bach, Ted
  • Millis, Tyler

Abstract

A method for delivering a dynamically identified set of related metadata of a specified quality to a target environment for metadata-driven processing of data includes, in a source environment, dynamically identifying a related set of metadata including given metadata and metadata related to the given metadata, processing identified metadata corresponding to the related set of metadata with one or more quality rules to determine whether the identified metadata has a specified quality, and in accordance with the identified metadata having the specified quality, making the identified metadata available for metadata-driven processing of data in the target environment.

IPC Classes  ?

  • G06F 16/907 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

18.

CONTROLLING QUALITY OF DYNAMICALLY IDENTIFIED METADATA TRANSFERRED AMONG ENVIRONMENTS

      
Application Number 18662539
Status Pending
Filing Date 2024-05-13
First Publication Date 2025-09-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Rossi, Larry Paul
  • Gylfason, Halldor
  • Brooks, Nathaniel
  • Bach, Ted
  • Millis, Tyler

Abstract

A method for delivering a dynamically identified set of related metadata of a specified quality to a target environment for metadata-driven processing of data includes, in a source environment, dynamically identifying a related set of metadata including given metadata and metadata related to the given metadata, processing identified metadata corresponding to the related set of metadata with one or more quality rules to determine whether the identified metadata has a specified quality, and in accordance with the identified metadata having the specified quality, making the identified metadata available for metadata-driven processing of data in the target environment.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 11/36 - Prevention of errors by analysis, debugging or testing of software
  • G06F 16/21 - Design, administration or maintenance of databases

19.

REMOTE FILE SERVICE SYSTEM FOR FILE OPERATIONS

      
Application Number US2025014994
Publication Number 2025/171249
Status In Force
Filing Date 2025-02-07
Publication Date 2025-08-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Lynch, Frank
  • Popov, Arkadi

Abstract

A method implemented by data processing systems enables usage of a cloud object storage for storing data accessed based on execution of file operations that are unsupported by the cloud object storage. The method includes executing an application configured to access application data; as the application executes, generating instructions to perform operations, unsupported by cloud object storage, to modify the application data; transmitting the instructions to a first file service system representing a location for the application data to be stored in the cloud object storage; recording an indication of the operations specified by the instructions; and transmitting, to the cloud object storage, the indication of the operations specified by the instructions, whereby transmittal of the indication to cloud object storage enables a second data processing system to access the application data that is modified in accordance with the operations unsupported by cloud object storage.

IPC Classes  ?

  • G06F 16/16 - File or folder operations, e.g. details of user interfaces specifically adapted to file systems
  • G06F 16/18 - File system types
  • G06F 16/182 - Distributed file systems

20.

Remote file service system for file operations

      
Application Number 18778121
Grant Number 12585553
Status In Force
Filing Date 2024-07-19
First Publication Date 2025-08-14
Grant Date 2026-03-24
Owner Ab Initio Technology LLC (USA)
Inventor
  • Lynch, Frank
  • Popov, Arkadi

Abstract

A method implemented by data processing systems enables usage of a cloud object storage for storing data accessed based on execution of file operations that are unsupported by the cloud object storage. The method includes executing an application configured to access application data; as the application executes, generating instructions to perform operations, unsupported by cloud object storage, to modify the application data; transmitting the instructions to a first file service system representing a location for the application data to be stored in the cloud object storage; recording an indication of the operations specified by the instructions; and transmitting, to the cloud object storage, the indication of the operations specified by the instructions, whereby transmittal of the indication to cloud object storage enables a second data processing system to access the application data that is modified in accordance with the operations unsupported by cloud object storage.

IPC Classes  ?

21.

TECHNIQUES FOR CONVERTING SQL DIALECT APPLICATION PROGRAMS TO DATAFLOW GRAPHS

      
Application Number US2025013368
Publication Number 2025/165740
Status In Force
Filing Date 2025-01-28
Publication Date 2025-08-07
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Bradshaw, Kevin
  • Smythe, Jared
  • Isman, Marshall

Abstract

Techniques for converting application programs written in a source structured query language (SQL) dialect to respective computer programs embodied by dataflow graphs, the method comprising using at least one computer hardware processor to perform: obtaining a source SQL dialect (SSD) application program comprising one or more SSD scripts, the one or more SSD scripts comprising a plurality of SSD statements; translating the plurality of SSD statements into a respective plurality of target SQL dialect (TSD) statements; converting the plurality of TSD statements into a respective plurality of dataflow graphs; and merging dataflow graphs in the plurality of dataflow graphs to obtain one or more merged dataflow graphs.

IPC Classes  ?

22.

SYSTEMS AND METHODS FOR ACCESSING DATA ENTITIES MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number 19043795
Status Pending
Filing Date 2025-02-03
First Publication Date 2025-08-07
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gylfason, Halldor Isak
  • Parks, Robert

Abstract

Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

IPC Classes  ?

23.

TECHNIQUES FOR CONVERTING SQL DIALECT APPLICATION PROGRAMS TO DATAFLOW GRAPHS

      
Application Number 19039118
Status Pending
Filing Date 2025-01-28
First Publication Date 2025-07-31
Owner Ab Initio Technology LLC (USA)
Inventor
  • Bradshaw, Kevin
  • Smythe, Jared
  • Isman, Marshall

Abstract

Techniques for converting application programs written in a source structured query language (SQL) dialect to respective computer programs embodied by dataflow graphs, the method comprising using at least one computer hardware processor to perform: obtaining a source SQL dialect (SSD) application program comprising one or more SSD scripts, the one or more SSD scripts comprising a plurality of SSD statements; translating the plurality of SSD statements into a respective plurality of target SQL dialect (TSD) statements; converting the plurality of TSD statements into a respective plurality of dataflow graphs; and merging dataflow graphs in the plurality of dataflow graphs to obtain one or more merged dataflow graphs.

IPC Classes  ?

24.

TECHNIQUES FOR ASSIGNING LABELS TO DATASET FIELDS

      
Application Number 18999645
Status Pending
Filing Date 2024-12-23
First Publication Date 2025-07-03
Owner Ab Initio Technology LLC (USA)
Inventor
  • Melbouci, Sandrick
  • Schon, Andrew

Abstract

Techniques for processing a dataset comprising data stored in fields to identify field labels. The field labels describe data stored in the dataset fields. The techniques determine whether any field labels in a field label glossary match a field. If none of the field labels in the field label glossary match the field, the techniques generate a new field label using the name of the field. The generated field label may be assigned to the field.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

25.

TECHNIQUES FOR ASSIGNING LABELS TO DATASET FIELDS

      
Application Number US2024061691
Publication Number 2025/144787
Status In Force
Filing Date 2024-12-23
Publication Date 2025-07-03
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
Inventor
  • Melbouci, Sandrick
  • Schon, Andrew

Abstract

Techniques for processing a dataset comprising data stored in fields to identify field labels. The field labels describe data stored in the dataset fields. The techniques determine whether any field labels in a field label glossary match a field. If none of the field labels in the field label glossary match the field, the techniques generate a new field label using the name of the field. The generated field label may be assigned to the field.

IPC Classes  ?

  • G06F 16/81 - Indexing, e.g. XML tagsData structures thereforStorage structures
  • G06F 16/901 - IndexingData structures thereforStorage structures

26.

DEVELOPMENT ENVIRONMENT FOR AUTOMATICALLY GENERATING CODE USING A MULTI-TIERED METADATA MODEL

      
Application Number 18987691
Status Pending
Filing Date 2024-12-19
First Publication Date 2025-06-26
Owner Ab Initio Technology LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Egenolf, Jonah

Abstract

A method for using a development environment to automatically generate code from a multi-tiered metadata model includes: receiving a specification to process a dataset, and, in response, accessing dataset characteristics and identifying controls received from a development environment to be applied to a field of the dataset in accordance with a metadata model by: accessing a first instance of a data structure that corresponds to the dataset; based on a reference in the first instance, accessing a second instance of a data structure associated with the field; based on a reference in the second instance, accessing a third instance of a data structure associated with metadata describing the field, and based on a reference in the third instance, accessing a fourth instance of a data structure storing a control defined based on the metadata. Based on the dataset characteristics, code is generated to apply the identified control to the field.

IPC Classes  ?

27.

LIVE INTERVAL ANALYSIS

      
Application Number US2024034454
Publication Number 2025/136448
Status In Force
Filing Date 2024-06-18
Publication Date 2025-06-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Bason, Iain

Abstract

An approach to allocation of referenced objects to memory resources addresses a situation in which there are a far greater number of memory resources, for example, 216 elements in the set of memory resources, and yet the objects referenced in a program specification exceeds this number. The approach is applicable to compilation of a program specification for execution on a physical or virtual processor.

IPC Classes  ?

28.

A DEVELOPMENT ENVIRONMENT FOR AUTOMATICALLY GENERATING CODE USING A MULTI-TIERED METADATA MODEL

      
Application Number US2024061392
Publication Number 2025/137522
Status In Force
Filing Date 2024-12-20
Publication Date 2025-06-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Radivojevic, Dusan
  • Egenolf, Jonah

Abstract

A method for using a development environment to automatically generate code from a multi-tiered metadata model includes: receiving a specification to process a dataset, and, in response, accessing dataset characteristics and identifying controls received from a development environment to be applied to a field of the dataset in accordance with a metadata model by: accessing a first instance of a data structure that corresponds to the dataset; based on a reference in the first instance, accessing a second instance of a data structure associated with the field; based on a reference in the second instance, accessing a third instance of a data structure associated with metadata describing the field, and based on a reference in the third instance, accessing a fourth instance of a data structure storing a control defined based on the metadata. Based on the dataset characteristics, code is generated to apply the identified control to the field.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/23 - Updating
  • G06F 16/242 - Query formulation
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 40/30 - Semantic analysis
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

29.

Live Interval Analysis

      
Application Number 18746432
Status Pending
Filing Date 2024-06-18
First Publication Date 2025-06-26
Owner Ab Initio Technology LLC (USA)
Inventor Bason, Iain

Abstract

An approach to allocation of referenced objects to memory resources addresses a situation in which there are a far greater number of memory resources, for example, 216 elements in the set of memory resources, and yet the objects referenced in a program specification exceeds this number. The approach is applicable to compilation of a program specification for execution on a physical or virtual processor.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

30.

PARTITION-BASED ESCROW IN A DISTRIBUTED COMPUTING SYSTEM

      
Document Number 03311150
Status Pending
Filing Date 2024-12-13
Open to Public Date 2025-06-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Babroudi, Zori

IPC Classes  ?

  • G06F 11/14 - Error detection or correction of the data by redundancy in operation, e.g. by using different operation sequences leading to the same result

31.

PARTITION-BASED ESCROW IN A DISTRIBUTED COMPUTING SYSTEM

      
Application Number US2024060078
Publication Number 2025/129027
Status In Force
Filing Date 2024-12-13
Publication Date 2025-06-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Babroudi, Zori

Abstract

A method for fault-tolerant processing of a number of data elements using a distributed computing cluster. The distributed computing cluster includes a number of data processors associated with a corresponding number of data stores. The method includes storing the data elements in the distributed computing cluster, wherein the data elements are distributed across the data stores according to a number of partitions of data elements, processing data elements of a first set of partitions stored at a first data store using a first data processor to generate first result data for the data elements of the first set of partitions, sending the first result data from the distributed computing cluster to a processing component of the first result data outside the distributed computing cluster, and storing the first result data in a first buffer located in the distributed computing cluster and associated with the first data processor until the processing component has persistently stored the first result data outside the distributed computing cluster.

IPC Classes  ?

  • G06F 11/14 - Error detection or correction of the data by redundancy in operation, e.g. by using different operation sequences leading to the same result

32.

Partition-based Escrow in a Distributed Computing System

      
Application Number 18980527
Status Pending
Filing Date 2024-12-13
First Publication Date 2025-06-19
Owner Ab Initio Technology LLC (USA)
Inventor Babroudi, Zori

Abstract

A method for fault-tolerant processing of a number of data elements using a distributed computing cluster. The distributed computing cluster includes a number of data processors associated with a corresponding number of data stores. The method includes storing the data elements in the distributed computing cluster, wherein the data elements are distributed across the data stores according to a number of partitions of data elements, processing data elements of a first set of partitions stored at a first data store using a first data processor to generate first result data for the data elements of the first set of partitions, sending the first result data from the distributed computing cluster to a consumer of the first result data outside the distributed computing cluster, and storing the first result data in a first buffer located in the distributed computing cluster and associated with the first data processor until the consumer has persistently stored the first result data outside the distributed computing cluster.

IPC Classes  ?

  • G06F 11/16 - Error detection or correction of the data by redundancy in hardware
  • G06F 16/23 - Updating

33.

AUTOMATED IDENTIFICATION AND MIGRATION OF INPUT AND BASELINE TEST DATA

      
Document Number 03312680
Status Pending
Filing Date 2024-12-02
Open to Public Date 2025-06-05
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Hawkins, Miranda
  • Isman, Marshall A.
  • Jankovic, Maja
  • Samoylovich, Alexander
  • Hurd, Ben

IPC Classes  ?

34.

DATASET MULTIPLEXER WITH DATASET RESOLVER FOR DATA PROCESSING SYSTEM

      
Document Number 03312822
Status Pending
Filing Date 2024-11-27
Open to Public Date 2025-06-05
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory Christopher James
  • Parks, Robert
  • Eads, Matthew Austin
  • Zimmerman, William Robert
  • Huang, Aaron
  • Becker, Matthew Douglas

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

35.

TECHNIQUES FOR RESOLVING DATA FIELDS AVAILABLE AT POINTS IN A SOFTWARE APPLICATION

      
Document Number 03313386
Status Pending
Filing Date 2024-11-29
Open to Public Date 2025-06-05
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Dickie, Garth Allen
  • Sciarappa, Luke Bryant
  • Miller, Zachary James
  • Schechter, Ian Robert

Abstract

Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.

IPC Classes  ?

36.

DATASET MULTIPLEXER WITH DATASET RESOLVER FOR DATA PROCESSING SYSTEM

      
Application Number 18961714
Status Pending
Filing Date 2024-11-27
First Publication Date 2025-06-05
Owner
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory Christopher James
  • Parks, Robert
  • Eads, Matthew Austin
  • Zimmerman, William Robert
  • Huang, Aaron
  • Becker, Matthew Douglas

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, the physical dataset used for performing data access operations may be selected based on current context. Current context may be specified based on values of system parameters and/or user specified values. The physical dataset accessed may be identified by selecting a record from multiple records in a dataset catalog associated with the logical dataset. Each record includes information to access a physical dataset associated with the selected record and context information to indicate the context in which the specific physical dataset is to be selected.

IPC Classes  ?

  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
  • G06F 16/901 - IndexingData structures thereforStorage structures

37.

TECHNIQUES FOR RESOLVING DATA FIELDS AVAILABLE AT POINTS IN A SOFTWARE APPLICATION

      
Application Number 18964167
Status Pending
Filing Date 2024-11-29
First Publication Date 2025-06-05
Owner
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Dickie, Garth Allen
  • Sciarappa, Luke Bryant
  • Miller, Zachary James
  • Schechter, Ian Robert

Abstract

Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.

IPC Classes  ?

38.

TECHNIQUES FOR RESOLVING DATA FIELDS AVAILABLE AT POINTS IN A SOFTWARE APPLICATION

      
Application Number US2024057981
Publication Number 2025/117907
Status In Force
Filing Date 2024-11-29
Publication Date 2025-06-05
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
Inventor
  • Dickie, Garth, Allen
  • Sciarappa, Luke, Bryant
  • Miller, Zachary, James
  • Schechter, Ian, Robert

Abstract

Some embodiments relate to generating a list of data fields referenceable at a point in a graph (there are different lists for each point). This list may be used as part of programming a dataflow graph to select data (e.g., at an input node of a component to select data processed in that component). One aspect relates to display of the list of data fields, because some of the data field names may be overloaded. Accordingly, the data fields may be presented hierarchically if necessary, showing the source for each overloaded data field name. Otherwise, the user may select whether the list of referenceable fields is grouped by source.

IPC Classes  ?

39.

AUTOMATED IDENTIFICATION AND MIGRATION OF INPUT AND BASELINE TEST DATA

      
Application Number US2024058077
Publication Number 2025/117946
Status In Force
Filing Date 2024-12-02
Publication Date 2025-06-05
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Hawkins, Miranda
  • Isman, Marshall A.
  • Jankovic, Maja
  • Samoylovich, Alexander
  • Hurd, Ben

Abstract

A computer-implemented method for defining a test for a computer program includes receiving operational data generated during execution of a computer program in a first computing environment, the operational data indicative of (i) a data source accessed by the computer program during execution of the computer program and (ii) a destination to where baseline data records are output by the computer program during execution of the computer program. Based on the received operational data, a data storage object is generated that includes (i) input data records from the data source and the baseline data records from the destination, and (ii) test definition data for the first computing environment. Responsive to migration of the computer program to a second computing environment, the input and baseline data records from the data storage object are stored in the second computing environment. A test configuration is defined for the migrated computer program in the second computing environment according to the test definition data in the data storage object and the mapping between the first computing environment and the second computing environment, the test configuration for the migrated computer program identifying a location of the input data records and a location of the baseline data records in the second computing environment. Execution of the migrated computer program in the second computing environment is tested using the input data records and baseline data records in the second computing environment and according to the defined test configuration for the migrated computer program.

IPC Classes  ?

40.

AUTOMATED IDENTIFICATION AND MIGRATION OF INPUT AND BASELINE TEST DATA

      
Application Number 18962909
Status Pending
Filing Date 2024-11-27
First Publication Date 2025-06-05
Owner Ab Initio Technology LLC (USA)
Inventor
  • Hawkins, Miranda
  • Isman, Marshall A.
  • Jankovic, Maja
  • Samoylovich, Alexander
  • Hurd, Ben

Abstract

A computer-implemented method for defining a test for a computer program includes receiving operational data generated during execution of a computer program in a first computing environment, the operational data indicative of (i) a data source accessed by the computer program during execution of the computer program and (ii) a destination to where baseline data records are output by the computer program during execution of the computer program. Based on the received operational data, a data storage object is generated that includes (i) input data records from the data source and the baseline data records from the destination, and (ii) test definition data for the first computing environment. Responsive to migration of the computer program to a second computing environment, the input and baseline data records from the data storage object are stored in the second computing environment. A test configuration is defined for the migrated computer program in the second computing environment according to the test definition data in the data storage object and the mapping between the first computing environment and the second computing environment, the test configuration for the migrated computer program identifying a location of the input data records and a location of the baseline data records in the second computing environment. Execution of the migrated computer program in the second computing environment is tested using the input data records and baseline data records in the second computing environment and according to the defined test configuration for the migrated computer program.

IPC Classes  ?

  • G06F 16/36 - Creation of semantic tools, e.g. ontology or thesauri

41.

DATASET MULTIPLEXER WITH DATASET RESOLVER FOR DATA PROCESSING SYSTEM

      
Application Number US2024057602
Publication Number 2025/117635
Status In Force
Filing Date 2024-11-27
Publication Date 2025-06-05
Owner
  • AB INITIO TECHNOLOGY LLC (USA)
  • AB INITIO ORIGINAL WORKS LLC (USA)
  • AB INITIO SOFTWARE LLC (USA)
Inventor
  • Weisman, Amit
  • Fantasia, Cory, Christopher James
  • Parks, Robert
  • Eads, Matthew, Austin
  • Zimmerman, William, Robert
  • Huang, Aaron
  • Becker, Matthew, Douglas

Abstract

A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, the physical dataset used for performing data access operations may be selected based on current context. Current context may be specified based on values of system parameters and/or user specified values. The physical dataset accessed may be identified by selecting a record from multiple records in a dataset catalog associated with the logical dataset. Each record includes information to access a physical dataset associated with the selected record and context information to indicate the context in which the specific physical dataset is to be selected.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems

42.

CONVERSION OF DATA LINEAGES

      
Application Number US2024043393
Publication Number 2025/049234
Status In Force
Filing Date 2024-08-22
Publication Date 2025-03-06
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Wagner, Philippe

Abstract

The present disclosure relates to a computer-implemented method for conversion of a first data lineage to a second data lineage, the method comprising: obtaining a first data lineage specifying relationships among physical components of a plurality of physical components; receiving an identification of a portion of the first data lineage; generating a second data lineage from the identified portion of the first data lineage, the second data lineage specifying relationships among second components of a plurality of second components, wherein the second components of the plurality of second components are associated with at least some of the physical components of the identified portion of the first data lineage. A corresponding computer-readable medium, a corresponding a data processing system, and a corresponding computer program are also described.

IPC Classes  ?

43.

DATA SET EVALUATION BASED ON DATA LINEAGE ANALYSIS

      
Application Number US2024042084
Publication Number 2025/038623
Status In Force
Filing Date 2024-08-13
Publication Date 2025-02-20
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor De Jager, Dirk

Abstract

The present disclosure relates to a computer-implemented method, the method comprising: obtaining a data lineage whose structure specifies relationships among data sets of a plurality of data sets; analyzing the structure of the data lineage; based on a result of the analyzing, identifying a subset of the plurality of data sets for which a parameter is to be evaluated, wherein the subset includes one or more of the data sets, and wherein the parameter is for indicating a potential error within a data set; and outputting an indication of the identified subset of one or more data sets. A computer-readable medium, computer program, a corresponding data processing apparatus, and a data structure are described as well.

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

44.

MACHINE LEARNING TECHNIQUES FOR DISCOVERING KEYS IN RELATIONAL DATASETS

      
Application Number US2024039543
Publication Number 2025/029579
Status In Force
Filing Date 2024-07-25
Publication Date 2025-02-06
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Chang, Andrew
  • Oberdorf, Richard
  • Wang, Song
  • Smith, Nathan

Abstract

Techniques for discovering primary, unique, and/or foreign keys for relational datasets are described. The techniques include profiling the relational datasets to obtain respective data profiles; identifying one or more primary key candidates for a first relational dataset using a first data profile of the first relational dataset and a first trained machine learning model; identifying one or more foreign key proposals for a second relational dataset using the one or more primary key candidates by performing a subset analysis of the second relational dataset with respect to the first relational dataset; identifying one or more foreign key candidates for the second relational dataset using the first data profile, a second data profile of the second relational dataset, and a second trained machine learning model different from the first trained machine learning model; and outputting the at primary key candidate(s) and the foreign key candidate(s).

IPC Classes  ?

45.

SYSTEMS AND METHODS FOR PERFORMING DATA PROCESSING OPERATIONS USING VARIABLE LEVEL PARALLELISM

      
Application Number 18736974
Status Pending
Filing Date 2024-06-07
First Publication Date 2025-01-30
Owner Ab Initio Technology LLC (USA)
Inventor Dickie, Garth Allen

Abstract

Techniques for determining processing layouts to nodes of a dataflow graph. The techniques include: obtaining information specifying a dataflow graph, the dataflow graph comprising a plurality of nodes and a plurality of edges connecting the plurality nodes, the plurality of edges representing flows of data among nodes in the plurality of nodes, the plurality of nodes comprising: a first set of one or more nodes; and a second set of one or more nodes disjoint from the first set of nodes; obtaining a first set of one or more processing layouts for the first set of nodes; and determining a processing layout for each node in the second set of nodes based on the first set of processing layouts and one or more layout determination rules, the one or more layout determination rules including at least one rule for selecting among processing layouts having different degrees of parallelism, and information indicating that data generated by at least one node in the first and/or third set of nodes is not used by any nodes in the dataflow graph downstream from the at least one node.

IPC Classes  ?

  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

46.

MACHINE LEARNING TECHNIQUES FOR DISCOVERING KEYS IN RELATIONAL DATASETS

      
Application Number 18783851
Status Pending
Filing Date 2024-07-25
First Publication Date 2025-01-30
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Chang, Andrew
  • Oberdorf, Richard
  • Wang, Song
  • Smith, Nathan

Abstract

Techniques for discovering primary, unique, and/or foreign keys for relational datasets are described. The techniques include profiling the relational datasets to obtain respective data profiles; identifying one or more primary key candidates for a first relational dataset using a first data profile of the first relational dataset and a first trained machine learning model; identifying one or more foreign key proposals for a second relational dataset using the one or more primary key candidates by performing a subset analysis of the second relational dataset with respect to the first relational dataset; identifying one or more foreign key candidates for the second relational dataset using the first data profile, a second data profile of the second relational dataset, and a second trained machine learning model different from the first trained machine learning model; and outputting the at primary key candidate(s) and the foreign key candidate(s).

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 40/30 - Semantic analysis

47.

FOR DATA OBJECTS MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number 18909013
Status Pending
Filing Date 2024-10-08
First Publication Date 2025-01-23
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Yeracaris, Anthony
  • Radivojevic, Dusan

Abstract

Techniques for using finite state machines (FSMs) to implement workflows in a data processing system comprising at least one data store storing data objects and a workflow management system (WMS). The WMS is configured to perform: determining a current value of an attribute of a first data object by accessing the current value in the at least one data store; identifying, using the current value and metadata specifying relationships among at least some of the data objects, an actor authorized to perform a workflow task for the first data object; generating a GUI through which the actor can provide the input that the workflow task is to be performed; and in response to receiving, from the actor and through the GUI, input specifying that the workflow task is to be performed: performing the workflow task; and updating the current workflow state of the first FSM to a second workflow state.

IPC Classes  ?

  • G06Q 10/10 - Office automationTime management
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 16/906 - ClusteringClassification

48.

REUSABLE DATA PROCESSING PROGRAM GENERATION

      
Document Number 03293980
Status Pending
Filing Date 2024-04-12
Open to Public Date 2024-12-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Pribble, Molly
  • Keare, Avery
  • Stahlecker, Dirk
  • Abaya, Victor T.
  • Angiolillo, Carl

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/904 - BrowsingVisualisation therefor

49.

REUSABLE DATA PROCESSING PROGRAM GENERATION

      
Application Number US2024024224
Publication Number 2024/258483
Status In Force
Filing Date 2024-04-12
Publication Date 2024-12-19
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Weisman, Amit
  • Pribble, Molly
  • Keare, Avery
  • Stahlecker, Dirk
  • Abaya, Victor, T.
  • Angiolillo, Carl

Abstract

A method for developing a reusable data processing program including a set of data transformation steps by displaying a set of records and enabling a user to select one or more data transformation steps, applying the data transformation steps to the records, and displaying the transformed records.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/904 - BrowsingVisualisation therefor

50.

REUSABLE DATA PROCESSING PROGRAM GENERATION

      
Application Number 18633887
Status Pending
Filing Date 2024-04-12
First Publication Date 2024-12-12
Owner Ab Initio Technology LLC (USA)
Inventor
  • Weisman, Amit
  • Pribble, Molly
  • Keare, Avery
  • Stahlecker, Dirk
  • Abaya, Victor T.
  • Angiolillo, Carl

Abstract

A method for developing a reusable data processing program including a set of data transformation steps by displaying a set of records and iteratively enabling a user to select one or more data transformation steps, iteratively applying the data transformation steps to the records, and iteratively displaying the transformed records.

IPC Classes  ?

51.

MIGRATION OF DATASETS AMONG FEDERATED DATABASE SYSTEMS

      
Document Number 03291655
Status Pending
Filing Date 2024-05-09
Open to Public Date 2024-11-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Bach, Edward Alan
  • Eads, Matthew

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database systemDistributed database system architectures therefor

52.

MIGRATION OF DATASETS AMONG FEDERATED DATABASE SYSTEMS

      
Application Number US2024028621
Publication Number 2024/233801
Status In Force
Filing Date 2024-05-09
Publication Date 2024-11-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Bach, Edward, Alan
  • Eads, Matthew

Abstract

In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database systemDistributed database system architectures therefor

53.

Migration of datasets among federated database systems

      
Application Number 18442567
Grant Number 12346288
Status In Force
Filing Date 2024-02-15
First Publication Date 2024-11-14
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Bach, Edward Alan
  • Eads, Matthew

Abstract

In an aspect, a method for migrating data records to a federated database system includes obtaining data records from a data source in a first federated database system; generating a data snapshot file based on the obtained data records and data indicative of a characteristic associated with the obtained data records; generating a hash of the data snapshot file to prevent modification of the data snapshot file; storing the data snapshot file and the generated hash in a data storage; migrating the obtained data records from the data snapshot file to a data target in a second federated database system, the migrating including: retrieving the data records from the data snapshot file stored in the data storage; providing the retrieved data records to the data target according to a mapping between a characteristic of the data source and a characteristic of the data target.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/11 - File system administration, e.g. details of archiving or snapshots
  • G06F 16/13 - File access structures, e.g. distributed indices
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

54.

Meta-data driven ingestion and data processing

      
Application Number 18496543
Grant Number 12346350
Status In Force
Filing Date 2023-10-27
First Publication Date 2024-10-17
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

Abstract

A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.

IPC Classes  ?

  • G06F 16/20 - Information retrievalDatabase structures thereforFile system structures therefor of structured data, e.g. relational data
  • G06F 3/048 - Interaction techniques based on graphical user interfaces [GUI]
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

55.

METADATA DRIVEN INGESTION AND DATA PROCESSING

      
Application Number US2024024348
Publication Number 2024/216088
Status In Force
Filing Date 2024-04-12
Publication Date 2024-10-17
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

Abstract

A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

56.

META-DATA DRIVEN INGESTION AND DATA PROCESSING

      
Document Number 03288731
Status Pending
Filing Date 2024-04-12
Open to Public Date 2024-10-17
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Weiss, Adam
  • Egenolf, Jonah
  • Parks, Robert
  • Maclean, John
  • Mellon, Richard
  • Radivojevic, Dusan
  • Veiser, Paul
  • Khader, Mazin Woodrow

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

57.

Systems and methods for accessing data entities managed by a data processing system

      
Application Number 18415067
Grant Number 12254002
Status In Force
Filing Date 2024-01-17
First Publication Date 2024-10-17
Grant Date 2025-03-18
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gylfason, Halldor Isak
  • Parks, Robert

Abstract

Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

IPC Classes  ?

58.

On-demand integration of records with data catalog identifiers

      
Application Number 18491324
Grant Number 12339826
Status In Force
Filing Date 2023-10-20
First Publication Date 2024-10-10
Grant Date 2025-06-24
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Hunter, Matthew Jonathon

Abstract

A method implemented by a data processing system for enabling a user to browse a data catalog and select fields from multiple data sources to be integrated into a data profile so that, when a request is received for the data profile, data from those fields can be made available efficiently and immediately.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/22 - IndexingData structures thereforStorage structures

59.

ON-DEMAND INTEGRATION OF RECORDS WITH DATA CATALOG IDENTIFIERS

      
Application Number US2024023227
Publication Number 2024/211674
Status In Force
Filing Date 2024-04-05
Publication Date 2024-10-10
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Hunter, Matthew Jonathon

Abstract

A method implemented by a data processing system for enabling a user to browse a data catalog and select fields of datasets from multiple data sources to be integrated into a data profile so that, when a request is received for the data profile, data from those fields can be made available efficiently and immediately.

IPC Classes  ?

  • G06F 16/242 - Query formulation
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/9035 - Filtering based on additional data, e.g. user or group profiles
  • G06F 16/9535 - Search customisation based on user profiles and personalisation

60.

USING METADATA TO DRIVE BATCH AND REAL-TIME PROCESSING IN DATA PROCESSING SYSTEMS

      
Document Number 03284047
Status Pending
Filing Date 2024-03-15
Open to Public Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Egenolf, Jonah

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

61.

LOGICAL ACCESS FOR PREVIEWING EXPANDED VIEW DATASETS

      
Document Number 03286845
Status Pending
Filing Date 2024-03-22
Open to Public Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah
  • Schechter, Ian

IPC Classes  ?

  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 16/957 - Browsing optimisation, e.g. caching or content distillation
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

62.

Logical Access for Previewing Expanded View Datasets

      
Application Number 18492904
Status Pending
Filing Date 2023-10-24
First Publication Date 2024-09-26
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah
  • Schechter, Ian

Abstract

A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.

IPC Classes  ?

63.

USING METADATA TO DRIVE BATCH AND REAL-TIME PROCESSING IN DATA PROCESSING SYSTEMS

      
Application Number US2024020140
Publication Number 2024/196759
Status In Force
Filing Date 2024-03-15
Publication Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Isman, Marshall A.
  • Egenolf, Jonah

Abstract

Described are techniques for causing a data processing system to perform real-time decisioning by generating a data record (e.g., a dynamic data record) based on a request for the real-time decisioning, wherein the data record includes batch data and real-time data retrieved from one or more operational systems responsive to receipt of the request, with real-time being with regard to when the request is received by the data processing system.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases

64.

LOGICAL ACCESS FOR PREVIEWING EXPANDED VIEW DATASETS

      
Application Number US2024021174
Publication Number 2024/197264
Status In Force
Filing Date 2024-03-22
Publication Date 2024-09-26
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah
  • Schechter, Ian

Abstract

A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.

IPC Classes  ?

  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 16/957 - Browsing optimisation, e.g. caching or content distillation
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

65.

Using Metadata to Drive Batch and Real-Time Processing in Data Processing Systems

      
Application Number 18491601
Status Pending
Filing Date 2023-10-20
First Publication Date 2024-09-19
Owner Ab Initio Technology LLC (USA)
Inventor
  • Isman, Marshall A.
  • Egenolf, Jonah

Abstract

Described are techniques for causing a data processing system to perform real-time decisioning by generating a record (e.g., dynamic record) based on a request for the real-time decisioning, wherein the record (e.g., dynamic record) includes batch data and real-time data retrieved from one or more operational systems responsive to receipt of the request, with real-time being with regard to when the request is received by the data processing system.

IPC Classes  ?

66.

SYSTEMS AND METHODS FOR DATAFLOW GRAPH OPTIMIZATION

      
Application Number 18670461
Status Pending
Filing Date 2024-05-21
First Publication Date 2024-09-19
Owner Ab Initio Technology LLC (USA)
Inventor Dickie, Garth Allen

Abstract

At least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by at least one computer hardware processor, cause the at least one computer hardware processor to perform: obtaining an automatically generated initial dataflow graph, the initial dataflow graph comprising a first plurality of nodes representing a first plurality of data processing operations and a first plurality of links representing flows of data among nodes in the first plurality of nodes; and generating an updated dataflow graph by iteratively applying dataflow graph optimization rules to update the initial dataflow graph, the updated dataflow graph comprising a second plurality of nodes representing a second plurality of data processing operations and a second plurality of links representing flows of data among nodes in the second plurality of nodes.

IPC Classes  ?

67.

BACK-CALCULATION OF AGGREGATES

      
Application Number US2024014447
Publication Number 2024/167838
Status In Force
Filing Date 2024-02-05
Publication Date 2024-08-15
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie

Abstract

Described are techniques for back-calculating one or more values of a new, real-time aggregate before sufficient data to calculate the new, real-time aggregate has been collected, wherein the back-calculating is based on data collected for one or more aggregates that have been executing prior to start of execution of the new, real-time aggregate.

IPC Classes  ?

  • G06F 17/18 - Complex mathematical operations for evaluating statistical data
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

68.

PROCESSING STRUCTURED DATA FOR REAL-TIME SEGMENTATION

      
Document Number 03279820
Status Pending
Filing Date 2024-02-02
Open to Public Date 2024-08-08
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie
  • Banks, Matthew John

IPC Classes  ?

69.

Processing Structured Data for Real-time Segmentation

      
Application Number 18486300
Status Pending
Filing Date 2023-10-13
First Publication Date 2024-08-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie
  • Banks, Matthew John

Abstract

A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • H04L 67/55 - Push-based network services

70.

Back-Calculation of Aggregates

      
Application Number 18233418
Status Pending
Filing Date 2023-08-14
First Publication Date 2024-08-08
Owner Ab Initio Technology LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie

Abstract

Described are techniques for back-calculating one or more values of a new, real-time aggregate before sufficient data to calculate the new, real-time aggregate has been collected, wherein the back-calculating is based on data collected for one or more aggregates that have been executing prior to start of execution of the new, real-time aggregate.

IPC Classes  ?

  • G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
  • G06F 16/242 - Query formulation

71.

PROCESSING STRUCTURED DATA FOR REAL-TIME SEGMENTATION

      
Application Number US2024014305
Publication Number 2024/163937
Status In Force
Filing Date 2024-02-02
Publication Date 2024-08-08
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Gould, Joel
  • Roche, Benjamin James Robbie
  • Banks, Matthew John

Abstract

A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.

IPC Classes  ?

72.

ON-DEMAND RETRIEVAL OF STRUCTURED DATA IN AGGREGATING DATA ACROSS DISTINCT SOURCES

      
Application Number US2024012793
Publication Number 2024/158920
Status In Force
Filing Date 2024-01-24
Publication Date 2024-08-02
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor Gould, Joel

Abstract

A method for enabling a user to generate a complex aggregation on their own by providing the user with a graphical user interface that displays data items in a data catalog and that provides controls for the user to select data items to be used in generating the complex aggregation, and to select a type of aggregation, and based on the user's selections, automatically generating computer instructions to generate a value of the complex aggregation is described.

IPC Classes  ?

  • G06F 16/11 - File system administration, e.g. details of archiving or snapshots
  • G06F 16/16 - File or folder operations, e.g. details of user interfaces specifically adapted to file systems
  • G06F 8/34 - Graphical or visual programming
  • G06F 9/44 - Arrangements for executing specific programs

73.

DATA PROCESSING SYSTEM WITH MANIPULATION OF LOGICAL DATASET GROUPS

      
Application Number 18434546
Status Pending
Filing Date 2024-02-06
First Publication Date 2024-08-01
Owner Ab Initio Technology LLC (USA)
Inventor Weisman, Amit

Abstract

A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.

IPC Classes  ?

  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 3/0482 - Interaction with lists of selectable items, e.g. menus
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/248 - Presentation of query results

74.

On-demand retrieval of structured data in aggregating data across distinct sources

      
Application Number 18338180
Grant Number 12619623
Status In Force
Filing Date 2023-06-20
First Publication Date 2024-07-25
Grant Date 2026-05-05
Owner Ab Initio Technology LLC (USA)
Inventor Gould, Joel

Abstract

A method for enabling a user to generate a complex aggregation on their own by providing the user with a graphical user interface that displays data items in a data catalog and that provides controls for the user to select data items to be used in generating the complex aggregation, and to select a type of aggregation, and based on the user's selections, automatically generating computer instructions to generate a value of the complex aggregation is described.

IPC Classes  ?

  • G06F 16/248 - Presentation of query results
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 16/2455 - Query execution
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

75.

Display panel portion with an animated computer icon

      
Application Number 29831258
Grant Number D1032627
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-25
Grant Date 2024-06-25
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

76.

Display panel portion with an animated computer icon

      
Application Number 29831275
Grant Number D1032628
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-25
Grant Date 2024-06-25
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

77.

Display panel portion with an animated computer icon

      
Application Number 29831244
Grant Number D1031741
Status In Force
Filing Date 2022-03-18
First Publication Date 2024-06-18
Grant Date 2024-06-18
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

78.

Display panel portion with an animated computer icon

      
Application Number 29831176
Grant Number D1031740
Status In Force
Filing Date 2022-03-17
First Publication Date 2024-06-18
Grant Date 2024-06-18
Owner Ab Initio Technology LLC (USA)
Inventor Nielsen, Bo Hjort

79.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18545416
Grant Number 12242444
Status In Force
Filing Date 2023-12-19
First Publication Date 2024-05-09
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

80.

SYSTEMS AND METHODS FOR MANAGING PRIVILEGES IN A DATA PROCESSING SYSTEM

      
Application Number 18278264
Status Pending
Filing Date 2022-02-23
First Publication Date 2024-05-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Polstra, Drew
  • Parks, Robert

Abstract

Techniques for managing access privileges in a data processing system include obtaining a plurality of rules for granting and/or denying privileges to a first actor to perform at least one action on a first instance of a first data entity of data entities; identifying, from among attributes of the first data entity, a first attribute whose values are used by one or more of the plurality of rules; obtaining, from a user or from at least one data store, a first value of the first attribute; identifying, using the first value and from among the plurality of rules, a first rule that depends on the first value; generating a graphical user interface (GUI) including a visual rendering of at least some of the plurality of rules, the visual rendering emphasizing the first rule identified using the first value of the first attribute; and displaying the generated GUI to the user.

IPC Classes  ?

  • H04L 9/40 - Network security protocols
  • H04L 41/22 - Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks comprising specially adapted graphical user interfaces [GUI]

81.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18399522
Grant Number 12242442
Status In Force
Filing Date 2023-12-28
First Publication Date 2024-04-18
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

82.

Dynamic distribution of container images

      
Application Number 18492173
Grant Number 12307262
Status In Force
Filing Date 2023-10-23
First Publication Date 2024-04-18
Grant Date 2025-05-20
Owner Ab Initio Technology LLC (USA)
Inventor
  • Lynch, Frank
  • Wakeling, Tim

Abstract

A method implemented by a data processing system including: accessing the container image that includes the first application and a second application; determining, by the data processing system, the number of parallel executions of the given module of the first application; for the given module, generating a plurality of instances of the container image in accordance with the number of parallel executions determined, for each instance, configuring that instance to execute the given module of the first application; causing each of the plurality of configured instances to execute on one or more of the host systems; and for at least one of the plurality of configured instances, causing, by the second application of that configured instance, communication between the data processing system and the one or more of the host systems executing that configured instance.

IPC Classes  ?

  • G06F 9/445 - Program loading or initiating
  • G06F 8/61 - Installation
  • G06F 9/451 - Execution arrangements for user interfaces
  • G06F 9/455 - EmulationInterpretationSoftware simulation, e.g. virtualisation or emulation of application or operating system execution engines
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]

83.

Generation of optimized logic from a schema

      
Application Number 18345852
Grant Number 12346312
Status In Force
Filing Date 2023-06-30
First Publication Date 2024-04-18
Grant Date 2025-07-01
Owner Ab Initio Technology LLC (USA)
Inventor
  • Egenolf, Jonah
  • Isman, Marshall A.
  • Schechter, Ian

Abstract

A method includes accessing a schema that specifies relationships among datasets, computations on the datasets, or transformations of the datasets, selecting a dataset from among the datasets, and identifying, from the schema, other datasets that are related to the selected dataset. Attributes of the datasets are identified, and logical data representing the identified attributes and relationships among the attributes is generated. The logical data is provided to a development environment, which provides access to portions of the logical data representing the identified attributes. A specification that specifies at least one of the identified attributes in performing an operation is received from the development environment. Based on the specification and the relationships among the identified attributes represented by the logical data, a computer program is generated to perform the operation by accessing, from storage, at least one dataset having the at least one of the attributes specified in the specification.

IPC Classes  ?

  • G06F 8/34 - Graphical or visual programming
  • G06F 8/36 - Software reuse
  • G06F 8/38 - Creation or generation of source code for implementing user interfaces
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/23 - Updating
  • G06F 16/242 - Query formulation
  • G06F 16/2452 - Query translation
  • G06F 16/2453 - Query optimisation
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 16/81 - Indexing, e.g. XML tagsData structures thereforStorage structures
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06Q 10/10 - Office automationTime management
  • G06Q 30/0242 - Determining effectiveness of advertisements

84.

Generating rules for data processing values of data fields from semantic labels of the data fields

      
Application Number 18399545
Grant Number 12242443
Status In Force
Filing Date 2023-12-28
First Publication Date 2024-04-18
Grant Date 2025-03-04
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Isman, Marshall A.
  • Melbouci, Sandrick

Abstract

Methods and systems are configured to determine a semantic meaning for data and generate data processing rules based on the semantic meaning of the data. The semantic meaning includes syntactical or contextual meaning for the data that is determined, for example, by profiling, by the data processing system, values stored in a field included in data records of one or more datasets; applying, by the data processing system, one or more classifiers to the profiled values; identifying, based on applying the one or more classifiers, one or more attributes indicative of a logical or syntactical characteristic for the values of the field, with each of the one or more attributes having a respective confidence level that is based on an output of each of the one or more classifiers. The attributes are associated with the fields and are used for generating data processing rules and processing the data.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06N 5/04 - Inference or reasoning models
  • G06N 20/00 - Machine learning

85.

TECHNIQUES FOR DISCOVERING AND UPDATING SEMANTIC MEANING OF DATA FIELDS

      
Application Number US2023074617
Publication Number 2024/064705
Status In Force
Filing Date 2023-09-19
Publication Date 2024-03-28
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

Abstract

Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels, for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously- determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules

86.

TECHNIQUES FOR DISCOVERING AND UPDATING SEMANTIC MEANING OF DATA FIELDS

      
Document Number 03268252
Status In Force
Filing Date 2023-09-19
Open to Public Date 2024-03-28
Grant Date 2026-04-21
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

Abstract

Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels, for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously- determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.

IPC Classes  ?

  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
  • G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules

87.

Publishing to a data warehouse

      
Application Number 18492425
Grant Number 12277135
Status In Force
Filing Date 2023-10-23
First Publication Date 2024-03-28
Grant Date 2025-04-15
Owner Ab Initio Technology LLC (USA)
Inventor
  • Blom, Andrew
  • Miller, Darren
  • Isman, Marshall A.

Abstract

A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.

IPC Classes  ?

  • G06F 7/00 - Methods or arrangements for processing data by operating upon the order or content of the data handled
  • G06F 8/34 - Graphical or visual programming
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 17/00 - Digital computing or data processing equipment or methods, specially adapted for specific functions
  • H04L 67/565 - Conversion or adaptation of application format or content

88.

Techniques for discovering and updating semantic meaning of data fields

      
Application Number 18470405
Grant Number 12141107
Status In Force
Filing Date 2023-09-19
First Publication Date 2024-03-21
Grant Date 2024-11-12
Owner Ab Initio Technology LLC (USA)
Inventor
  • Joyce, John
  • Huang, David
  • Chang, Andrew
  • Morrison, Niel

Abstract

Techniques for discovering semantic meaning of data in fields included in one or more data sets, the method including: a first field having a previously-assigned label that indicates a semantic meaning of the first field; identifying a set of one or more candidate labels, for potential assignment to the first field instead of the previously-assigned label; evaluating, using a previously-determined label score and a first candidate label score, whether to assign a first candidate label to the first field, the evaluating comprising: when the first candidate label score is at least a first threshold amount greater than a previously-determined label score, presenting the first candidate label to a user by generating an interface through which the user can provide input indicating whether to assign the first candidate label to the first field instead of the previously-determined label.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/21 - Design, administration or maintenance of databases
  • G06F 40/30 - Semantic analysis

89.

TECHNIQUES FOR PRE-ASSIGNMENT VALIDATION OF DATA MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number US2023070643
Publication Number 2024/054726
Status In Force
Filing Date 2023-07-20
Publication Date 2024-03-14
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Clemens, David
  • Procops, Roy

Abstract

Some embodiments provide techniques of enforcing valid data assignments in a data processing system in which data can be dynamically updated by user devices and/or computerized processes. The techniques identify, using a validation rule associated with a data entity, one or more valid values for assignment to an attribute of an instance of the data entity. The techniques identify the valid value(s) by generating a query for the one or more valid values using one or more condition(s) on the attribute in the validation rule, and executing the generated query to obtain the one or more valid values for the first attribute. The attribute may then be assigned one or more of the identified valid value(s).

IPC Classes  ?

  • G06F 16/215 - Improving data qualityData cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

90.

TECHNIQUES FOR PRE-ASSIGNMENT VALIDATION OF DATA MANAGED BY A DATA PROCESSING SYSTEM

      
Application Number 18356153
Status Pending
Filing Date 2023-07-20
First Publication Date 2024-03-14
Owner Ab Initio Technology LLC (USA)
Inventor
  • Clemens, David
  • Procops, Roy

Abstract

Some embodiments provide techniques of enforcing valid data assignments in a data processing system in which data can be dynamically updated by user devices and/or computerized processes. The techniques identify, using a validation rule associated with a data entity, one or more valid values for assignment to an attribute of an instance of the data entity. The techniques identify the valid value(s) by generating a query for the one or more valid values using one or more condition(s) on the attribute in the validation rule, and executing the generated query to obtain the one or more valid values for the first attribute. The attribute may then be assigned one or more of the identified valid value(s).

IPC Classes  ?

  • G06F 16/2457 - Query processing with adaptation to user needs
  • G06F 16/2453 - Query optimisation
  • G06F 16/28 - Databases characterised by their database models, e.g. relational or object models

91.

Generating, accessing, and displaying lineage metadata

      
Application Number 18345706
Grant Number 12405949
Status In Force
Filing Date 2023-06-30
First Publication Date 2024-03-07
Grant Date 2025-09-02
Owner Ab Initio Technology LLC (USA)
Inventor
  • Clemens, David
  • Radivojevic, Dusan
  • Galarneau, Neil

Abstract

Among other things, we describe a method of receiving a portion of metadata from a data source, the portion of metadata describing nodes and edges; generating instances of a data structure representing the portion of metadata, at least one instance of the data structure including an identification value that identifies a corresponding node, one or more property values representing respective properties of the corresponding node, and one or more pointers to respective identification values, each pointer representing an edge associated with a node identified by the corresponding respective identification value; storing the instances of the data structure in random access memory; receiving a query that includes an identification of at least one particular element of data; and using at least one instance of the data structure to cause a display of a computer system to display a representation of lineage of the particular element of data.

IPC Classes  ?

92.

OPERATIONALIZING METADATA

      
Application Number US2023029810
Publication Number 2024/044041
Status In Force
Filing Date 2023-08-09
Publication Date 2024-02-29
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

Abstract

A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/907 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

93.

OPERATIONALIZING METADATA

      
Document Number 03265601
Status Pending
Filing Date 2023-08-09
Open to Public Date 2024-02-29
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 16/907 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

94.

Operationalizing metadata

      
Application Number 18104066
Grant Number 12443615
Status In Force
Filing Date 2023-01-31
First Publication Date 2024-02-29
Grant Date 2025-10-14
Owner Ab Initio Technology LLC (USA)
Inventor
  • Parks, Robert
  • Egenolf, Jonah

Abstract

A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.

IPC Classes  ?

  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/22 - IndexingData structures thereforStorage structures
  • G06F 16/26 - Visual data miningBrowsing structured data
  • G06F 16/901 - IndexingData structures thereforStorage structures

95.

Editor for generating computational graphs

      
Application Number 18112958
Grant Number 12050606
Status In Force
Filing Date 2023-02-22
First Publication Date 2024-01-25
Grant Date 2024-07-30
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schechter, Ian
  • Dickie, Garth

Abstract

Techniques for generating a dataflow graph include generating a first dataflow graph with a plurality of first nodes representing first computer operations in processing data, with at least one of the first computer operations being a declarative operation that specifies one or more characteristics of one or more results of processing of data, and transforming the first dataflow graph into a second dataflow graph for processing data in accordance with the first computer operations, the second dataflow graph including a plurality of second nodes representing second computer operations, with at least one of the second nodes representing one or more imperative operations that implement the logic specified by the declarative operation, where the one or more imperative operations are unrepresented by the first nodes in the first dataflow graph.

IPC Classes  ?

  • G06F 16/2455 - Query execution
  • G06F 9/30 - Arrangements for executing machine instructions, e.g. instruction decode
  • G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
  • G06F 16/242 - Query formulation
  • G06F 16/2457 - Query processing with adaptation to user needs
  • G06F 16/25 - Integrating or interfacing systems involving database management systems
  • G06F 16/901 - IndexingData structures thereforStorage structures

96.

Discovering a semantic meaning of data fields from profile data of the data fields

      
Application Number 18201545
Grant Number 12456016
Status In Force
Filing Date 2023-05-24
First Publication Date 2023-12-21
Grant Date 2025-10-28
Owner Ab Initio Technology LLC (USA)
Inventor
  • Butler, Christopher Thurston
  • Bush, Timothy Spencer

Abstract

A data processing system for discovering a semantic meaning of a field included in one or more data sets is configured to identify a field included in one or more data sets, with the field having an identifier. For that field, the system profiles data values of the field to generate a data profile, accesses a plurality of label proposal tests, and generates a set of label proposals by applying the plurality of label proposal tests to the data profile. The system determines a similarity among the label proposals and selects a classification. The system identifies one of the label proposals as identifying the semantic meaning. The system stores the identifier of the field with the identified one of the label proposals that identifies the semantic meaning.

IPC Classes  ?

  • G06F 40/30 - Semantic analysis
  • G06F 16/908 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
  • G06F 16/93 - Document management systems
  • G06N 20/00 - Machine learning

97.

DATAFLOW GRAPH DATASETS

      
Document Number 03256554
Status Pending
Filing Date 2023-02-24
Open to Public Date 2023-11-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schechter, Ian Robert
  • Dickie, Garth Allen
  • Egenolf, Jonah
  • Isman, Marshall

IPC Classes  ?

98.

DATAFLOW GRAPH DATASETS

      
Application Number 18114212
Status Pending
Filing Date 2023-02-24
First Publication Date 2023-11-09
Owner Ab Initio Technology LLC (USA)
Inventor
  • Schechter, Ian Robert
  • Dickie, Garth Allen
  • Egenolf, Jonah
  • Isman, Marshall

Abstract

Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.

IPC Classes  ?

  • G06F 16/901 - IndexingData structures thereforStorage structures

99.

DATAFLOW GRAPH DATASETS

      
Application Number US2023013841
Publication Number 2023/215028
Status In Force
Filing Date 2023-02-24
Publication Date 2023-11-09
Owner AB INITIO TECHNOLOGY LLC (USA)
Inventor
  • Schechter, Ian, Robert
  • Dickie, Garth, Allen
  • Egenolf, Jonah
  • Isman, Marshall

Abstract

Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.

IPC Classes  ?

100.

Dynamic execution of parameterized applications for the processing of keyed network data streams

      
Application Number 18316333
Grant Number 11983548
Status In Force
Filing Date 2023-05-12
First Publication Date 2023-09-07
Grant Date 2024-05-14
Owner Ab Initio Technology LLC (USA)
Inventor
  • Ravid, Oded
  • Murphy, Trevor

Abstract

A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.

IPC Classes  ?

  • G06F 7/00 - Methods or arrangements for processing data by operating upon the order or content of the data handled
  • G06F 9/445 - Program loading or initiating
  • G06F 9/448 - Execution paradigms, e.g. implementations of programming paradigms
  • G06F 16/178 - Techniques for file synchronisation in file systems
  • G06F 16/2455 - Query execution
  • G06F 16/901 - IndexingData structures thereforStorage structures
  • G06F 8/41 - Compilation
  1     2     3     ...     9        Next Page