A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
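The abstract names "tabulated random votes" as one way to elect the designated leader but does not define it. The following is a minimal single-process sketch, assuming each node casts one uniformly random vote and the votes are tallied; the function name `elect_leader`, the `seed` parameter, and the lowest-identifier tie-break are illustrative assumptions, not details from the disclosure.

```python
import random
from collections import Counter


def elect_leader(node_ids, seed=None):
    """Tabulate one random vote per node and return the winning node id.

    Assumption: ties are broken by choosing the lowest node id. A real
    cluster would exchange votes over the network; here the voting is
    simulated locally with a single random generator.
    """
    if not node_ids:
        raise ValueError("at least one node is required")
    rng = random.Random(seed)
    candidates = sorted(node_ids)
    # Each node casts exactly one vote for a randomly chosen candidate.
    votes = Counter(rng.choice(candidates) for _ in candidates)
    top = max(votes.values())
    return min(node for node, count in votes.items() if count == top)
```

Passing a fixed `seed` makes the simulated election repeatable, which is convenient for testing; RAFT or PAXOS would replace this function entirely rather than extend it.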
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 16/25 - Integrating or interfacing systems involving database management systems
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
2.
DIVIDED MAIN MEMORY WITH COMPUTING DEVICE MEMORY SECTION AND DATABASE MEMORY SECTION
A computing device includes a plurality of nodes, wherein a first node of the plurality of nodes operates in accordance with a computing device operation system (OS) and remaining nodes of the plurality of nodes operate in accordance with a database OS and process a plurality of threads of an application. The computing device further includes a divided main memory that is divided into a computing device memory section and a database memory section, and the database OS determines an allocation of the divided main memory between the computing device memory section and the database memory section, where a first database thread is assigned a buffer of a plurality of buffers of the database memory section, and a first computing device thread utilizes the computing device memory section of the divided main memory.
A method for execution, by a first intermediate node of a plurality of nodes in a database management system, includes processing a message that includes data that is being sent in accordance with a routing path from a source node to a destination node, is a first size, and indicates a next node of the routing path, wherein the first intermediate node is limited to communication with a subset of nodes of the plurality of nodes, and wherein the subset of nodes includes the next node. The method further includes maintaining a tracking table that indicates a total amount of data sent to each node of the subset of nodes during a first time period. The method further includes resetting the total amount of data sent to each of the subset of nodes to zero based on one or more of a command and an initiation of a second time period.
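The tracking table described in this abstract can be pictured as a per-neighbor byte counter. The sketch below is one possible shape for it, assuming message sizes are counted in bytes; the class name `SendTracker` and its method names are hypothetical.

```python
class SendTracker:
    """Per-neighbor totals of data sent during the current time period."""

    def __init__(self, neighbors):
        # The intermediate node may only talk to this subset of nodes.
        self.neighbors = set(neighbors)
        self.bytes_sent = {node: 0 for node in self.neighbors}

    def record(self, next_node, size):
        """Add a forwarded message's size to the next node's running total."""
        if next_node not in self.neighbors:
            raise ValueError(f"{next_node!r} is not a reachable neighbor")
        self.bytes_sent[next_node] += size

    def reset(self):
        """Zero every total, e.g. on a command or when a new period starts."""
        for node in self.bytes_sent:
            self.bytes_sent[node] = 0
```

A caller would invoke `record` each time a message is forwarded and `reset` when the second time period begins.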
G06F 7/08 - Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
A method for execution by at least one computing entity of a database system includes obtaining a plurality of data partitions of a data set for storage in the database system, where the data set is organized in rows and columns, and the rows correspond to data records and the columns correspond to fields of the data records. The method further includes dividing a first partition of the plurality of data partitions to produce a first number of first raw data segments for storage in the database system, where the first number is based on a first data storage coding scheme. The method further includes dividing a second partition of the plurality of data partitions to produce a second number of second raw data segments for storage in the database system, where the second number is based on a second data storage coding scheme.
A payload store within a database management system includes a first set of nodes that include a first node that includes memory, a network interface, a storage device, and at least one processing unit. The at least one processing unit is operable to receive, via the network interface, a first set of data rows that includes a plurality of subsets of data rows and a last subset of data rows. The processing module is further operable to randomly assign the plurality of subsets of data rows and the last subset of data rows between the first set of nodes, where a first number of data rows of the plurality of subsets of data rows is assigned to the first node and a second number of data rows is assigned to another node. The processing module is further operable to store the first number of data rows in the storage device.
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
6.
LOGICAL PARTITIONING OF MEMORY WITHIN A COMPUTING DEVICE
A method includes a host computing device receiving a segment group of data. The method further includes the host computing device evaluating availability status of other computing devices in the storage cluster of computing devices. When one of the other computing devices is unavailable, the method further includes the host computing device dividing the segment group of data into a plurality of lines of data blocks. For a line of the data blocks, the method further includes the host computing device generating at least one parity block. The method further includes the host computing device sending a first data segment that includes first positioned data blocks to a first available computing device. The method further includes the host computing device sending a second data segment that includes second positioned data blocks to a second available computing device. The method further includes the host computing device storing a parity segment.
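As an illustration of the steps in this abstract, the sketch below divides data into lines of equal-size blocks, computes one parity block per line, and groups same-positioned blocks into segments. It assumes simple XOR parity and zero padding of the final line; the abstract does not specify the parity code, and all function names here are hypothetical.

```python
def split_into_lines(data, block_size, blocks_per_line):
    """Split bytes into lines of equal-size blocks, zero-padding the tail."""
    line_bytes = block_size * blocks_per_line
    padded = data + b"\x00" * (-len(data) % line_bytes)
    lines = []
    for start in range(0, len(padded), line_bytes):
        line = padded[start:start + line_bytes]
        lines.append([line[i:i + block_size]
                      for i in range(0, line_bytes, block_size)])
    return lines


def xor_parity(blocks):
    """Return the byte-wise XOR of equal-length blocks."""
    parity = bytearray(len(blocks[0]))
    for block in blocks:
        for i, value in enumerate(block):
            parity[i] ^= value
    return bytes(parity)


def segments_by_position(lines):
    """Segment k collects the k-th positioned block of every line."""
    width = len(lines[0])
    return [b"".join(line[k] for line in lines) for k in range(width)]
```

With XOR parity, any single missing block in a line can be rebuilt by XOR-ing the remaining blocks of that line with the parity block.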
A method includes receiving, by a first computing entity of a database system, a query request that is formatted in accordance with a generic query format. The method further includes generating, by the first computing entity, an initial query plan based on the query request and a query instruction set. The method further includes determining, by the first computing entity, storage parameters. The method further includes determining, by the first computing entity, processing resources for processing the query request based on the storage parameters. The method further includes generating, by the first computing entity, an optimized query plan from the initial query plan based on the storage parameters, the processing resources, and optimization tools. The method further includes sending, by the first computing entity, the optimized query plan to a second computing entity for distribution and execution of the optimized query plan.
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/901 - Indexing; Data structures therefor; Storage structures
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
H04L 67/10 - Protocols in which an application is distributed across nodes in the network
8.
Database Management System for Optimizing Queries via Multiple Optimizers
A large, highly parallel database management system includes thousands of nodes storing a huge volume of data. The database management system includes multiple query optimizers for determining low-cost execution plans for queries. The database management system is adapted to receive a data query. An execution plan generator component of the database management system generates an initial execution plan for the query. The initial execution plan is fed as input to more than one query optimizer. Each optimizer starts with the initial execution plan, generates alternative execution plans, and determines a satisfactory execution plan that incurs the lowest cost. The database management system compares the execution plans selected by the optimizers and selects the one with the lowest cost. The multiple query optimizers run in parallel.
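The flow in this abstract (one initial plan, several optimizers run in parallel, cheapest result wins) can be sketched as follows. Everything concrete here is an assumption made for illustration: plans are modeled as join orders, `TABLE_ROWS` and `plan_cost` form a toy cost model, and the two optimizers are deliberately simple stand-ins.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import permutations

# Hypothetical table sizes used only by the toy cost model below.
TABLE_ROWS = {"orders": 1000, "customers": 100, "regions": 10}


def plan_cost(plan):
    """Toy cost: sum of running products of table sizes along a join order."""
    total, running = 0, 1
    for table in plan:
        running *= TABLE_ROWS[table]
        total += running
    return total


def exhaustive_optimizer(plan):
    """Try every join order and keep the cheapest."""
    return min(permutations(plan), key=plan_cost)


def swap_optimizer(plan):
    """Make one pass of adjacent swaps, keeping any swap that lowers cost."""
    best = tuple(plan)
    for i in range(len(best) - 1):
        candidate = list(best)
        candidate[i], candidate[i + 1] = candidate[i + 1], candidate[i]
        if plan_cost(candidate) < plan_cost(best):
            best = tuple(candidate)
    return best


def run_optimizers(initial_plan, optimizers, cost):
    """Feed the same initial plan to every optimizer; return the cheapest."""
    with ThreadPoolExecutor(max_workers=len(optimizers)) as pool:
        candidates = list(pool.map(lambda opt: opt(initial_plan), optimizers))
    return min(candidates, key=cost)
```

Threads are used here only to show the fan-out and compare structure; a production system would more likely dispatch optimizers to separate processes or nodes.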
A method for execution by a computer of a database management system includes obtaining a dataset that includes a set of data records, where the dataset is associated with a set of data characteristics. The method further includes executing a selected ranked analytical calculation of a ranked list of analytical calculations on the dataset to produce an analytical calculation result. The ranked list is generated by ranking a list of analytical calculations that are able to be executed on the dataset, based on a set of analytical calculation characteristics associated with the list, where an analytical calculation characteristic of the set indicates an estimated execution time to perform an analytical calculation of the list. The selected ranked analytical calculation is selected from the ranked list based on the set of data characteristics.
A method by one or more processing units of a database management system includes transitioning storage of a collection of data from storage in a set of first temperature storage nodes of the database management system to storage in a set of second temperature storage nodes of the database management system based on one or more criteria associated with the collection of data, where each first temperature storage node of the set of first temperature storage nodes is paired with a corresponding second temperature storage node of the set of second temperature storage nodes to form a set of parallel node pairs. The method further includes, after determining the collection of data has been stored in the set of second temperature storage nodes, deleting the collection of data in the set of first temperature storage nodes.
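The copy, confirm, then delete ordering in this abstract is sketched below. The sketch assumes each storage node can be modeled as a dictionary keyed by collection identifier and that "determining the collection has been stored" means comparing the copied value with the source; the function name `transition_collection` is hypothetical.

```python
def transition_collection(collection_id, node_pairs):
    """Move a collection from first-temperature to second-temperature nodes.

    node_pairs is a list of (first_store, second_store) dict pairs, one per
    parallel node pair. Data is deleted from the first-temperature nodes
    only after every pair has been verified.
    """
    # Step 1: each first-temperature node copies its share to its partner.
    for first, second in node_pairs:
        if collection_id in first:
            second[collection_id] = first[collection_id]
    # Step 2: confirm the copy landed on every second-temperature node.
    for first, second in node_pairs:
        if collection_id in first and second.get(collection_id) != first[collection_id]:
            raise RuntimeError("copy could not be verified; nothing deleted")
    # Step 3: only now remove the collection from the first-temperature nodes.
    for first, _ in node_pairs:
        first.pop(collection_id, None)
```

Because deletion happens in a separate final pass, a failed verification leaves the original data intact on every first-temperature node.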
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
11.
Database system with coding cluster and methods for use therewith
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 16/25 - Integrating or interfacing systems involving database management systems
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
12.
Generating different segment groups for storage via storage devices of different storage clusters
A computing system is operable to generate a first plurality of lines of coding blocks that includes a first plurality of data blocks and a first plurality of parity blocks. A first set of segments of a first segment group that collectively include the first plurality of lines of coding blocks are generated, and the first set of segments of the first segment group are stored via a first plurality of computing devices of a first storage cluster of the computing system. A second plurality of lines of coding blocks that includes a second plurality of data blocks and a second plurality of parity blocks are generated. A second set of segments of a second segment group that collectively include the second plurality of lines of coding blocks are generated. The second set of segments of the second segment group are stored via a second plurality of computing devices of a second storage cluster of the computing system.
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/901 - Indexing; Data structures therefor; Storage structures
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
H04L 67/10 - Protocols in which an application is distributed across nodes in the network
13.
Determining a coding scheme for a partition of a data set
A method includes obtaining a plurality of data partitions of a data set for storage in a database system. The method further includes determining a first data storage coding scheme for a first partition of the plurality of data partitions, where the first data storage coding scheme includes first encoding parameters regarding encoding the first partition into first data segments and first parity segments. The method further includes determining a second data storage coding scheme for a second partition of the plurality of data partitions. The method further includes dividing the first partition to produce a first number of first raw data segments, where the first number is based on the first data storage coding scheme. The method further includes dividing the second partition to produce a second number of second raw data segments, where the second number is based on the second data storage coding scheme.
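To make the per-partition division concrete, the sketch below splits two partitions into different numbers of raw data segments. It assumes a coding scheme is described by a count of data segments and a count of parity segments, and that the number of raw data segments equals the data segment count; the `CodingScheme` class and `divide_partition` function are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CodingScheme:
    data_segments: int    # assumed: number of raw data segments to produce
    parity_segments: int  # assumed: parity segments added later by encoding


def divide_partition(rows, scheme):
    """Split rows into scheme.data_segments contiguous, near-equal segments."""
    count = scheme.data_segments
    base, extra = divmod(len(rows), count)
    segments, start = [], 0
    for i in range(count):
        # The first `extra` segments take one additional row each.
        end = start + base + (1 if i < extra else 0)
        segments.append(rows[start:end])
        start = end
    return segments
```

For example, a partition coded with a 4+1 scheme yields four raw data segments, while a partition coded with a 3+2 scheme yields three.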
09 - Scientific and electric apparatus and instruments
Goods & Services
computer peripherals; computer data storage devices, namely, computer disk drives, optical disc drives, blank digital data storage media, blank data storage disks; integrated computer hardware in the nature of computer hardware with embedded computer operating software; computer hardware with preinstalled computer operating software; computer servers; computer data processors and computer memories, microprocessors, central processing units, circuit boards, and integrated circuits; a full line of downloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications, the internet, and cloud systems, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; downloadable database software to manage, monitor, track and organize data; downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, cloud software, computer software to automate data warehousing, operating system software, computer utility software, computer networking software, cloud computing software; downloadable computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, scientific research, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; downloadable computer operating software; downloadable computer operating system programs; downloadable computer software development tools; downloadable computer software for application and database integration; downloadable computer software for creating 
searchable databases of information and data; downloadable computer software for providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; downloadable computer software for the field of warehousing and distribution, to manage transactional data, provide statistical analysis, and produce notifications and reports; Computer storage devices, namely, high-speed storage subsystems for storage and backup of electronic data either locally or via a telecommunications network; Electronic data processing apparatus; downloadable computer programs for use in large volume data storage, access and analysis; downloadable computer programs for use in management of large volume databases; downloadable computer software for use in database management; downloadable computer software for use in large volume database management; recorded computer software and hardware for management of large volume databases, sold as a unit; recorded computer software and hardware for fast read access from large volume databases, sold as a unit; downloadable computer software for creating and managing large volume databases; downloadable computer software for fast read access from large volume databases; downloadable database management software for large volume databases; downloadable database management software for fast read from large volume databases
15.
Confidence-based database management systems and methods for use therewith
A method for execution by one or more database storage nodes of a database management system includes obtaining a row of data that includes a plurality of columns. The method further includes generating a data confidence value for data within one or more columns of the plurality of columns. The method further includes storing the data confidence value in a memory of the database management system.
42 - Scientific, technological and industrial services, research and design
Goods & Services
computer services, namely, providing online non-downloadable computer software and online non-downloadable computer software platforms for database management with machine learning capabilities; providing on-line non-downloadable computer software for database management of large volume databases, large volume datasets, large volume multidimensional datasets with machine learning capabilities; providing on-line non-downloadable computer software for database management, namely, to manage, analyze, retrieve, monitor, maintain, report on, track, organize, structure, model, forecast, present and display data and information with machine learning capabilities; providing temporary use of non-downloadable computer software applications to manage large volume databases, large volume datasets, and large volume multidimensional datasets with machine learning capabilities; computer services, namely, providing temporary use of non-downloadable computer software for use in database management with machine learning capabilities
09 - Scientific and electric apparatus and instruments
Goods & Services
downloadable and recorded computer software and hardware for management of large volume databases, large volume datasets, large volume multidimensional datasets with machine learning capabilities; downloadable and recorded computer software, computer software suite, and computer program to manage, analyze, retrieve, monitor, maintain, report on, track, organize, structure, model, forecast, present and display data and information with machine learning capabilities; downloadable and recorded computer software with machine learning capabilities for managing databases.
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods & Services
downloadable and recorded computer software and hardware for database management, namely, transforming, joining, exploring, and managing large volume of geospatial records; downloadable and recorded computer software and hardware for management of large volume databases, large volume datasets, large volume multidimensional datasets; downloadable and recorded computer software, computer software suite, and computer program to manage, analyze, retrieve, monitor, maintain, report on, track, organize, structure, model, forecast, present and display data and information; downloadable and recorded computer software and hardware for database management, namely, transforming, joining, exploring, and managing large volume of geospatial records; computer services, namely, providing online non-downloadable computer software and online non-downloadable computer software platforms for database management of geospatial data; providing on-line non-downloadable computer software for database management in the field of geospatial data; providing on-line non-downloadable computer software for database management of large volume databases, large volume datasets, large volume multidimensional datasets; providing online non-downloadable computer software for database management, namely, to manage, analyze, retrieve, monitor, maintain, report on, track, organize, structure, model, forecast, present and display data and information; providing temporary use of non-downloadable computer software applications to manage large volume databases, large volume datasets, and large volume multidimensional datasets; providing temporary use of non-downloadable computer software applications for database management, namely, transforming, joining, exploring, and managing large volume of geospatial records; computer services, namely, providing temporary use of non-downloadable computer software for use in database management
09 - Scientific and electric apparatus and instruments
Goods & Services
Computer peripherals; computer data storage devices, namely, computer disk drives, optical disc drives, blank digital data storage media, blank data storage disks; computer hardware integrated with downloadable and recorded computer operating software; computer hardware with preinstalled computer operating software; computer servers; computer data processors and computer memories, microprocessors, central processing units, circuit boards, and integrated circuits; downloadable and recorded computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications, the internet, and cloud systems, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; downloadable and recorded database software to manage, monitor, track and organize data; downloadable and recorded computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, cloud software, computer software to automate data warehousing, operating system software, computer utility software, computer networking software, cloud computing software; downloadable and recorded computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, scientific research, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; downloadable and recorded computer operating software; downloadable and recorded computer operating systems; downloadable and recorded computer software development tools; downloadable and recorded computer software for application and database 
integration; downloadable and recorded computer software for creating searchable databases of information and data; downloadable and recorded computer software for providing an online database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; downloadable and recorded computer software for the field of warehousing and distribution, to manage transactional data, provide statistical analysis, and produce notifications and reports; Computer storage devices, namely, high-speed storage subsystems for storage and backup of electronic data either locally or via a telecommunications network; Electronic data processing apparatus; downloadable and recorded computer programs for use in large volume data storage, access and analysis; downloadable and recorded computer programs for use in management of large volume databases; downloadable and recorded computer software for use in database management; downloadable and recorded computer software for use in large volume database management; downloadable and recorded computer software and hardware for management of large volume databases; downloadable and recorded computer software and hardware for fast read access from large volume databases; downloadable and recorded computer software for creating and managing large volume databases; downloadable and recorded computer software for fast read access from large volume databases; downloadable and recorded database management software for large volume databases; downloadable and recorded database management software for fast read from large volume databases
20.
Randomized data distribution in highly parallel database management system
A payload store within a database management system includes a first set of nodes that include a first node that includes a memory, a network interface, a storage device, and a processing unit. The processing unit receives, via the network interface, a first portion of data to be stored in a plurality of storage devices of the first set of nodes, and splits the first portion into a first set of data rows that includes a plurality of subsets of data rows and a last subset of data rows. The processing unit further randomly assigns the plurality of subsets of data rows and the last subset of data rows between the first set of nodes and stores a first number of data rows in the storage device. The processing unit further sends, via the network interface, a second number of data rows to a second node of the first set of nodes for storage therein.
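The random assignment step in this abstract might look like the sketch below, which assumes subsets are fixed-size runs of consecutive rows (with a shorter last subset when the row count is not a multiple of the subset size) and that each subset goes to one uniformly chosen node. The function name `assign_subsets` and its parameters are hypothetical.

```python
import random


def assign_subsets(rows, node_ids, subset_size, seed=None):
    """Randomly assign consecutive subsets of rows to nodes.

    Returns a dict mapping each node id to the rows it should store.
    The last subset may hold fewer than subset_size rows.
    """
    rng = random.Random(seed)
    assignment = {node: [] for node in node_ids}
    for start in range(0, len(rows), subset_size):
        subset = rows[start:start + subset_size]
        # Every subset, including the short last one, picks a node at random.
        assignment[rng.choice(node_ids)].extend(subset)
    return assignment
```

The receiving node would keep the rows assigned to itself and send the remaining entries of the returned mapping to the other nodes over the network.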
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
G06F 3/06 - Digital input from, or digital output to, record carriers
21.
Dispersing data and parity across a set of segments stored via a computing system
A computing system is operable to generate a plurality of lines of coding blocks that includes a plurality of data blocks and a plurality of parity blocks. Each of the plurality of lines of coding blocks includes a corresponding subset of data blocks and a corresponding subset of parity blocks. A set of segments of a segment group are generated to collectively include the plurality of lines of coding blocks. Different coding blocks of each of the plurality of lines of coding blocks are included within different ones of the set of segments, and the plurality of parity blocks are dispersed across all of the set of segments. The set of segments are stored via a plurality of node sets, where different segments of the set of segments are stored via memory resources of different node sets of the plurality of node sets.
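One way to disperse parity across every segment is to rotate the parity position from line to line, similar in spirit to RAID 5. The sketch below assumes one XOR parity block per line and a rotation of `line_index % width`; the abstract does not state the parity code or the placement rule, and the function names are hypothetical.

```python
def xor_blocks(blocks):
    """Return the byte-wise XOR of equal-length blocks."""
    out = bytearray(len(blocks[0]))
    for block in blocks:
        for i, value in enumerate(block):
            out[i] ^= value
    return bytes(out)


def build_segments(lines):
    """Place each line's blocks in different segments, rotating the parity.

    lines is a list of lines, each a list of equal-length data blocks.
    Returns width = len(line) + 1 segments, each a list of coding blocks.
    """
    width = len(lines[0]) + 1
    segments = [[] for _ in range(width)]
    for index, data_blocks in enumerate(lines):
        coded = list(data_blocks)
        # Assumed placement rule: parity for line i sits in segment i % width.
        coded.insert(index % width, xor_blocks(data_blocks))
        for segment, block in zip(segments, coded):
            segment.append(block)
    return segments
```

With this rotation, every segment receives parity once the number of lines reaches the number of segments, so no single segment holds all of the parity blocks.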
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
22.
Database management system with coding cluster and methods for use therewith
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
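The abstract names tabulated random votes as one possible consensus mechanism without giving details. The following is a minimal sketch of what such a scheme could look like. The function names, the seeded random generator, and the tie-break rule (lowest node identifier wins) are assumptions made for illustration and are not taken from the disclosure.

```python
import random
from collections import Counter
from typing import Dict, List


def cast_votes(nodes: List[str], rng: random.Random) -> Dict[str, str]:
    """Each node votes for a randomly chosen node (possibly itself)."""
    return {voter: rng.choice(nodes) for voter in nodes}


def tabulate(votes: Dict[str, str]) -> str:
    """Return the node with the most votes; ties go to the lowest node id (assumption)."""
    tally = Counter(votes.values())
    return min(tally, key=lambda node: (-tally[node], node))


nodes = ["node-a", "node-b", "node-c", "node-d", "node-e"]
leader = tabulate(cast_votes(nodes, random.Random(7)))
print("designated leader:", leader)
```

Because `tabulate` is deterministic, every node that sees the same set of votes arrives at the same designated leader. A production system would also need to handle lost votes and node failures, which is where algorithms such as RAFT or PAXOS come in.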
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 16/25 - Integrating or interfacing systems involving database management systems
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
42 - Scientific, technological and industrial services, research and design
Goods & Services
Application service provider (ASP), namely, hosting of computer software for others; leasing and rental of computer hardware and computer peripherals; leasing computer hardware and computer peripherals; computer services, namely, providing temporary use of nondownloadable computer software for use in database management; computer services, namely, providing online non-downloadable computer software and platforms for database management of time based data; providing an on-line computer database in the field of time based data regarding electronic document and e-mail time-stamping; data as a service (DAAS) services featuring online non-downloadable software for data analytics and tracking of time based data; technical support services, namely, remote and on-site infrastructure management services for monitoring, administration and management of public and private cloud computing IT and application systems; computer services, namely, providing cloud computing featuring a full line of nondownloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications and the internet, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; providing temporary use of non-downloadable database software to manage, monitor, track and organize data; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, computer software to automate data warehousing, content management software, operating system software, computer utility software, computer networking software, cloud computing software, computer networks, and global communications networks; providing temporary use of non-downloadable 
computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; Data automation and collection service using proprietary software to evaluate, analyze and collect service data; Data mining; Data warehousing; Database development services; Design of computer database; Development and creation of computer programs for data processing; Development, updating and maintenance of software and database systems; Maintenance of on-line databases for others; Application service provider featuring software for use in providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; Application service provider (ASP), namely, hosting computer software applications of others; Application service provider (ASP) featuring software for use in data management; Application service provider featuring application programming interface (API) software for data management; Application service provider, namely, hosting, managing, developing, analyzing, and maintaining applications, software and web sites of others in the fields of data management; Computer programming services for data management; Computer services, namely, hosting an interactive web site that allows data management; Computer software development in the field of data management; Providing a website that gives computer users the ability to manage data; Cloud storage services for electronic data; Cloud storage services for electronic files; Computer services, namely, cloud hosting provider services
42 - Scientific, technological and industrial services, research and design
Goods & Services
Application service provider (ASP), namely, hosting of computer software for others; leasing and rental of computer hardware and computer peripherals; leasing computer hardware and computer peripherals; computer services, namely, providing temporary use of nondownloadable computer software for use in database management; computer services, namely, providing online nondownloadable computer software and platforms for database management of time based data; providing an on-line computer database in the field of time based data regarding electronic document and e-mail time-stamping; data as a service (DAAS) services featuring online nondownloadable software for data analytics and tracking of time based data; technical support services, namely, remote and on-site infrastructure management services for monitoring, administration and management of public and private cloud computing IT and application systems; computer services, namely, providing cloud computing featuring a full line of nondownloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications and the internet, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; providing temporary use of non-downloadable database software to manage, monitor, track and organize data; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, computer software to automate data warehousing, content management software, operating system software, computer utility software, computer networking software, cloud computing software, computer networks, and global communications networks; providing temporary use of non-downloadable 
computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; Data automation and collection service using proprietary software to evaluate, analyze and collect service data; Data mining; Data warehousing; Database development services; Design of computer database; Development and creation of computer programs for data processing; Development, updating and maintenance of software and database systems; Maintenance of on-line databases for others; Application service provider featuring software for use in providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; Application service provider (ASP), namely, hosting computer software applications of others; Application service provider (ASP) featuring software for use in data management; Application service provider featuring application programming interface (API) software for data management; Application service provider, namely, hosting, managing, developing, analyzing, and maintaining applications, software and web sites of others in the fields of data management; Computer programming services for data management; Computer services, namely, hosting an interactive web site that allows data management; Computer software development in the field of data management; Providing a website that gives computer users the ability to manage data
09 - Scientific and electric apparatus and instruments
Goods & Services
Computer peripherals; computer data storage devices, namely, computer disk drives, optical disc drives, blank digital data storage media, blank data storage disks; computer hardware integrated in the nature of embedded with downloadable and recorded computer operating software; computer hardware with preinstalled computer operating software; computer servers; computer data processors and computer memories, microprocessors, central processing units, circuit boards, and integrated circuits; downloadable and recorded computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications, the internet, and cloud systems, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; downloadable and recorded database software to manage, monitor, track and organize data; downloadable and recorded computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, cloud software, computer software to automate data warehousing, operating system software, computer utility software, computer networking software, cloud computing software; downloadable and recorded computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, scientific research, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; downloadable and recorded computer operating software; downloadable and recorded computer operating system software; downloadable and recorded computer software development tools; downloadable and recorded computer 
software for application and database integration; downloadable and recorded computer software for creating searchable databases of information and data; downloadable and recorded computer software for providing an online database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; downloadable and recorded computer software for the field of warehousing and distribution, to manage transactional data, provide statistical analysis, and produce notifications and reports; Computer storage devices, namely, highspeed storage subsystems for storage and backup of electronic data either locally or via a telecommunications network; Electronic data processing apparatus; downloadable and recorded computer programs for use in large volume data storage, access and analysis; downloadable and recorded computer programs for use in management of large volume databases; downloadable and recorded computer software for use in database management; downloadable and recorded computer software for use in large volume database management; downloadable and recorded computer software and hardware for management of large volume databases, sold as a unit; downloadable and recorded computer software and hardware for fast read access from large volume databases, sold as a unit; downloadable and recorded computer software for creating and managing large volume databases; downloadable and recorded computer software for fast read access from large volume databases; downloadable and recorded database management software for large volume databases; downloadable and recorded database management software for fast read from large volume databases
26.
Efficient database management system and method for use therewith
A method for execution by a computer of a database management system includes obtaining a dataset that includes a set of data records, where the dataset is associated with a set of data characteristics. The method further includes ranking analytical calculations of a list of analytical calculations that are able to be executed on the dataset based on the set of data characteristics and a set of analytical calculation characteristics associated with the list of analytical calculations to produce a ranked list of analytical calculations. The method further includes selecting an analytical calculation from the ranked list of analytical calculations based on the ranking. The method further includes executing the selected analytical calculation on the dataset to produce an analytical calculation result.
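The rank, select, and execute flow in this abstract can be sketched as follows. The scoring rule, the characteristic names (`has_outliers`, `record_count`, `robust_to_outliers`, `cost_per_record`), and the two candidate calculations are hypothetical and were chosen only to make the example concrete.

```python
import statistics
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Calculation:
    name: str
    run: Callable[[List[float]], float]
    robust_to_outliers: bool   # hypothetical calculation characteristic
    cost_per_record: float     # hypothetical calculation characteristic


def score(calc: Calculation, traits: Dict[str, object]) -> float:
    """Higher is better: reward robustness when outliers exist, penalize cost."""
    bonus = 10.0 if traits["has_outliers"] and calc.robust_to_outliers else 0.0
    return bonus - calc.cost_per_record * traits["record_count"]


def rank(calcs: List[Calculation], traits: Dict[str, object]) -> List[Calculation]:
    return sorted(calcs, key=lambda c: score(c, traits), reverse=True)


dataset = [1.0, 2.0, 3.0, 4.0, 100.0]
traits = {"record_count": len(dataset), "has_outliers": True}
candidates = [
    Calculation("mean", statistics.mean, robust_to_outliers=False, cost_per_record=0.1),
    Calculation("median", statistics.median, robust_to_outliers=True, cost_per_record=0.2),
]
ranked = rank(candidates, traits)
selected = ranked[0]              # select based on the ranking
result = selected.run(dataset)    # execute the selected calculation
print(selected.name, result)
```

Here the outlier in the dataset pushes the median ahead of the mean despite its higher assumed cost.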
A large, highly parallel database management system includes thousands of nodes storing a huge volume of data. The database management system includes multiple query optimizers for determining low cost execution plans for queries. The database management system is adapted to receive a data query. An execution plan generator component of the database management system generates an initial execution plan for the query. The initial execution plan is fed as input to more than one query optimizer. Each optimizer starts with the initial execution plan, generates alternative execution plans, and determines a satisfactory execution plan that incurs the lowest cost. The database management system compares the execution plans selected by the optimizers and selects the one with the lowest cost. The multiple query optimizers run in parallel.
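A toy version of this arrangement is sketched below: two optimizers receive the same initial plan, run in parallel, and the cheapest proposal wins. The cost model (sum of intermediate result sizes for a join order), the table sizes, and both optimizer strategies are assumptions for illustration only.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import permutations
from typing import Callable, List, Tuple

Plan = Tuple[str, ...]  # a plan is modeled as a join order (assumption)
TABLE_ROWS = {"orders": 1000, "customers": 100, "regions": 10}  # hypothetical sizes


def plan_cost(plan: Plan) -> int:
    """Toy cost: sum of intermediate result sizes when joining in the given order."""
    cost, rows = 0, 1
    for table in plan:
        rows *= TABLE_ROWS[table]
        cost += rows
    return cost


def adjacent_swap_optimizer(initial: Plan) -> Plan:
    """Consider the initial plan plus every single adjacent swap."""
    candidates = [initial]
    for i in range(len(initial) - 1):
        swapped = list(initial)
        swapped[i], swapped[i + 1] = swapped[i + 1], swapped[i]
        candidates.append(tuple(swapped))
    return min(candidates, key=plan_cost)


def exhaustive_optimizer(initial: Plan) -> Plan:
    """Consider every ordering of the tables in the initial plan."""
    return min(permutations(initial), key=plan_cost)


def choose_plan(initial: Plan, optimizers: List[Callable[[Plan], Plan]]) -> Plan:
    # Run all optimizers in parallel on the same initial plan, then compare proposals.
    with ThreadPoolExecutor(max_workers=len(optimizers)) as pool:
        proposals = list(pool.map(lambda optimize: optimize(initial), optimizers))
    return min(proposals, key=plan_cost)


initial_plan: Plan = ("orders", "customers", "regions")
best_plan = choose_plan(initial_plan, [adjacent_swap_optimizer, exhaustive_optimizer])
print(best_plan, plan_cost(best_plan))
```

Running different search strategies side by side lets a fast, shallow optimizer and a slower, thorough one both contribute, with the final comparison made on cost alone.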
A method for managing data storage and retrieval and operated within a database management system includes retrieving a set of confidence data records from memory of the database management system, where a confidence data record of the set of confidence data records includes a set of data columns and one or more confidence columns populated with one or more data confidence values associated with data within one or more data columns of the set of data columns. The method further includes analyzing the set of confidence data records to produce a result, where the result includes a confidence level that is based on corresponding ones of the one or more data confidence values associated with the data within one or more data columns of the set of data columns.
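To make the idea of confidence columns concrete, the sketch below stores a confidence value next to each data value and derives a result together with a confidence level. The column names, the confidence-weighted mean, and the choice of the minimum as the aggregate confidence are assumptions. The abstract does not say how the confidence level is computed.

```python
from typing import Dict, List, Tuple

Record = Dict[str, float]


def analyze(records: List[Record], data_col: str, conf_col: str) -> Tuple[float, float]:
    """Return (confidence-weighted mean, overall confidence level).

    The overall confidence is taken as the weakest input confidence, a
    conservative rule assumed here for illustration.
    """
    weights = [record[conf_col] for record in records]
    weighted_sum = sum(record[data_col] * record[conf_col] for record in records)
    return weighted_sum / sum(weights), min(weights)


confidence_records = [
    {"temp": 20.0, "temp_conf": 0.9},
    {"temp": 22.0, "temp_conf": 0.6},
    {"temp": 30.0, "temp_conf": 0.3},
]
value, confidence_level = analyze(confidence_records, "temp", "temp_conf")
print(round(value, 2), confidence_level)
```

Low-confidence readings still contribute to the result but pull it less, and the reported confidence level tells the caller how far the answer can be trusted.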
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 16/25 - Integrating or interfacing systems involving database management systems
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
30.
Data transition in highly parallel database management system
A method by one or more processing units of a database management system includes determining to transition a collection of data from storage in a set of first temperature storage nodes to storage in a set of second temperature storage nodes based on one or more criteria associated with the collection of data, where each first temperature storage node of the set of first temperature storage nodes is paired with a corresponding second temperature storage node of the set of second temperature storage nodes to form a set of parallel node pairs. The method further includes transitioning storage of the collection of data in parallel, by each parallel node pair of the set of parallel node pairs, from storage in the set of first temperature storage nodes to storage in the set of second temperature storage nodes.
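The pairwise parallel transition can be sketched as below. Nodes are modeled as plain lists of records, and the transition criterion (record age above a threshold) is a hypothetical stand-in for the "one or more criteria" of the abstract.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Dict, List, Tuple

Record = Dict[str, int]
Node = List[Record]  # a storage node modeled as a list of records (assumption)


def transition_pair(hot: Node, cold: Node, max_age_days: int) -> int:
    """Move records older than max_age_days from the hot node to its paired cold node."""
    moving = [r for r in hot if r["age_days"] > max_age_days]
    cold.extend(moving)
    hot[:] = [r for r in hot if r["age_days"] <= max_age_days]
    return len(moving)


def transition_all(pairs: List[Tuple[Node, Node]], max_age_days: int) -> int:
    # Each pair touches only its own two nodes, so the pairs can run in parallel.
    with ThreadPoolExecutor(max_workers=len(pairs)) as pool:
        moved = pool.map(lambda pair: transition_pair(pair[0], pair[1], max_age_days), pairs)
        return sum(moved)


hot_nodes: List[Node] = [
    [{"id": 1, "age_days": 40}, {"id": 2, "age_days": 3}],
    [{"id": 3, "age_days": 90}],
]
cold_nodes: List[Node] = [[], []]
moved_count = transition_all(list(zip(hot_nodes, cold_nodes)), max_age_days=30)
print(moved_count, hot_nodes, cold_nodes)
```

Pairing each first temperature node with exactly one second temperature node keeps the transfers independent, which is what allows them to proceed without coordination between pairs.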
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
G06F 3/06 - Digital input from, or digital output to, record carriers
A method includes generating a plurality of parity blocks from a plurality of lines of data blocks. The plurality of lines of data blocks are stored in data sections of memory of a cluster of computing devices of the computing system by distributing storage of individual data blocks of the plurality of lines of data blocks among unique data sections of the cluster of computing devices. The plurality of parity blocks are stored in parity sections of memory of the cluster of computing devices by distributing storage of parity blocks of the plurality of parity blocks among unique parity sections of the cluster of computing devices.
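As an illustration of how parity blocks generated from lines of data blocks allow a lost block to be rebuilt, here is a minimal sketch using byte-wise XOR parity. The abstract does not name a specific code, so XOR is a swapped-in technique. The device model (one dictionary per device with separate "data" and "parity" sections) and the rule for choosing the parity device are also assumptions.

```python
from functools import reduce
from typing import Dict, List

Device = Dict[str, Dict[int, bytes]]  # {"data": {line: block}, "parity": {line: block}}


def xor_blocks(blocks: List[bytes]) -> bytes:
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def store_lines(lines: List[List[bytes]], num_devices: int) -> List[Device]:
    """Store each line's data blocks on distinct devices and its parity on another one."""
    devices: List[Device] = [{"data": {}, "parity": {}} for _ in range(num_devices)]
    for index, line in enumerate(lines):
        parity_device = index % num_devices  # assumed placement rule
        data_devices = [d for d in range(num_devices) if d != parity_device]
        for device, block in zip(data_devices, line):
            devices[device]["data"][index] = block
        devices[parity_device]["parity"][index] = xor_blocks(line)
    return devices


def recover_block(devices: List[Device], lost: int, line_index: int) -> bytes:
    """Rebuild the block of one line held by a lost device from the surviving blocks."""
    survivors = [
        device[section][line_index]
        for number, device in enumerate(devices)
        if number != lost
        for section in ("data", "parity")
        if line_index in device[section]
    ]
    return xor_blocks(survivors)


lines = [[b"ab", b"cd", b"ef"], [b"gh", b"ij", b"kl"]]
cluster = store_lines(lines, num_devices=4)
rebuilt = recover_block(cluster, lost=2, line_index=0)
print(rebuilt)
```

Single XOR parity tolerates the loss of one block per line. Schemes that survive more failures need more parity blocks per line and a stronger code.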
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
42 - Scientific, technological and industrial services, research and design
Goods & Services
Cloud storage services for electronic data; Cloud storage services for electronic files; Computer programming services for data management; Computer software development in the field of data management; Data warehousing; Database development services; Development, updating and maintenance of software and database systems; Maintenance of on-line databases for others; Providing a website that gives computer users the ability to manage data; Computer services, namely, cloud hosting provider services; Computer services, namely, hosting an interactive web site that allows data management
33.
Segmenting a partition of a data set based on a data storage coding scheme
A method includes receiving, by a first computing entity of a database system, a data set that is organized in rows and columns. The method further includes determining whether to partition the data set based on a parameter associated with the data set. When determining to partition the data set, the method includes determining partitioning parameters for the data set, and partitioning the data set into a plurality of data partitions in accordance with the partitioning parameters. The method further includes determining a first coding scheme for a first data partition and determining a first number of first raw data segments for a first segment group of the first partition based on the coding scheme. The method further includes dividing the first partition to produce the first number of first raw data segments for storage in the database system.
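The partition-then-segment sequence in this abstract could look roughly like the sketch below. The partitioning parameter (a maximum row count), the dictionary used to describe a coding scheme, and the even split of rows into raw data segments are hypothetical choices for illustration.

```python
from typing import Dict, List

Row = Dict[str, int]


def partition_rows(rows: List[Row], max_rows: int) -> List[List[Row]]:
    """Partition only when the data set exceeds max_rows (assumed partitioning parameter)."""
    if len(rows) <= max_rows:
        return [rows]
    return [rows[i:i + max_rows] for i in range(0, len(rows), max_rows)]


def raw_segment_count(coding_scheme: Dict[str, int]) -> int:
    """The number of raw data segments follows from the coding scheme (assumed mapping)."""
    return coding_scheme["data_segments"]


def divide_partition(partition: List[Row], count: int) -> List[List[Row]]:
    """Split a partition into `count` raw data segments of near-equal size."""
    base, extra = divmod(len(partition), count)
    segments, start = [], 0
    for i in range(count):
        end = start + base + (1 if i < extra else 0)
        segments.append(partition[start:end])
        start = end
    return segments


rows = [{"id": i, "value": i * i} for i in range(10)]
partitions = partition_rows(rows, max_rows=6)
scheme = {"data_segments": 3, "parity_segments": 1}  # hypothetical 3+1 coding scheme
first_segments = divide_partition(partitions[0], raw_segment_count(scheme))
print([len(p) for p in partitions], [[r["id"] for r in s] for s in first_segments])
```

Tying the segment count to the coding scheme means the raw data segments of a partition already match the shape of the segment group that will later receive parity.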
A method for execution, by a first intermediate node of a plurality of nodes in a database management system, includes receiving a message, where the first intermediate node is limited to communication with a subset of nodes of the plurality of nodes, where the message: includes data that is being sent in accordance with a routing path, is a first size, and indicates a next node of the routing path, and where the subset of nodes includes the next node. The method continues by generating a revised message, wherein the revised message includes the data and has a second size. The method continues by determining whether there is at least one additional intermediate node after the next node in the routing path. When there is, the method continues by determining an optimal route for forwarding the revised message via a node of the subset of nodes, and sending the revised message to the node.
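The forwarding decision at an intermediate node can be sketched as follows. The message layout, the per-hop header size used to model the first and second sizes, and the "cheapest neighbor link" notion of an optimal route are all assumptions. The abstract does not define how the route is optimized or why the size changes.

```python
from typing import Dict, List, Set, Tuple

HOP_HEADER_BYTES = 4  # hypothetical routing overhead per remaining hop


def message_size(message: Dict) -> int:
    """Size model: payload plus a fixed header for each hop still listed in the path."""
    return len(message["data"]) + HOP_HEADER_BYTES * len(message["path"])


def handle_message(message: Dict, neighbors: Set[str], link_cost: Dict[str, int]) -> Tuple[str, Dict]:
    """Return (neighbor to send to, revised message) for a received message."""
    next_node = message["path"][0]
    if next_node not in neighbors:
        raise ValueError("next node must be in the reachable subset of nodes")
    # The revised message keeps the data but drops the hop that is being consumed,
    # so it has a different (second) size.
    revised = {"data": message["data"], "path": message["path"][1:]}
    if len(revised["path"]) > 1:
        # At least one more intermediate node follows: pick the cheapest reachable neighbor.
        via = min(sorted(neighbors), key=lambda node: link_cost[node])
    else:
        via = next_node
    return via, revised


neighbors = {"B", "C"}
link_cost = {"B": 5, "C": 2}  # hypothetical link costs
message = {"data": b"payload", "path": ["B", "D", "E"]}
via, revised = handle_message(message, neighbors, link_cost)
print(via, message_size(message), message_size(revised))
```

When only the destination remains after the next node, the sketch simply sends to the next node named in the routing path.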
G06F 7/08 - Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
42 - Scientific, technological and industrial services, research and design
Goods & Services
Application service provider (ASP), namely, hosting of computer software of others; leasing and rental of computer hardware and computer peripherals; leasing computer hardware and computer peripherals; computer services, namely, providing temporary use of non-downloadable computer software for use in database management; computer services, namely, providing on-line non-downloadable computer software for database management of time based data; computer services, namely, providing on-line non-downloadable computer software programs for database management of time based data; data as a service (DAAS) services featuring software for data analytics and tracking of time based data; technical support services, namely, remote and on-site infrastructure management services for monitoring, administration and management of public and private cloud computing IT and application systems; computer services, namely, providing cloud computing featuring a full line of nondownloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications and the internet, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; providing temporary use of nondownloadable database software to manage, monitor, track and organize data; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, computer software to automate data warehousing, content management software, operating system software, computer utility software, computer networking software, cloud computing software, computer networks, and global communications networks; providing temporary use of non-downloadable computer software applications to 
manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; Data automation and collection service using proprietary software to evaluate, analyze and collect service data; Data mining; Data warehousing; Database development services; Design of computer database; Development and creation of computer programs for data processing; Development, updating and maintenance of software and database systems; Maintenance of online databases for others; Application service provider featuring software for use in providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; Application service provider (ASP), namely, hosting computer software applications of others; Application service provider (ASP) featuring software for use in data management; Application service provider featuring application programming interface (API) software for data management; Application service provider, namely, hosting, managing, developing, analyzing, and maintaining applications, software and web sites of others in the fields of data management; Computer programming services for data management; Computer services, namely, hosting an interactive web site that allows data management; Computer software development in the field of data management; Providing a website that gives computer users the ability to manage data
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods & Services
Computer peripherals; computer data storage devices, namely, computer disk drives, optical disc drives, blank digital data storage media, blank data storage disks; computer hardware integrated with downloadable and recorded computer operating software; computer hardware with preinstalled computer operating software; computer servers; computer data processors and computer memories, microprocessors, central processing units, circuit boards, and integrated circuits; downloadable and recorded computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications, the internet, and cloud systems, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; downloadable and recorded database software to manage, monitor, track and organize data; downloadable and recorded computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, cloud software, computer software to automate data warehousing, operating system software, computer utility software, computer networking software, cloud computing software; downloadable and recorded computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, scientific research, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; downloadable and recorded computer operating software; downloadable and recorded computer operating systems; downloadable and recorded computer software development tools; downloadable and recorded computer software for application and database 
integration; downloadable and recorded computer software for creating searchable databases of information and data; downloadable and recorded computer software for providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; downloadable and recorded computer software for the field of warehousing and distribution, to manage transactional data, provide statistical analysis, and produce notifications and reports; Computer storage devices, namely, high-speed storage subsystems for storage and backup of electronic data either locally or via a telecommunications network; Electronic data processing apparatus; downloadable and recorded computer programs for use in large volume data storage, access and analysis; downloadable and recorded computer programs for use in management of large volume databases; downloadable and recorded computer software for use in database management; downloadable and recorded computer software for use in large volume database management; downloadable and recorded computer software and hardware for management of large volume databases; downloadable and recorded computer software and hardware for fast read access from large volume databases; downloadable and recorded computer software for creating and managing large volume databases; downloadable and recorded computer software for fast read access from large volume databases; downloadable and recorded database management software for large volume databases; downloadable and recorded database management software for fast read from large volume databases; Application service provider (ASP), namely, hosting of computer software of others; leasing and rental of computer hardware and computer peripherals; leasing computer hardware and computer peripherals; computer services, namely, providing temporary use of non-downloadable computer software for use in database management; computer services, 
namely, providing on-line non-downloadable computer software for database management of time based data; computer services, namely, providing on-line non-downloadable computer software programs for database management of time based data; data as a service (DAAS) services featuring software for data analytics and tracking of time based data; technical support services, namely, remote and on-site infrastructure management services for monitoring, administration and management of public and private cloud computing IT and application systems; computer services, namely, providing cloud computing featuring a full line of nondownloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications and the internet, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; providing temporary use of non-downloadable database software to manage, monitor, track and organize data; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, computer software to automate data warehousing, content management software, operating system software, computer utility software, computer networking software, cloud computing software, computer networks, and global communications networks; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; Data automation 
and collection service using proprietary software to evaluate, analyze and collect service data; Data mining; Data warehousing; Database development services; Design of computer database; Development and creation of computer programs for data processing; Development, updating and maintenance of software and database systems; Maintenance of on-line databases for others; Application service provider featuring software for use in providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; Application service provider (ASP), namely, hosting computer software applications of others; Application service provider (ASP) featuring software for use in data management; Application service provider featuring application programming interface (API) software for data management; Application service provider, namely, hosting, managing, developing, analyzing, and maintaining applications, software and web sites of others in the fields of data management; Computer programming services for data management; Computer services, namely, hosting an interactive web site that allows data management; Computer software development in the field of data management; Providing a website that gives computer users the ability to manage data
A method includes a host computing device receiving a segment group of data. The method further includes the host computing device evaluating availability status of other computing devices in the storage cluster of computing devices. When one of the other computing devices is unavailable, the method further includes the host computing device dividing the segment group of data into a plurality of lines of data blocks. For a line of the data blocks, the method further includes the host computing device generating at least one parity block. The method further includes the host computing device sending a first data segment that includes first positioned data blocks to a first available computing device. The method further includes the host computing device sending a second data segment that includes second positioned data blocks to a second available computing device. The method further includes the host computing device storing a parity segment.
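The parity step described above can be illustrated with a minimal sketch. It assumes a single XOR parity block per line of equally sized data blocks; the abstract does not name the parity code, and the function names `xor_parity` and `rebuild_missing` are hypothetical.

```python
from functools import reduce


def xor_parity(line: list[bytes]) -> bytes:
    """Return one parity block: the bytewise XOR of equally sized data blocks."""
    if not line:
        raise ValueError("a line needs at least one data block")
    size = len(line[0])
    if any(len(block) != size for block in line):
        raise ValueError("all blocks in a line must have the same size")
    # zip(*line) walks the blocks column by column, yielding one int per block.
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*line))


def rebuild_missing(surviving: list[bytes], parity: bytes) -> bytes:
    """Recover the single missing data block of a line (XOR parity assumption)."""
    return xor_parity(surviving + [parity])


line = [b"\x01\x02", b"\x04\x08", b"\x10\x20"]
parity = xor_parity(line)
recovered = rebuild_missing([line[0], line[2]], parity)
```

With one XOR parity block, any single lost block of a line can be rebuilt from the remaining blocks; tolerating more losses would need a different code.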
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 9/48 - Program initiating; Program switching, e.g. by interrupt
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
G06F 11/10 - Adding special bits or symbols to the coded information, e.g. parity check, casting out nines or elevens
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
H03M 7/40 - Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
09 - Scientific and electric apparatus and instruments
Goods & Services
Downloadable and recorded computer programs for grouping data by time and identification key; Downloadable and recorded computer programs for use in large volume data storage, access and analysis; Downloadable and recorded computer programs for use in management of large volume databases; Downloadable and recorded computer software for use in database management; Downloadable and recorded computer software for use in large volume database management; Downloadable and recorded computer software for creating and managing large volume databases; Downloadable and recorded computer software for fast read access from large volume databases; Downloadable and recorded database management software for large volume databases; Downloadable and recorded database management software for fast read from large volume databases
A method includes generating, by a processing entity of a computing system, a plurality of parity blocks from a plurality of lines of data blocks. A first number of parity blocks of the plurality of parity blocks is generated from a first line of data blocks of the plurality of lines of data blocks. The method further includes storing, by the processing entity, the plurality of lines of data blocks in data sections of memory of a cluster of computing devices of the computing system in accordance with a read/write balancing pattern and a restricted file system. The method further includes storing, by the processing entity, the plurality of parity blocks in parity sections of memory of the cluster of computing devices in accordance with the read/write balancing pattern and the restricted file system.
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
40.
Database management system and methods for use therewith
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 3/06 - Digital input from, or digital output to, record carriers
41.
System and method for designating a leader using a consensus protocol within a database management system
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is elected using a consensus algorithm, such as tabulated random votes, RAFT or PAXOS. The designated leader is responsible for managing open coding lines, and determining when to close an open coding line.
A large highly parallel database management system includes thousands of nodes storing a huge volume of data. The database management system includes multiple query optimizers for determining low cost execution plans for queries. The database management system is adapted to receive a data query. An execution plan generator component of the database management system generates an initial execution plan for the query. The initial execution plan is fed as input to more than one query optimizer. Each optimizer starts with the initial execution plan, generates alternative execution plans, and determines a satisfactory execution plan that incurs the lowest cost. The database management system compares the execution plans selected by the optimizers and selects the one with the lowest cost. The multiple query optimizers run in parallel.
A database management system operates by: obtaining a dataset from a memory of the database management system, wherein the dataset includes a set of data records; determining a set of data characteristics of the dataset; determining a list of analytical calculations that are able to be executed on the dataset; determining an analytical calculation characteristic for each analytical calculation of the list of analytical calculations to produce a set of analytical calculation characteristics, wherein the analytical calculation characteristic indicates an estimated execution time to perform the analytical calculation; ranking each analytical calculation of the list of analytical calculations based on the set of data characteristics and the set of analytical calculation characteristics to produce a ranked list of analytical calculations; selecting an analytical calculation from the ranked list of analytical calculations based on the ranking; and executing the selected analytical calculation on the dataset to produce an analytical calculation result.
A massively parallel database management system includes an index store and a payload store including a set of storage systems of different temperatures. Both stores include a list of clusters. Each cluster includes a set of nodes with storage devices forming a group of segments. Nodes and clusters are connected over high speed links. The list of clusters within the payload store includes clusters of different temperatures. The payload store transitions data of a segment group from a higher temperature to a segment group in a lower temperature cluster in parallel. A node moves data of a segment in the higher temperature cluster to a corresponding node's segment in the lower temperature cluster. Once the data is written in the destination segment in the lower temperature cluster, the source segment is freed to store other data. The temperatures include blazing, hot, warm and cold.
G06F 16/00 - Information retrieval; Database structures therefor; File system structures therefor
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 3/06 - Digital input from, or digital output to, record carriers
45.
Database management systems for managing data with data confidence
A method for managing data storage and retrieval and operated within a database management system includes determining whether a data confidence value of a data record that is stored or is to be stored in memory of the database management system is less than a data confidence threshold, where the data confidence value includes one or more of an estimate of an accuracy of data within the data record, an estimate of the accuracy of the data record, and an estimate of a reliability level of the data. When the data confidence value is less than the data confidence threshold, the method continues by generating a confidence data record based on the data record and the data confidence value. The method continues by storing the confidence data record in memory of the database management system.
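The threshold check described above might look like the following sketch. The `ConfidenceRecord` type and `prepare_for_storage` function are hypothetical names, and returning the plain record when the confidence meets the threshold is an assumption, since the abstract only describes the below-threshold case.

```python
from dataclasses import dataclass


@dataclass
class ConfidenceRecord:
    """A data record bundled with its data confidence value (hypothetical type)."""
    data: dict
    confidence: float


def prepare_for_storage(record: dict, confidence: float, threshold: float):
    """Wrap the record with its confidence only when it falls below the threshold."""
    if confidence < threshold:
        return ConfidenceRecord(data=record, confidence=confidence)
    # Assumption: records at or above the threshold are stored unchanged.
    return record


low = prepare_for_storage({"temp": 21.5}, confidence=0.4, threshold=0.8)
high = prepare_for_storage({"temp": 21.5}, confidence=0.9, threshold=0.8)
```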
A networked database management system (DBMS) is disclosed. In particular, the disclosed DBMS includes a plurality of nodes, one of which is elected as a designated leader. The designated leader is responsible for maintaining an inventory of coding lines. A coding line spans all nodes in a cluster, and each node is assigned either data or parity. In addition, the designated leader maintains a pattern of parity rotation.
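One possible parity rotation pattern is sketched below. The round-robin rule and the `assign_roles` function are assumptions for illustration only; the abstract states that a pattern is maintained but does not specify it.

```python
def assign_roles(line_index: int, num_nodes: int, num_parity: int = 1) -> list[str]:
    """Return the role ("data" or "parity") of every node for one coding line.

    Assumed pattern: parity starts at node (line_index mod num_nodes) and
    occupies num_parity consecutive nodes, wrapping around the cluster.
    """
    if not 0 < num_parity < num_nodes:
        raise ValueError("need at least one parity node and one data node")
    start = line_index % num_nodes
    parity_nodes = {(start + k) % num_nodes for k in range(num_parity)}
    return ["parity" if n in parity_nodes else "data" for n in range(num_nodes)]


# Roles for the first four coding lines of a four-node cluster.
inventory = [assign_roles(i, num_nodes=4) for i in range(4)]
```

Rotating the parity position across coding lines spreads parity writes over all nodes instead of concentrating them on one.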
09 - Scientific and electric apparatus and instruments
Goods & Services
Computer peripherals; computer data storage devices, namely, computer disk drives, optical disc drives, blank digital data storage media, blank data storage disks; computer hardware integrated with downloadable and recorded computer operating software; computer hardware with preinstalled computer operating software; computer servers; computer data processors and computer memories, microprocessors, central processing units, circuit boards, and integrated circuits; downloadable and recorded computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications, the internet, and cloud systems, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; downloadable and recorded database software to manage, monitor, track and organize data; downloadable and recorded computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, cloud software, computer software to automate data warehousing, operating system software, computer utility software, computer networking software, cloud computing software; downloadable and recorded computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, scientific research, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; downloadable and recorded computer operating software; downloadable and recorded computer operating systems; downloadable and recorded computer software development tools; downloadable and recorded computer software for application and database 
integration; downloadable and recorded computer software for creating searchable databases of information and data; downloadable and recorded computer software for providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; downloadable and recorded computer software for the field of warehousing and distribution, to manage transactional data, provide statistical analysis, and produce notifications and reports; Computer storage devices, namely, high-speed storage subsystems for storage and backup of electronic data either locally or via a telecommunications network; Electronic data processing apparatus; downloadable and recorded computer programs for use in large volume data storage, access and analysis; downloadable and recorded computer programs for use in management of large volume databases; downloadable and recorded computer software for use in database management; downloadable and recorded computer software for use in large volume database management; downloadable and recorded computer software and hardware for management of large volume databases; downloadable and recorded computer software and hardware for fast read access from large volume databases; downloadable and recorded computer software for creating and managing large volume databases; downloadable and recorded computer software for fast read access from large volume databases; downloadable and recorded database management software for large volume databases; downloadable and recorded database management software for fast read from large volume databases
48.
Coordinating main memory access of a plurality of sets of threads
A computing device includes a plurality of nodes, where a first node operates in accordance with a computing device operation system (OS) and remaining nodes operate in accordance with a custom OS. The remaining nodes include a plurality of sets of processing core resources that process a plurality of sets of threads of an application. The computing device also includes a main memory divided into a computing device memory section and a custom memory section that includes portions logically allocated as a plurality of buffers. The computing device also includes a memory access control module operable to coordinate access to the plurality of buffers by at least some of the plurality of sets of threads in accordance with the custom OS. The computing device also includes disk memory and a disk memory access control module operable to coordinate access to the disk memory in accordance with the computing device OS.
A method includes receiving, by a first computing entity of a database system, a query request that is formatted in accordance with a generic query format. The method further includes generating, by the first computing entity, an initial query plan based on the query request and a query instruction set. The method further includes determining, by the first computing entity, storage parameters. The method further includes determining, by the first computing entity, processing resources for processing the query request based on the storage parameters. The method further includes generating, by the first computing entity, an optimized query plan from the initial query plan based on the storage parameters, the processing resources, and optimization tools. The method further includes sending, by the first computing entity, the optimized query plan to a second computing entity for distribution and execution of the optimized query plan.
G06F 7/00 - Methods or arrangements for processing data by operating upon the order or content of the data handled
G06F 7/08 - Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
A method includes receiving, by a first computing entity of a database system, data that is organized in rows and columns. The method further includes determining, by the first computing entity, one or more key columns from the columns based on a desired sort criteria for the data. The method further includes sorting, by the first computing entity, other columns of the columns based on the one or more key columns to produce sorted other columns. The method further includes sending, by the first computing entity, the one or more key columns to a second computing entity of the database system for storage in a first storage location associated with the second computing entity. The method further includes sending, by the first computing entity, the sorted other columns to the second computing entity for storage in a second storage location associated with the second computing entity.
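The column handling above can be sketched as follows. The sketch assumes that the key columns are reordered together with the other columns and that rows compare by the tuple of key values; `split_and_sort` is a hypothetical name.

```python
def split_and_sort(columns: dict[str, list], key_names: list[str]):
    """Sort all columns by the key columns; return (key columns, other columns)."""
    n_rows = len(next(iter(columns.values())))
    # Row order is determined only by the values in the key columns.
    order = sorted(range(n_rows), key=lambda i: tuple(columns[k][i] for k in key_names))
    key_cols = {k: [columns[k][i] for i in order] for k in key_names}
    other_cols = {
        name: [values[i] for i in order]
        for name, values in columns.items()
        if name not in key_names
    }
    return key_cols, other_cols


table = {"ts": [3, 1, 2], "value": ["c", "a", "b"]}
keys, others = split_and_sort(table, ["ts"])
```

The two returned dictionaries correspond to what would be sent to the first and second storage locations, respectively.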
G06F 16/901 - Indexing; Data structures therefor; Storage structures
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
51.
Generation of an optimized query plan in a database system
A method includes receiving, by a first computing entity of a database system, a query request that is formatted in accordance with a generic query format. The method further includes generating, by the first computing entity, an initial query plan based on the query request and a query instruction set. The method further includes determining, by the first computing entity, storage parameters. The method further includes determining, by the first computing entity, processing resources for processing the query request based on the storage parameters. The method further includes generating, by the first computing entity, an optimized query plan from the initial query plan based on the storage parameters, the processing resources, and optimization tools. The method further includes sending, by the first computing entity, the optimized query plan to a second computing entity for distribution and execution of the optimized query plan.
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/901 - Indexing; Data structures therefor; Storage structures
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
H04L 67/10 - Protocols in which an application is distributed across nodes in the network
52.
Storing data in a data section and parity in a parity section of computing devices
A method includes generating, by a processing entity of a computing system, a plurality of parity blocks from a plurality of lines of data blocks. A first number of parity blocks of the plurality of parity blocks is generated from a first line of data blocks of the plurality of lines of data blocks. The method further includes storing, by the processing entity, the plurality of lines of data blocks in data sections of memory of a cluster of computing devices of the computing system in accordance with a read/write balancing pattern and a restricted file system. The method further includes storing, by the processing entity, the plurality of parity blocks in parity sections of memory of the cluster of computing devices in accordance with the read/write balancing pattern and the restricted file system.
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
G06F 3/06 - Digital input from, or digital output to, record carriers
G06F 12/0893 - Caches characterised by their organisation or structure
G06F 16/17 - File systems; File servers - Details of further file system functions
G06F 11/10 - Adding special bits or symbols to the coded information, e.g. parity check, casting out nines or elevens
G06F 12/109 - Address translation for multiple virtual address spaces, e.g. segmentation
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
A method includes receiving, by a host computing device of a storage cluster of computing devices, a segment group of data. The method further includes processing, by the host computing device, the segment group of data to produce data segments. The method further includes sending, by the host computing device, the data segments to the computing devices of the storage cluster. The method further includes allocating, by a host node of the first computing device, data segment divisions of the first data segment to nodes of the first computing device. The method further includes allocating, by a host processing core resource of the first node, data segment sub-divisions of the first data segment division to processing core resources of the first node. The method further includes storing, by the first computing device, the first data segment having the data segment divisions and the first data segment division having the data segment sub-divisions.
H03M 7/30 - Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
G06F 16/2457 - Query processing with adaptation to user needs
G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
A method includes a computing device receiving a sort request regarding data of a table. The method further includes the computing device determining probability traits of the data. The method further includes the computing device dividing the sort request into sub-sort requests based on the probability traits. The method further includes the computing device identifying processing core resources to perform the sub-sort requests based on the probability traits. The method further includes the computing device allocating the sub-sort requests to the identified processing core resources in accordance with the probability traits. The method further includes the computing device allocating data portions to the identified processing core resources in accordance with the probability traits. The method further includes the identified processing core resources executing allocated sub-sort requests on corresponding divided data portions to produce sorted data portions. The method further includes the computing device joining the sorted data portions.
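A rough sketch of the divide, sub-sort and join flow follows. It interprets "probability traits" as an estimated value distribution taken from a sample, which is an assumption; threads stand in for processing core resources, and `parallel_sort` is a hypothetical name.

```python
from bisect import bisect_right
from concurrent.futures import ThreadPoolExecutor


def parallel_sort(data: list, workers: int = 4) -> list:
    """Range-partition data using sampled boundaries, sort portions, then join."""
    if workers < 2 or len(data) < workers:
        return sorted(data)
    # Estimate the value distribution from roughly 100 evenly spaced samples.
    sample = sorted(data[:: max(1, len(data) // 100)])
    boundaries = [sample[len(sample) * i // workers] for i in range(1, workers)]
    # Allocate each value to the portion whose value range contains it.
    portions = [[] for _ in range(workers)]
    for value in data:
        portions[bisect_right(boundaries, value)].append(value)
    # Each worker executes one sub-sort on its own portion.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        sorted_portions = list(pool.map(sorted, portions))
    # Portions cover increasing value ranges, so joining is concatenation.
    return [value for portion in sorted_portions for value in portion]


data = [(i * 7919) % 1000 for i in range(1000)]
result = parallel_sort(data)
```

Choosing boundaries from the estimated distribution keeps the portions similar in size, so no single worker receives most of the data.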
G06F 7/08 - Sorting, i.e. grouping record carriers in numerical or other ordered sequence according to the classification of at least some of the information they carry
G06N 7/00 - Computing arrangements based on specific mathematical models
G06F 9/50 - Allocation of resources, e.g. of the central processing unit [CPU]
G06F 7/24 - Sorting, i.e. extracting data from one or more carriers, re-arranging the data in numerical or other ordered sequence, and re-recording the sorted data on the original carrier or on a different carrier or set of carriers
G06N 20/10 - Machine learning using kernel methods, e.g. support vector machines [SVM]
55.
System and method for optimizing large database management systems with multiple optimizers
A large highly parallel database management system includes thousands of nodes storing a huge volume of data. The database management system includes multiple query optimizers for determining low cost execution plans for queries. The database management system is adapted to receive a data query. An execution plan generator component of the database management system generates an initial execution plan for the query. The initial execution plan is fed as input to more than one query optimizer. Each optimizer starts with the initial execution plan, generates alternative execution plans, and determines a satisfactory execution plan that incurs the lowest cost. The database management system compares the execution plans selected by the optimizers and selects the one with the lowest cost. The multiple query optimizers run in parallel.
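The selection among parallel optimizers can be sketched as below. The plan representation (a tuple of table names), the toy cost model, and the two optimizers are all hypothetical stand-ins; only the overall shape (same initial plan in, lowest-cost plan out) follows the abstract.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical row counts used by the toy cost model.
TABLE_ROWS = {"orders": 1000, "customers": 100, "regions": 10}


def toy_cost(plan: tuple) -> int:
    """Toy cost: tables processed earlier weigh more, so small-first is cheaper."""
    return sum(TABLE_ROWS[t] * (len(plan) - i) for i, t in enumerate(plan))


def keep_order(plan: tuple):
    """Optimizer that returns the initial plan unchanged."""
    return plan, toy_cost(plan)


def smallest_first(plan: tuple):
    """Optimizer that orders tables by ascending row count."""
    candidate = tuple(sorted(plan, key=TABLE_ROWS.__getitem__))
    return candidate, toy_cost(candidate)


def pick_cheapest_plan(initial_plan: tuple, optimizers: list):
    """Run every optimizer on the same initial plan in parallel; keep the cheapest."""
    with ThreadPoolExecutor(max_workers=len(optimizers)) as pool:
        candidates = list(pool.map(lambda opt: opt(initial_plan), optimizers))
    return min(candidates, key=lambda c: c[1])


best_plan, best_cost = pick_cheapest_plan(
    ("orders", "customers", "regions"), [keep_order, smallest_first]
)
```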
A large highly parallel database management system includes thousands of nodes storing a huge volume of data. The database management system includes a query optimizer for optimizing data queries. The optimizer estimates the column cardinality of a set of rows based on estimated column cardinalities of disjoint subsets of the set of rows. For a particular column, the actual column cardinality of the set of rows is the sum of the actual column cardinalities of the two subsets of rows. The optimizer creates two respective Bloom filters from the two subsets, and then combines them to create a combined Bloom filter using logical OR operations. The actual column cardinality of the set of rows is estimated using a computation from the combined Bloom filter.
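A minimal sketch of the Bloom filter combination follows. The abstract does not give the estimation formula; this sketch assumes the commonly used estimate n ≈ -(m/k) ln(1 - X/m), where m is the filter size in bits, k the number of hash functions and X the number of set bits. The filter parameters and function names are hypothetical.

```python
import hashlib
import math

M_BITS = 4096   # filter size in bits (assumed)
K_HASHES = 3    # hash functions per value (assumed)


def bloom_from(values, m: int = M_BITS, k: int = K_HASHES) -> int:
    """Build a Bloom filter, stored as an integer bit set, from column values."""
    bits = 0
    for value in values:
        for seed in range(k):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            bits |= 1 << (int.from_bytes(digest[:8], "big") % m)
    return bits


def estimate_cardinality(bits: int, m: int = M_BITS, k: int = K_HASHES) -> float:
    """Estimate the number of distinct values from the count of set bits."""
    set_bits = bin(bits).count("1")
    if set_bits >= m:
        raise ValueError("filter is saturated; use a larger m")
    return -(m / k) * math.log(1 - set_bits / m)


subset_a = bloom_from(range(0, 300))
subset_b = bloom_from(range(200, 500))
combined = subset_a | subset_b          # logical OR of the two filters
estimate = estimate_cardinality(combined)
```

Because the OR of two filters equals the filter built from the combined values, the estimate does not double count a value that appears in both subsets.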
A large highly parallel database system includes thousands of nodes storing a huge volume of data. The database system includes query optimizers for determining low cost execution plans for queries. The optimizers operate at a lower layer and analyze data to determine column cardinality and column distribution for reducing the amount of data to read from storage drives of the nodes, and the computation required on the data. The reduced amount of data read and the reduced amount of computation required decrease the elapsed time to execute a query. The optimizers also determine a satisfactory execution plan for a query using particle swarm optimization. Moreover, the optimizers consider the node computer configurations and network throughput in selecting a satisfactory execution plan. Multiple optimizers are also used in parallel to select a satisfactory execution plan. The optimizers also optimize SQL statements integrating with data modeling to allow optimizing data access during data modeling.
Time based data is sequentially stored in a highly parallel database system comprising a cluster of nodes receiving the same data queries. Each node includes a set of silo systems. Each silo system includes one or more hybrid indexing tables and a tiered hierarchical query and data process system. The tiered hierarchical system includes a dispatcher thread, a set of search threads, a set of aggregation threads in respective tiers and a drive access thread. The dispatcher thread breaks a unit of work into a batch of work units and associates them with search threads. A search thread breaks a work unit into a batch of subwork units, and provides a data read request for each subwork unit to the drive read thread. The drive read thread fetches data from a drive and causes the associated aggregation thread to process the data. Processed data is then merged by the search thread.
Data records and associated data confidence of data in each data record are both stored in a database system. Each data record includes a data confidence. The data confidence indicates an accuracy or reliability level of the data of the corresponding data record. The data records with confidence information are constructed in memory before they are stored into the database system. When the data records are retrieved from the database for analysis, the data confidence is retrieved as well. The analysis of data contained in the data records further considers the data confidence. The analysis result is thus affected by the data confidence.
Analyzing large datasets requires prioritization of analytical calculations to reduce analysis time and resource requirements. The prioritization process includes determining characteristics of a dataset, determining a list of analytical calculations, prioritizing the list of analytical calculations based on the dataset characteristics and characteristics of the analytical calculations, selecting the highest ranked analytical calculation, and applying the selected calculation on the dataset. The prioritization process of a new calculation based on a known set of calculations includes ranking the set of calculations based on their result scores from execution on a given dataset, comparing the new calculation to the known set of calculations, determining a similar calculation, and assigning the rank of the similar calculation to the new calculation.
A massively parallel database management system includes an index store and a payload store including a set of storage systems of different temperatures. The index store and the storage system each include a list of clusters. Each cluster includes a set of nodes with storage devices forming a group of segments. Nodes and clusters are connected over high speed links. Each cluster receives data and splits the data into data rows based on a predetermined size. The data rows are randomly and evenly distributed between all nodes of the cluster.
G06F 16/00 - Information retrieval; Database structures therefor; File system structures therefor
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching
G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
H04L 67/1097 - Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 3/06 - Digital input from, or digital output to, record carriers
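The row splitting and distribution described in the abstract above can be sketched as follows. The sketch assumes one way of being both random and even: shuffle the rows, then deal them out round-robin so node counts differ by at most one. The function names, node identifiers, and byte-based row size are hypothetical and not taken from the source.

```python
import random


def split_into_rows(data: bytes, row_size: int):
    """Split incoming data into rows of a predetermined size."""
    if row_size <= 0:
        raise ValueError("row_size must be positive")
    return [data[i:i + row_size] for i in range(0, len(data), row_size)]


def distribute(rows, node_ids, seed=None):
    """Assign rows to nodes randomly, keeping per-node counts balanced."""
    if not node_ids:
        raise ValueError("at least one node is required")
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)  # random placement
    placement = {node: [] for node in node_ids}
    for i, row in enumerate(shuffled):
        # Round-robin dealing keeps the distribution even.
        placement[node_ids[i % len(node_ids)]].append(row)
    return placement


data = bytes(range(10))
rows = split_into_rows(data, row_size=3)  # last row may be shorter
placement = distribute(rows, ["node-a", "node-b", "node-c"], seed=7)
```

Shuffling before dealing is a design choice for this example: purely independent random assignment would be even only on average, while this variant bounds the imbalance.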
62.
Data transition in highly parallel database management system
A massively parallel database management system includes an index store and a payload store including a set of storage systems of different temperatures. Both stores include a list of clusters. Each cluster includes a set of nodes with storage devices forming a group of segments. Nodes and clusters are connected over high-speed links. The list of clusters within the payload store includes clusters of different temperatures. The payload store transitions data of a segment group in a higher temperature cluster to a segment group in a lower temperature cluster in parallel. A node moves data of a segment in the higher temperature cluster to a corresponding node's segment in the lower temperature cluster. Once the data is written in the destination segment in the lower temperature cluster, the source segment is freed to store other data. The temperatures include blazing, hot, warm and cold.
G06F 16/00 - Information retrieval; Database structures therefor; File system structures therefor
G06F 16/27 - Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
G06F 16/22 - Indexing; Data structures therefor; Storage structures
G06F 16/25 - Integrating or interfacing systems involving database management systems
G06F 16/28 - Databases characterised by their database models, e.g. relational or object models
G06F 13/16 - Handling requests for interconnection or transfer for access to memory bus
G06F 13/28 - Handling requests for interconnection or transfer for access to input/output bus using burst mode transfer, e.g. direct memory access, cycle steal
G06F 13/42 - Bus transfer protocol, e.g. handshake; Synchronisation
G06F 3/06 - Digital input from, or digital output to, record carriers
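The parallel transition described in the abstract of entry 62 can be sketched as follows. Each cluster is modeled as a list of per-node dictionaries mapping a segment identifier to its data, and a thread pool stands in for nodes working in parallel. These structures and names are assumptions for illustration only, not the disclosed implementation.

```python
from concurrent.futures import ThreadPoolExecutor

# Temperature tiers named in the abstract, ordered hottest to coldest.
TEMPERATURES = ["blazing", "hot", "warm", "cold"]


def next_colder(temperature):
    """Return the tier one step colder than the given one."""
    index = TEMPERATURES.index(temperature)
    if index == len(TEMPERATURES) - 1:
        raise ValueError("cold is already the lowest temperature")
    return TEMPERATURES[index + 1]


def move_segment(source_node, dest_node, segment_id):
    """Write the segment to the destination, then free the source."""
    dest_node[segment_id] = source_node[segment_id]
    del source_node[segment_id]  # freed only after the write has completed


def transition_segment_group(source_cluster, dest_cluster, segment_id):
    """Move one segment group, node by node, to a colder cluster in parallel."""
    if len(source_cluster) != len(dest_cluster):
        raise ValueError("clusters must have the same number of nodes")
    if not source_cluster:
        return
    with ThreadPoolExecutor(max_workers=len(source_cluster)) as pool:
        futures = [
            pool.submit(move_segment, src, dst, segment_id)
            for src, dst in zip(source_cluster, dest_cluster)
        ]
        for future in futures:
            future.result()  # re-raise any error from a worker


hot = [{"seg-1": b"a"}, {"seg-1": b"b"}, {"seg-1": b"c"}]
warm = [{}, {}, {}]
transition_segment_group(hot, warm, "seg-1")
```

After the call, each node in the hypothetical `warm` cluster holds the segment that its counterpart in `hot` held, and the source segments are free for other data.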
42 - Scientific, technological and industrial services, research and design
Goods & Services
Application service provider (ASP), namely, hosting of computer software; leasing and rental of computer hardware and computer peripherals; leasing computer hardware and computer peripherals; computer services, namely, providing temporary use of non-downloadable computer software for use in database management; computer services, namely, providing computer software and platforms for database management of time based data; providing an on-line computer database in the field of time based data; data as a service (DAAS) services featuring software for data analytics and tracking of time based data; technical support services, namely, remote and on-site infrastructure management services for monitoring, administration and management of public and private cloud computing IT and application systems; computer services, namely, providing cloud computing featuring a full line of nondownloadable computer software to manage, analyze, retrieve, monitor, maintain, report on, structure, model, forecast, present and display data and information from computer databases, applications and the internet, and for the development, analysis, management, integration, deployment, virtualization and maintenance of computer software and hardware; providing temporary use of non-downloadable database software to manage, monitor, track and organize data; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data, namely, web services software, application server software, business intelligence software, internet and intranet portal software, computer software to automate data warehousing, content management software, operating system software, computer utility software, computer networking software, cloud computing software, computer networks, and global communications networks; providing temporary use of non-downloadable computer software applications to manage, monitor, track and organize data in the fields of marketing, sales, customer service, contracts, human resources, clinical research, health care, health sciences, education, communications and telecommunications, financial transaction processing, analysis and management, simulation, enterprise and resource planning; Data automation and collection service using proprietary software to evaluate, analyze and collect service data; Data mining; Data warehousing; Database development services; Design of computer database; Development and creation of computer programs for data processing; Development, updating and maintenance of software and database systems; Maintenance of on-line databases for others; Application service provider featuring software for use in providing an on-line database in the field of transaction processing to upload transactional data, provide statistical analysis, and produce notifications and reports; Application service provider (ASP), namely, hosting computer software applications of others; Application service provider (ASP) featuring software for use in data management; Application service provider featuring application programming interface (API) software for data management; Application service provider, namely, hosting, managing, developing, analyzing, and maintaining applications, software and web sites of others in the fields of data management; Computer programming services for data management; Computer services, namely, hosting an interactive web site that allows data management; Computer software development in the field of data management; Providing a website that gives computer users the ability to manage data