Oxylabs, UAB

Lithuania

Back to Profile

1-100 of 108 for Oxylabs, UAB Sort by
Query
Aggregations
Jurisdiction
        United States 92
        Canada 10
        World 6
Date
New (last 4 weeks) 1
2025 April 2
2025 March 2
2025 (YTD) 4
2024 15
See more
IPC Class
G06F 16/951 - IndexingWeb crawling techniques 31
H04L 9/40 - Network security protocols 24
H04L 67/56 - Provisioning of proxy services 23
H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP] 17
H04L 29/08 - Transmission control procedure, e.g. data link level control procedure 16
See more
Status
Pending 28
Registered / In Force 80
Found results for  patents
  1     2        Next Page

1.

Generating a path to a document element using machine learning

      
Application Number 18830361
Grant Number 12287837
Status In Force
Filing Date 2024-09-10
First Publication Date 2025-04-29
Grant Date 2025-04-29
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Kluonaitis, Karolis
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Disclosed herein are system, method, and computer program product embodiments for improving web scraping technology by using machine learning to generate parsing expressions. A system receives a request to identify an element in a first document at a target web page. The system downloads and modifies the first document by adding an index value as an attribute to a tag for the element. A query is submitted to a large language model (LLM), including the modified first document, a description of the element, and a request asking the LLM to identify the element based on the description. The system obtains, from the LLM, the index value assigned to the element. The system generates an expression defining a path to the element in the first document using the index returned by the large language model. The system downloads a second document, and parses data of a second element using the expression.

IPC Classes  ?

2.

Utilizing Database Management and Throttling During Proxy Web Scraping

      
Application Number 18933751
Status Pending
Filing Date 2024-10-31
First Publication Date 2025-04-17
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruškevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

3.

SELECTING MANAGED PROXY IP DEVICES BY GEOGRAPHICAL COORDINATES THEREOF

      
Application Number 18371349
Status Pending
Filing Date 2023-09-21
First Publication Date 2025-03-27
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Stundzia, Paulius
  • Kozlovski, Miroslav

Abstract

This invention discloses selecting a proxy IP device (Proxy-ICDAPIA) by its geographical coordinates and distance from the target web service. A method and system are disclosed, allowing users of the proxy service to specify their preferred Proxy-ICDAPIAs by geographic coordinates in the user's request. Embodiments describe the Proxy-ICDAPIA selection using Geohash areas and strings, also, selecting Proxy-ICDAPIAs from circular geographical areas specified by center coordinates X, Y and radius R. The solution serves for selecting Proxy-ICDAPIAs efficiently in cities and metropolitan areas where geographic density of web services is high, and selecting Proxy-ICDAPIAs by country/city may be not sufficient. Another problem solved when a client uses Proxy-ICDAPIAs selection by coordinates, is an inability to provide a country code. The aforementioned functionality does work without a specified country code, some countries are not included in the pool used when no parameters are provided. This solution allows creating certain length GeoHash pools, then encoding the client's provided coordinates into its own GeoHash to determine the pool (or pools) containing Proxy-ICDAPIAs that match the specified coordinates requirements.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route
  • H04L 67/141 - Setup of application sessions

4.

ADAPTIVE DATA COLLECTION OPTIMIZATION

      
Application Number 18953591
Status Pending
Filing Date 2024-11-20
First Publication Date 2025-03-06
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
  • G06F 11/34 - Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation
  • G06F 16/951 - IndexingWeb crawling techniques

5.

STREAMING PROXY SERVICE

      
Application Number 18807554
Status Pending
Filing Date 2024-08-16
First Publication Date 2024-12-05
Owner Oxylabs, UAB (Lithuania)
Inventor Stalioraitis, Giedrius

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

  • H04N 21/222 - Secondary servers, e.g. proxy server or cable television Head-end
  • H04L 9/40 - Network security protocols
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 69/22 - Parsing or analysis of headers

6.

OPTIMIZING SCRAPING REQUESTS THROUGH BROWSING PROFILES

      
Application Number 18778065
Status Pending
Filing Date 2024-07-19
First Publication Date 2024-11-07
Owner OXYLABS, UAB (Lithuania)
Inventor Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a request for a web crawler to be enriched with a customized browsing profile in order to be categorized as an organic human user to obtain targeted content. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include at least some of the following exemplary steps: receiving and examining the parameters of a request received from a User's Device, enriching the request parameters with a pre-established browsing profile, sending the enriched request to a Target through the selected Proxy, receiving a response from the Target, dissecting the response's metadata that is appropriate for updating the browsing profile utilized for the request, and forwarding the data to the User's device pursuant to the examination of the response obtained from the Target system.

IPC Classes  ?

  • G06F 16/9535 - Search customisation based on user profiles and personalisation
  • G06F 16/951 - IndexingWeb crawling techniques

7.

ACCESS CONTROL POLICY FOR PROXY SERVICES

      
Application Number 18758159
Status Pending
Filing Date 2024-06-28
First Publication Date 2024-10-24
Owner Oxylabs UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Ciuta, Gytis
  • Karasevic, Valdemar

Abstract

Systems and methods to manage and efficiently perform authorization of multiple proxy clients are disclosed. Furthermore, systems and methods to measure and check whether the web traffic of one or more client devices has reached a permissible limit of web traffic assigned by the proxy service provider. Specifically, a proxy is configured to gather and save authorization information of one or more clients within its memory. Therefore, the proxy server can verify and authorize one or more clients by utilizing the data from its memory. Furthermore, the proxy is configured to measure and report the utilized web traffic of one or more client devices to a messaging platform. In another aspect, systems and methods to check whether one or more client devices have reached a permissible amount of web traffic limit are disclosed.

IPC Classes  ?

8.

METHODS AND SYSTEMS FOR IMPLEMENTING A REGIONALLY CONTIGUOUS PROXY SERVICE

      
Application Number 18759298
Status Pending
Filing Date 2024-06-28
First Publication Date 2024-10-24
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 43/0852 - Delays
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/01 - Protocols
  • H04L 67/306 - User profiles
  • H04L 67/52 - Network services specially adapted for the location of the user terminal

9.

NETWORK COORDINATION BETWEEN PROXY SERVERS

      
Application Number 18617565
Status Pending
Filing Date 2024-03-26
First Publication Date 2024-07-18
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for coordinating network connectivity and communication between proxy servers, exit-nodes and client modules are disclosed. In one aspect, proxy-nodes in a proxy infrastructure accept connections with exit-nodes based on geographical proximity or proxy-node metrics. Further, a proxy-node can communicate and instruct another proxy-node to service the client request via a suitable exit-node. Further still, a proxy-node can communicate and instruct proxy-node to redirect a suitable exit-node towards the first proxy-node in order to service the client request. In another aspect, the proxy-infrastructure enables client modules to connect to proxy-nodes based on geographical proximity, client parameters, and client's behavioral informatics. In yet another aspect, proxy infrastructure enables a proxy node to redirect exit-nodes to a different proxy-node in the event of a) system overload or resource exhaustion, b) graceful shutdown c) erroneous network connection between exit-nodes and the proxy-node.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 61/5007 - Internet protocol [IP] addresses
  • H04L 101/618 - Details of network addresses

10.

ADVANCED DATA COLLECTION BLOCK IDENTIFICATION

      
Application Number 18619453
Status Pending
Filing Date 2024-03-28
First Publication Date 2024-07-18
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Systems and methods that allow examination of response data collected from content providers and provide for classification and routing according to the classification. The process of classification employs an unsupervised, or partially unsupervised, Machine Learning classifier model for identifying data collection responses that contains no data, mangled data, or a block, for assigning a classification correspondingly and for feeding the classification decision back to a data collection platform.

IPC Classes  ?

  • G06F 18/214 - Generating training patternsBootstrap methods, e.g. bagging or boosting
  • G06F 18/2411 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
  • G06F 18/2415 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
  • G06F 18/243 - Classification techniques relating to the number of classes
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06N 5/025 - Extracting rules from data
  • G06N 20/00 - Machine learning
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 21/57 - Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
  • H04L 9/40 - Network security protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

11.

TRAFFIC SERVICE THREADS FOR LARGE POOLS OF NETWORK ADDRESSES

      
Application Number 18431584
Status Pending
Filing Date 2024-02-02
First Publication Date 2024-05-30
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Dagys, Gediminas
  • Vitkus, Paulius

Abstract

Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.

IPC Classes  ?

  • H04L 45/74 - Address processing for routing
  • H04L 65/1045 - Proxies, e.g. for session initiation protocol [SIP]
  • H04L 65/1069 - Session establishment or de-establishment
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/562 - Brokering proxy services

12.

Token-based authentication for a proxy web scraping service

      
Application Number 18535911
Grant Number 12169530
Status In Force
Filing Date 2023-12-11
First Publication Date 2024-05-30
Grant Date 2024-12-17
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruškevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

13.

Methods and systems to maintain multiple persistent channels between proxy servers

      
Application Number 18424407
Grant Number 12294628
Status In Force
Filing Date 2024-01-26
First Publication Date 2024-05-23
Grant Date 2025-05-06
Owner Oxylabs, UAB (Lithuania)
Inventor Dagys, Gediminas

Abstract

Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support WebSocket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple WebSocket communication channels with each other. By utilizing the said WebSocket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses. Thus, by exchanging data via the said WebSocket communication channels, the proxy supernodes and the exit proxy servers are aimed at servicing the proxy clients in processing their data requests.

IPC Classes  ?

  • H04L 67/141 - Setup of application sessions
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/56 - Provisioning of proxy services

14.

DATA EXTRACTION CYCLES WITH MULTIPLE PARSING FRAMES

      
Application Number 17954008
Status Pending
Filing Date 2022-09-27
First Publication Date 2024-03-28
Owner Oxylabs, UAB (Lithuania)
Inventor Malinauskas, Tadas

Abstract

A parsing facility within a service provider infrastructure can navigate through source documents of target web pages and mine a specific list of target data by utilizing multiple parsing frames received from an external computing resource and/or system. The parsing facility receives a series of a plurality of parsing frames at random intermittent intervals. The parsing facility can store each of the plurality of parsing frames within its internal storage and learns the differences between each of the plurality of parsing frames. After learning the differences, the parsing facility can recognize appropriate parsing frames to locate and mine each target data from the source documents. The parsing facility can mine data from source documents by using each of the plurality of parsing frames for every mining cycle, thereby effectively managing the reception and usage of multiple parsing frames without any errors or faults.

IPC Classes  ?

  • G06F 16/2458 - Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries

15.

USING A GRAPH OF REDIRECTS TO IDENTIFY MULTIPLE ADDRESSES REPRESENTING A COMMON WEB PAGE

      
Application Number 17950962
Status Pending
Filing Date 2022-09-22
First Publication Date 2024-03-28
Owner Oxylabs UAB (Lithuania)
Inventor Barzdzius, Tadas

Abstract

Embodiments relate to scraping web content. When scraping data, the target website sometimes redirects to different URLs within its domain. The different URLs represent the same context. Embodiments use a graph ontology to identify which redirected URLs represent the same page.

IPC Classes  ?

  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

16.

TRANSMITTING REQUEST AND RESPONSE INFORMATION THROUGH DIFFERENT PROXIES

      
Application Number 18139822
Status Pending
Filing Date 2023-04-26
First Publication Date 2024-03-07
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Petruskevicius, Arnas
  • Pilkauskas, Valdas

Abstract

Systems and methods to manage and efficiently implement functional proxy services are disclosed. In the proxy services, a single instance of exit-node is connected to at least two or multiple supernodes at any given time. One of the plurality of supernodes is configured to ping and send diagnostic requests to the connected exit-node through a network. The exit-node is directed to send the pong message and diagnostic response data to a different supernode from among the plurality of supernodes connected to the exit-node. Likewise, a client's request is received by an element of the proxy service provider and forwarded to a specific supernode capable of forwarding the client's request to the exit-node. After performing the client's request, the exit-node returns response data to a different supernode from among the plurality of supernodes connected to the exit-node.

IPC Classes  ?

  • H04L 41/12 - Discovery or management of network topologies

17.

MANAGED EXIT NODES AND THIRD PARTY PROXY PROVIDERS IN A PROXY INFRASTRUCTURE

      
Application Number 18377726
Status Pending
Filing Date 2023-10-06
First Publication Date 2024-02-08
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 41/12 - Discovery or management of network topologies
  • H04L 67/141 - Setup of application sessions
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route
  • H04L 67/1004 - Server selection for load balancing
  • H04L 67/01 - Protocols
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 9/40 - Network security protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/146 - Markers for unambiguous identification of a particular session, e.g. session cookie or URL-encoding
  • H04L 12/46 - Interconnection of networks
  • H04L 12/66 - Arrangements for connecting between networks having differing types of switching systems, e.g. gateways

18.

Adaptive data collection optimization

      
Application Number 18477095
Grant Number 12182219
Status In Force
Filing Date 2023-09-28
First Publication Date 2024-01-25
Grant Date 2024-12-31
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
  • G06F 11/34 - Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation
  • G06F 16/951 - IndexingWeb crawling techniques

19.

Methods and systems to maintain multiple persistent channels between proxy servers

      
Application Number 18132872
Grant Number 11936742
Status In Force
Filing Date 2023-04-10
First Publication Date 2024-01-18
Grant Date 2024-03-19
Owner OXYLABS, UAB (Lithuania)
Inventor Dagys, Gediminas

Abstract

Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support Web Socket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple Web Socket communication channels with each other. By utilizing the said Web Socket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses. Thus, by exchanging data via the said Web Socket communication channels, the proxy supernodes and the exit proxy servers are aimed at servicing the proxy clients in processing their data requests.

IPC Classes  ?

  • H04L 67/141 - Setup of application sessions
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/56 - Provisioning of proxy services

20.

Network coordination between proxy servers

      
Application Number 17845240
Grant Number 12003600
Status In Force
Filing Date 2022-06-21
First Publication Date 2023-12-21
Grant Date 2024-06-04
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for coordinating network connectivity and communication between proxy servers, exit-nodes and client modules are disclosed. In one aspect, the proxy infrastructure enables network connectivity between exit-nodes and proxy-nodes without the need of any proxy-gateways or middleware entities to delegate the connections. Proxy-nodes in the proxy infrastructure accept connections with exit-nodes based on geographical proximity, proxy-node metrics, such as server loads and clients' frequent preferences. Further, a single instance of proxy-node can communicate and instruct another instance of proxy-node, to service the client request via a suitable exit-node. Further still, a single instance of proxy-node can communicate and instruct another instance of proxy-node to redirect a suitable exit-node towards the first proxy-node in order to service the client request. In another instance, the proxy-infrastructure enables client modules to connect to proxy-nodes based on geographical proximity, client parameters, and client's behavioral informatics. In yet another aspect, proxy infrastructure enables a proxy node to redirect exit-nodes to a different proxy—node in the event of a) system overload or resource exhaustion, b) graceful shutdown c) erroneous network connection between exit-nodes and the proxy-node.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 61/5007 - Internet protocol [IP] addresses
  • H04L 67/56 - Provisioning of proxy services
  • H04L 101/618 - Details of network addresses

21.

Optimizing scraping requests through browsing profiles

      
Application Number 18449073
Grant Number 12072946
Status In Force
Filing Date 2023-08-14
First Publication Date 2023-12-07
Grant Date 2024-08-27
Owner OXYLABS, UAB (Lithuania)
Inventor Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a request for a web crawler to be enriched with a customized browsing profile in order to be categorized as an organic human user to obtain targeted content. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include at least some of the following exemplary steps: receiving and examining the parameters of a request received from a User's Device, enriching the request parameters with a pre-established browsing profile, sending the enriched request to a Target through the selected Proxy, receiving a response from the Target, dissecting the response's metadata that is appropriate for updating the browsing profile utilized for the request, and forwarding the data to the User's device pursuant to the examination of the response obtained from the Target system.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 16/9535 - Search customisation based on user profiles and personalisation

22.

Methods and systems for implementing a regionally contiguous proxy service

      
Application Number 18448635
Grant Number 12058224
Status In Force
Filing Date 2023-08-11
First Publication Date 2023-11-30
Grant Date 2024-08-06
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 43/0852 - Delays
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/01 - Protocols
  • H04L 67/306 - User profiles
  • H04L 67/52 - Network services specially adapted for the location of the user terminal
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata

23.

Streaming proxy service

      
Application Number 17659218
Grant Number 12149755
Status In Force
Filing Date 2022-04-14
First Publication Date 2023-10-19
Grant Date 2024-11-19
Owner Oxylabs, UAB (Lithuania)
Inventor Stalioraitis, Giedrius

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

  • H04N 21/222 - Secondary servers, e.g. proxy server or cable television Head-end
  • H04L 9/40 - Network security protocols
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 69/22 - Parsing or analysis of headers

24.

TRANSMITTING REQUEST AND RESPONSE INFORMATION THROUGH DIFFERENT PROXIES

      
Application Number EP2023052926
Publication Number 2023/186381
Status In Force
Filing Date 2023-02-07
Publication Date 2023-10-05
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Pilkauskas, Valdas
  • Petruskevicius, Arnas

Abstract

Systems and methods to manage and efficiently implement functional proxy services are disclosed. In the proxy services, a single instance of exit-node is connected to at least two or multiple supernodes at any given time. One of the plurality of supernodes is configured to ping and send diagnostic requests to the connected exit-node through a network. The exit-node is directed to send the pong message and diagnostic response data to a different supernode from among the plurality of supernodes connected to the exit-node. Likewise, a client's request is received by an element of the proxy service provider and forwarded to a specific supernode capable of forwarding the client's request to the exit-node. After performing the client's request, the exit-node returns response data to a different supernode from among the plurality of supernodes connected to the exit- node.

IPC Classes  ?

  • H04L 67/1008 - Server selection for load balancing based on parameters of servers, e.g. available memory or workload
  • H04L 43/08 - Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
  • H04L 43/50 - Testing arrangements
  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 67/60 - Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
  • H04L 67/1029 - Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers using data related to the state of servers by a load balancer

25.

TRANSMITTING REQUEST AND RESPONSE INFORMATION THROUGH DIFFERENT PROXIES

      
Document Number 03238249
Status Pending
Filing Date 2023-02-07
Open to Public Date 2023-10-05
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Pilkauskas, Valdas
  • Petruskevicius, Arnas

Abstract

Systems and methods to manage and efficiently implement functional proxy services are disclosed. In the proxy services, a single instance of exit-node is connected to at least two or multiple supernodes at any given time. One of the plurality of supernodes is configured to ping and send diagnostic requests to the connected exit-node through a network. The exit-node is directed to send the pong message and diagnostic response data to a different supernode from among the plurality of supernodes connected to the exit-node. Likewise, a client's request is received by an element of the proxy service provider and forwarded to a specific supernode capable of forwarding the client's request to the exit-node. After performing the client's request, the exit-node returns response data to a different supernode from among the plurality of supernodes connected to the exit- node.

IPC Classes  ?

  • H04L 43/08 - Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
  • H04L 43/50 - Testing arrangements
  • H04L 67/1008 - Server selection for load balancing based on parameters of servers, e.g. available memory or workload
  • H04L 67/1029 - Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers using data related to the state of servers by a load balancer
  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 67/60 - Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources

26.

INTERNET PROXY SYSTEM

      
Document Number 03238328
Status Pending
Filing Date 2023-01-30
Open to Public Date 2023-08-17
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

  • H04L 41/0654 - Management of faults, events, alarms or notifications using network fault recovery
  • H04L 67/56 - Provisioning of proxy services

27.

Advanced data collection block identification

      
Application Number 18305438
Grant Number 12086209
Status In Force
Filing Date 2023-04-24
First Publication Date 2023-08-17
Grant Date 2024-09-10
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Systems and methods that allow examination of response data collected from content providers and provide for classification and routing according to the classification. The process of classification employs an unsupervised, or partially unsupervised, Machine Learning classifier model for identifying data collection responses that contains no data, mangled data, or a block, for assigning a classification correspondingly and for feeding the classification decision back to a data collection platform.

IPC Classes  ?

  • G06F 18/214 - Generating training patternsBootstrap methods, e.g. bagging or boosting
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 18/2411 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
  • G06F 18/2415 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
  • G06F 18/243 - Classification techniques relating to the number of classes
  • G06F 21/57 - Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06N 5/025 - Extracting rules from data
  • G06N 20/00 - Machine learning
  • H04L 9/40 - Network security protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

28.

INTERNET PROXY SYSTEM

      
Application Number EP2023052179
Publication Number 2023/151976
Status In Force
Filing Date 2023-01-30
Publication Date 2023-08-17
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 41/0654 - Management of faults, events, alarms or notifications using network fault recovery

29.

Graceful shutdown of supernodes in an internet proxy system

      
Application Number 17958055
Grant Number 11936753
Status In Force
Filing Date 2022-09-30
First Publication Date 2023-08-10
Grant Date 2024-03-19
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 9/40 - Network security protocols
  • H04L 12/46 - Interconnection of networks
  • H04L 12/66 - Arrangements for connecting between networks having differing types of switching systems, e.g. gateways
  • H04L 41/12 - Discovery or management of network topologies
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/01 - Protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/1004 - Server selection for load balancing
  • H04L 67/141 - Setup of application sessions
  • H04L 67/146 - Markers for unambiguous identification of a particular session, e.g. session cookie or URL-encoding
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata

30.

AUTOMATIZED PARSING TEMPLATE CUSTOMIZER

      
Application Number EP2022081884
Publication Number 2023/131434
Status In Force
Filing Date 2022-11-15
Publication Date 2023-07-13
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kuksta, Andrius
  • Juravicius, Martynas

Abstract

Systems and methods to intelligently adapt parsing rules according to the layout changes occurring in multiple targets are disclosed. Specifically, the disclosure provides a solution to detect the layout changes in a target domain and to update parsing templates or parsing rules. The disclosed embodiments in one aspect describe methods and systems to receive and store parsing templates or parsing rules and monitoring tables or a list of related URLs within an internal storage facility. Methods and systems to scrape and parse data by following parsing rules or using parsing templates. The methods and systems describe the manner in which the parsed data and the actual data are analyzed to detect any changes in the layout of the target domain(s). The methods and systems give details on how to decide whether to update parsing rules or parsing templates depending on the layout changes in the target domains.

IPC Classes  ?

31.

Adaptive data collection optimization

      
Application Number 18181742
Grant Number 11809509
Status In Force
Filing Date 2023-03-10
First Publication Date 2023-07-06
Grant Date 2023-11-07
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 11/34 - Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation

32.

AUTOMATIZED PARSING TEMPLATE CUSTOMIZER

      
Application Number 17570181
Status Pending
Filing Date 2022-01-06
First Publication Date 2023-07-06
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kuksta, Andrius
  • Juravicius, Martynas

Abstract

Systems and methods to intelligently adapt parsing rules according to the layout changes occurring in multiple targets are disclosed. Specifically, the disclosure provides a solution to detect the layout changes in a target domain and to update parsing templates or parsing rules. The disclosed embodiments in one aspect describe methods and systems to receive and store parsing templates or parsing rules and monitoring tables or a list of related URLs within an internal storage facility. Methods and systems to scrape and parse data by following parsing rules or using parsing templates. The methods and systems describe the manner in which the parsed data and the actual data are analyzed to detect any changes in the layout of the target domain(s). The methods and systems give details on how to decide whether to update parsing rules or parsing templates depending on the layout changes in the target domains.

IPC Classes  ?

  • G06F 40/221 - Parsing markup language streams
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
  • G06F 40/186 - Templates
  • G06F 40/14 - Tree-structured documents

33.

Access control policy for proxy services

      
Application Number 17875813
Grant Number 12063216
Status In Force
Filing Date 2022-07-28
First Publication Date 2023-06-29
Grant Date 2024-08-13
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Ciuta, Gytis
  • Karasevic, Valdemar

Abstract

Systems and methods to manage and efficiently perform authorization of multiple proxy clients are disclosed. Furthermore, systems and methods to measure and check whether the web traffic of one or more client devices has reached a permissible limit of web traffic assigned by the proxy service provider. Specifically, a proxy is configured to gather and save authorization information of one or more clients within its memory. Therefore, the proxy server can verify and authorize one or more clients by utilizing the data from its memory. Furthermore, the proxy is configured to measure and report the utilized web traffic of one or more client devices to a messaging platform. In another aspect, systems and methods to check whether one or more client devices have reached a permissible amount of web traffic limit are disclosed.

IPC Classes  ?

34.

Proxy selection by monitoring quality and available capacity

      
Application Number 18165027
Grant Number 11817946
Status In Force
Filing Date 2023-02-06
First Publication Date 2023-06-22
Grant Date 2023-11-14
Owner OXYLABS, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route
  • H04L 9/40 - Network security protocols

35.

TRAFFIC SERVICE THREADS FOR LARGE POOLS OF NETWORK ADDRESSES

      
Application Number IB2023051319
Publication Number 2023/112012
Status In Force
Filing Date 2023-02-14
Publication Date 2023-06-22
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Dagys, Gediminas
  • Paulius, Vitkus

Abstract

Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.

IPC Classes  ?

36.

Optimizing scraping requests through browsing profiles

      
Application Number 18169080
Grant Number 11768898
Status In Force
Filing Date 2023-02-14
First Publication Date 2023-06-22
Grant Date 2023-09-26
Owner OXYLABS, UAB (Lithuania)
Inventor Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a request for a web crawler to be enriched with a customized browsing profile in order to be categorized as an organic human user to obtain targeted content. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include at least some of the following exemplary steps: receiving and examining the parameters of a request received from a User's Device, enriching the request parameters with a pre-established browsing profile, sending the enriched request to a Target through the selected Proxy, receiving a response from the Target, dissecting the response's metadata that is appropriate for updating the browsing profile utilized for the request, and forwarding the data to the User's device pursuant to the examination of the response obtained from the Target system.

IPC Classes  ?

  • G06F 16/00 - Information retrievalDatabase structures thereforFile system structures therefor
  • G06F 16/9535 - Search customisation based on user profiles and personalisation
  • G06F 16/951 - IndexingWeb crawling techniques

37.

Traffic service threads for large pools of network addresses

      
Application Number 17931815
Grant Number 11929926
Status In Force
Filing Date 2022-09-13
First Publication Date 2023-06-15
Grant Date 2024-03-12
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Dagys, Gediminas
  • Vitkus, Paulius

Abstract

Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.

IPC Classes  ?

  • H04L 45/74 - Address processing for routing
  • H04L 65/1045 - Proxies, e.g. for session initiation protocol [SIP]
  • H04L 65/1069 - Session establishment or de-establishment
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/562 - Brokering proxy services

38.

Curating proxy server pools

      
Application Number 18165444
Grant Number 11831726
Status In Force
Filing Date 2023-02-07
First Publication Date 2023-06-15
Grant Date 2023-11-28
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vasiliauskas, Justas
  • Gervinskas, Justas

Abstract

A system and method of forming proxy server pools is provided. The method comprises several steps, such as requesting a pool to execute the user's request and retrieving an initial group. The system checks the service history of an initial group, including whether any of the proxy servers in an initial group are exclusive to existing pools. The exclusive proxy servers in an initial group with eligible proxy servers are replaced when needed and new proxy server pools are formed. The system also records the service history of proxy servers and pools before and after the pools are created. The method can also involve predicting the pool health in relation with the thresholds foreseen and replacing the proxy servers below the threshold.

IPC Classes  ?

  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 41/5019 - Ensuring fulfilment of SLA
  • H04L 9/40 - Network security protocols
  • H04L 67/562 - Brokering proxy services
  • G06N 5/01 - Dynamic search techniquesHeuristicsDynamic treesBranch-and-bound
  • H04L 67/564 - Enhancement of application control based on intercepted application data
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 43/0817 - Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning

39.

Transmitting request and response information through different proxies

      
Application Number 17707682
Grant Number 11652697
Status In Force
Filing Date 2022-03-29
First Publication Date 2023-05-16
Grant Date 2023-05-16
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Petruskevicius, Arnas
  • Pilkauskas, Valdas

Abstract

Systems and methods to manage and efficiently implement functional proxy services are disclosed. In the proxy services, a single instance of exit-node is connected to at least two or multiple supernodes at any given time. One of the plurality of supernodes is configured to ping and send diagnostic requests to the connected exit-node through a network. The exit-node is directed to send the pong message and diagnostic response data to a different supernode from among the plurality of supernodes connected to the exit-node. Likewise, a client's request is received by an element of the proxy service provider and forwarded to a specific supernode capable of forwarding the client's request to the exit-node. After performing the client's request, the exit-node returns response data to a different supernode from among the plurality of supernodes connected to the exit-node.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 41/12 - Discovery or management of network topologies

40.

Methods and systems to maintain multiple persistent channels between proxy servers

      
Application Number 17864266
Grant Number 11652890
Status In Force
Filing Date 2022-07-13
First Publication Date 2023-05-16
Grant Date 2023-05-16
Owner Oxylabs, UAB (Lithuania)
Inventor Dagys, Gediminas

Abstract

Proxy servers within a service provider infrastructure are enabled to maintain multiple persistent connections among themselves and to exchange data bi-directionally in an unsolicited manner. Specifically, exit proxy servers are enabled to request their respective proxy supernodes to update the already existing network connection to support WebSocket communication channels. Accordingly, the respective proxy supernodes are enabled to update the network connection with the exit proxy servers to support WebSocket communication channels. A single instance of a proxy supernode and an exit proxy server can maintain multiple WebSocket communication channels with each other. By utilizing the said WebSocket communication channels, the proxy supernode and the exit proxy servers can exchange data with each other simultaneously without any data losses. Thus, by exchanging data via the said WebSocket communication channels, the proxy supernodes and the exit proxy servers are aimed at servicing the proxy clients in processing their data requests.

IPC Classes  ?

  • H04L 67/141 - Setup of application sessions
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/56 - Provisioning of proxy services

41.

REGULATION METHODS FOR PROXY SERVICES

      
Application Number EP2022071840
Publication Number 2023/051979
Status In Force
Filing Date 2022-08-03
Publication Date 2023-04-06
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Stalioraitis, Giedrius
  • Balkauskas, Ovidijus

Abstract

Systems and methods to manage and regulate the requests of multiple proxy clients are disclosed. In one aspect, the system and methods disclosed herein aids in configuring proxy server(s) with a rate-limit functionality. Configuration of the rate-limit functionality may be realized by, but not limited to, installing configuration file(s) and/or software application(s) on the proxy server(s). The configuration provides information about the list of restricted and unrestricted domains and their respective request limit specification in a given time frame. Therefore, each time before a proxy server forwards the clients' requests to a target domain, the proxy server checks and ensures that the request count to the particular target domain is well within the limit specified in the request limit specification. Thus, the embodiments described herein aid in preventing the IP addresses of proxy service providers from being blocked or denied from the target websites.

IPC Classes  ?

  • H04L 47/25 - Flow controlCongestion control with rate being modified by the source upon detecting a change of network conditions

42.

Regulation methods for proxy services

      
Application Number 17936391
Grant Number 11632436
Status In Force
Filing Date 2022-09-29
First Publication Date 2023-03-30
Grant Date 2023-04-18
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Stalioraitis, Giedrius
  • Balkauskas, Ovidijus

Abstract

Systems and methods to manage and regulate the requests of multiple proxy clients are disclosed. In one aspect, the system and methods disclosed herein aids in configuring proxy server(s) with a rate-limit functionality. Configuration of the rate-limit functionality may be realized by, but not limited to, installing configuration file(s) and/or software application(s) on the proxy server(s). The configuration provides information about the list of restricted and unrestricted domains and their respective request limit specification in a given time frame. Therefore, each time before a proxy server forwards the clients' requests to a target domain, the proxy server checks and ensures that the request count to the particular target domain is well within the limit specified in the request limit specification. Thus, the embodiments described herein aid in preventing the IP addresses of proxy service providers from being blocked or denied from the target websites.

IPC Classes  ?

43.

Managed exit nodes and third party proxies

      
Application Number 17669204
Grant Number 11601518
Status In Force
Filing Date 2022-02-10
First Publication Date 2023-03-07
Grant Date 2023-03-07
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/56 - Provisioning of proxy services
  • H04L 67/1004 - Server selection for load balancing
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

44.

ADAPTIVE DATA COLLECTION OPTIMIZATION

      
Document Number 03214792
Status Pending
Filing Date 2022-08-03
Open to Public Date 2023-03-02
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

45.

Adaptive data collection optimization

      
Application Number 17823660
Grant Number 11636169
Status In Force
Filing Date 2022-08-31
First Publication Date 2023-03-02
Grant Date 2023-04-25
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

46.

ADAPTIVE DATA COLLECTION OPTIMIZATION

      
Application Number EP2022071835
Publication Number 2023/025552
Status In Force
Filing Date 2022-08-03
Publication Date 2023-03-02
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

47.

Direct data center request for proxy web scraping

      
Application Number 17958039
Grant Number 11734367
Status In Force
Filing Date 2022-09-30
First Publication Date 2023-02-02
Grant Date 2023-08-22
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

48.

Token-based authentication for a proxy web scraping service

      
Application Number 17726265
Grant Number 11841910
Status In Force
Filing Date 2022-04-21
First Publication Date 2023-01-19
Grant Date 2023-12-12
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petru{hacek Over (s)}kevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

49.

TRAFFIC COUNTING FOR PROXY WEB SCRAPING

      
Application Number 17373312
Status Pending
Filing Date 2021-07-12
First Publication Date 2023-01-19
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

50.

DYNAMIC WEB PAGE CLASSIFICATION IN WEB DATA COLLECTION

      
Application Number 17368636
Status Pending
Filing Date 2021-07-06
First Publication Date 2023-01-19
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Kuksta, Andrius
  • Gorskovas, Jurijus
  • Juravicius, Martynas

Abstract

The current application discloses processor-implemented methods and systems of processing unclassified HTML responses collected in the context of a data collection service, the method comprising, in one embodiment, receiving unclassified HTML documents, isolating elements relevant for category identification, deriving classification attributes from the isolated elements, and applying a Machine Learning-based classification model resulting in HTML data items classified and labelled accordingly. In certain embodiments the Machine Learning model may be a model trained on a pre-created training data set labeled manually or in an automatic fashion.

IPC Classes  ?

  • G06N 5/02 - Knowledge representationSymbolic representation
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

51.

Methods and systems for implementing a regionally contiguous proxy service

      
Application Number 17935663
Grant Number 11770457
Status In Force
Filing Date 2022-09-27
First Publication Date 2023-01-19
Grant Date 2023-09-26
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 43/0852 - Delays
  • H04L 67/306 - User profiles
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/01 - Protocols
  • H04L 67/52 - Network services specially adapted for the location of the user terminal

52.

Direct leg access for proxy web scraping

      
Application Number 17699939
Grant Number 11741174
Status In Force
Filing Date 2022-03-21
First Publication Date 2023-01-19
Grant Date 2023-08-29
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petru{hacek Over (s)}kevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

53.

WEB SCRAPING THROUGH USE OF PROXIES, AND APPLICATIONS THEREOF

      
Document Number 03214799
Status Pending
Filing Date 2022-06-24
Open to Public Date 2023-01-12
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 16/958 - Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

54.

Advanced data collection block identification

      
Application Number 17823257
Grant Number 11669588
Status In Force
Filing Date 2022-08-30
First Publication Date 2022-12-29
Grant Date 2023-06-06
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Systems and methods that allow examination of response data collected from content providers and provide for classification and routing according to the classification. The process of classification employs an unsupervised, or partially unsupervised, Machine Learning classifier model for identifying data collection responses that contains no data, mangled data, or a block, for assigning a classification correspondingly and for feeding the classification decision back to a data collection platform.

IPC Classes  ?

  • G06F 18/214 - Generating training patternsBootstrap methods, e.g. bagging or boosting
  • G06N 5/025 - Extracting rules from data
  • G06N 20/00 - Machine learning
  • G06F 18/2411 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
  • G06F 18/2415 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
  • G06F 18/243 - Classification techniques relating to the number of classes
  • G06N 3/044 - Recurrent networks, e.g. Hopfield networks
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 21/57 - Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
  • H04L 9/40 - Network security protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

55.

ADVANCED RESPONSE PROCESSING IN WEB DATA COLLECTION

      
Application Number 17856539
Status Pending
Filing Date 2022-07-01
First Publication Date 2022-12-29
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

ADVANCED RESPONSE PROCESSING IN WEB DATA COLLECTION discloses processor-implemented apparatuses, methods, and systems of processing unstructured raw HTML responses collected in the context of a data collection service, the method comprising, in one embodiment, receiving raw unstructured HTML documents and extracting text data with associated meta information that may comprise style and formatting information. In some embodiments data field tags and values may be assigned to the text blocks extracted, classifying the data based on the processing of Machine Learning algorithms. Additionally, blocks of extracted data may be grouped and re-grouped together and presented as a single data point. In another embodiment the system may aggregate and present the text data with the associated meta information in a structured format. In certain embodiments the Machine Learning model may be a model trained on a pre-created training data set labeled manually or in an automatic fashion.

IPC Classes  ?

  • G06F 16/953 - Querying, e.g. by the use of web search engines
  • G06N 20/00 - Machine learning
  • G06K 9/62 - Methods or arrangements for recognition using electronic means
  • G06V 10/46 - Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]Salient regional features

56.

ADVANCED RESPONSE PROCESSING IN WEB DATA COLLECTION

      
Document Number 03214781
Status Pending
Filing Date 2022-06-21
Open to Public Date 2022-12-29
Grant Date 2025-05-20
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Advanced response processing in web data collection discloses processor-implemented apparatuses, methods, and systems of processing unstructured raw HTML responses collected in the context of a data collection service, the method comprising, in one embodiment, receiving raw unstructured HTML documents and extracting text data with associated meta information that may comprise style and formatting information. In some embodiments data field tags and values may be assigned to the text blocks extracted, classifying the data based on the processing of Machine Learning algorithms. Additionally, blocks of extracted data may be grouped and re-grouped together and presented as a single data point. In another embodiment the system may aggregate and present the text data with the associated meta information in a structured format. In certain embodiments the Machine Learning model may be a model trained on a pre-created training data set labeled manually or in an automatic fashion.

IPC Classes  ?

57.

E-COMMERCE TOOLKIT INFRASTRUCTURE

      
Application Number 17360587
Status Pending
Filing Date 2021-06-28
First Publication Date 2022-12-29
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Stankevicius, Rimantas
  • Sulzenko, Aleksandras

Abstract

In one aspect, methods and systems for producing an index of a target website are described. In another aspect, methods and systems for extracting specific information from one or more specific indexed URLs are described. The method and system for producing an index of a target website include receiving and analyzing a client's specifications for the index, accessing a target website, extracting the relevant information from the target website, parsing the extracted information in order to identify the URLs, producing the index containing the identified URLs, storing the index (which contains the list of indexed URLs) in a database, compiling the index (which contains the list of indexed URLs) into different formats requested by the client and providing the client, the access information for accessing the compiled index.

IPC Classes  ?

  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
  • G06F 16/25 - Integrating or interfacing systems involving database management systems

58.

Proxy selection by monitoring quality and available capacity

      
Application Number 17822452
Grant Number 11606438
Status In Force
Filing Date 2022-08-26
First Publication Date 2022-12-22
Grant Date 2023-03-14
Owner Oxylabs, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 9/40 - Network security protocols
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

59.

Regulation methods for proxy services

      
Application Number 17805246
Grant Number 11496594
Status In Force
Filing Date 2022-06-03
First Publication Date 2022-11-08
Grant Date 2022-11-08
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Stalioraitis, Giedrius
  • Balkauskas, Ovidijus

Abstract

Systems and methods to manage and regulate the requests of multiple proxy clients are disclosed. In one aspect, the system and methods disclosed herein aids in configuring proxy server(s) with a rate-limit functionality. Configuration of the rate-limit functionality may be realized by, but not limited to, installing configuration file(s) and/or software application(s) on the proxy server(s). The configuration provides information about the list of restricted and unrestricted domains and their respective request limit specification in a given time frame. Therefore, each time before a proxy server forwards the clients' requests to a target domain, the proxy server checks and ensures that the request count to the particular target domain is well within the limit specified in the request limit specification. Thus, the embodiments described herein aid in preventing the IP addresses of proxy service providers from being blocked or denied from the target websites.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 67/56 - Provisioning of proxy services

60.

Traffic service threads for large pools of network addresses

      
Application Number 17644249
Grant Number 11489770
Status In Force
Filing Date 2021-12-14
First Publication Date 2022-11-01
Grant Date 2022-11-01
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Dagys, Gediminas
  • Vitkus, Paulius

Abstract

Traffic services for network addresses may be provided within threads executing within a main process for managing the traffic services. The threads may share resources within the main process, reducing the computing resources consumed to provide traffic services to large pools of network addresses. According to one embodiment, a method may include executing a main process for managing traffic services; determining, by the main process, a configuration specifying at least one or more destination addresses; instantiating, by the main process, one or more traffic service (TS) threads for the one or more destination addresses; and/or processing, by the one or more traffic service (TS) threads, inbound traffic for the corresponding one or more destination addresses. Other aspects and embodiments for traffic management are also disclosed.

IPC Classes  ?

  • H04L 45/74 - Address processing for routing
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

61.

Exit node benchmark feature

      
Application Number 17810351
Grant Number 11606439
Status In Force
Filing Date 2022-07-01
First Publication Date 2022-10-20
Grant Date 2023-03-14
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for effectively managing exit nodes are provided. The exemplary systems and methods use a Supernode to examine an Exit Node through sending and receiving a request to a Target. Information about the exit node is then stored into the Supernode. According to the information provided from the Supernode, the Exit Nodes Database systemizes the proxies according to availability and provides available exit nodes to a User Device.

IPC Classes  ?

  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 43/50 - Testing arrangements

62.

Early positive communication response in a proxy infrastructure

      
Application Number 17669208
Grant Number 11470175
Status In Force
Filing Date 2022-02-10
First Publication Date 2022-10-11
Grant Date 2022-10-11
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

63.

Adaptive data collection optimization

      
Application Number 17655932
Grant Number 11468137
Status In Force
Filing Date 2022-03-22
First Publication Date 2022-10-11
Grant Date 2022-10-11
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

64.

Proxy selection by monitoring quality and available capacity

      
Application Number 17806096
Grant Number 11463537
Status In Force
Filing Date 2022-06-09
First Publication Date 2022-10-04
Grant Date 2022-10-04
Owner OXYLABS, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 9/40 - Network security protocols
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

65.

Advanced data collection block identification

      
Application Number 17217869
Grant Number 11461588
Status In Force
Filing Date 2021-03-30
First Publication Date 2022-10-04
Grant Date 2022-10-04
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

Systems and methods that allow examination of response data collected from content providers and provide for classification and routing according to the classification. The process of classification employs an unsupervised, or partially unsupervised, Machine Learning classifier model for identifying data collection responses that contains no data, mangled data, or a block, for assigning a classification correspondingly and for feeding the classification decision back to a data collection platform.

IPC Classes  ?

  • G06K 9/62 - Methods or arrangements for recognition using electronic means
  • G06N 5/02 - Knowledge representationSymbolic representation
  • G06N 3/04 - Architecture, e.g. interconnection topology
  • G06N 20/00 - Machine learning
  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 21/57 - Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
  • H04L 9/40 - Network security protocols
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

66.

Curating proxy server pools

      
Application Number 17806754
Grant Number 11616848
Status In Force
Filing Date 2022-06-14
First Publication Date 2022-09-29
Grant Date 2023-03-28
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Vasiliauskas, Justas
  • Gervinskas, Justas

Abstract

A system and method of forming proxy server pools is provided. The method comprises several steps, such as requesting a pool to execute the user's request and retrieving an initial group. The system checks the service history of an initial group, including whether any of the proxy servers in an initial group are exclusive to existing pools. The exclusive proxy servers in an initial group with eligible proxy servers are replaced when needed and new proxy server pools are formed. The system also records the service history of proxy servers and pools before and after the pools are created. The method can also involve predicting the pool health in relation with the thresholds foreseen and replacing the proxy servers below the threshold.

IPC Classes  ?

67.

Supernode graceful shutdown in a proxy infrastructure

      
Application Number 17669217
Grant Number 11457073
Status In Force
Filing Date 2022-02-10
First Publication Date 2022-09-27
Grant Date 2022-09-27
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav
  • Ciuta, Gytis

Abstract

Systems and methods herein provide for a proxy infrastructure. In the proxy infrastructure, a network element (e.g., a supernode) is connected with a plurality of exit nodes. At one of a plurality of messenger units of the proxy infrastructure, a proxy protocol request is received directly from a client computing device. The proxy protocol request specifies a request and a target. In response the proxy protocol request, a selection is made between one between one of the plurality of exit nodes. A message with the request is sent from the messenger to the supernode connected with the selected exit node. Finally, the message is sent from the supernode to the selected exit node to forward the request to the target.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 67/141 - Setup of application sessions
  • H04L 41/12 - Discovery or management of network topologies
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

68.

IMPLEMENTING A REGIONALLY CONTIGUOUS PROXY SERVICE

      
Document Number 03204621
Status Pending
Filing Date 2021-11-04
Open to Public Date 2022-09-22
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/289 - Intermediate processing functionally located close to the data consumer application, e.g. in same machine, in same home or in same sub-network
  • H04L 67/2895 - Intermediate processing functionally located close to the data provider application, e.g. reverse proxies
  • H04L 67/306 - User profiles
  • H04L 67/51 - Discovery or management thereof, e.g. service location protocol [SLP] or web services
  • H04L 67/52 - Network services specially adapted for the location of the user terminal
  • H04L 67/563 - Data redirection of data network streams
  • H04L 67/564 - Enhancement of application control based on intercepted application data
  • H04L 67/568 - Storing data temporarily at an intermediate stage, e.g. caching

69.

Methods and systems for implementing a regionally contiguous proxy service

      
Application Number 17804213
Grant Number 11489937
Status In Force
Filing Date 2022-05-26
First Publication Date 2022-09-22
Grant Date 2022-11-01
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 67/01 - Protocols
  • H04L 67/52 - Network services specially adapted for the location of the user terminal
  • H04L 67/306 - User profiles
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 43/0852 - Delays

70.

Web scraper history management across multiple data centers

      
Application Number 17373634
Grant Number 11416564
Status In Force
Filing Date 2021-07-12
First Publication Date 2022-08-16
Grant Date 2022-08-16
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

71.

Database server management for proxy scraping jobs

      
Application Number 17373482
Grant Number 11416291
Status In Force
Filing Date 2021-07-12
First Publication Date 2022-08-16
Grant Date 2022-08-16
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

  • G06F 16/951 - IndexingWeb crawling techniques
  • G06F 9/48 - Program initiatingProgram switching, e.g. by interrupt

72.

Access control policy for proxy services

      
Application Number 17562553
Grant Number 11411954
Status In Force
Filing Date 2021-12-27
First Publication Date 2022-08-09
Grant Date 2022-08-09
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Kozlovski, Miroslav
  • Ciuta, Gytis
  • Karasevic, Valdemar

Abstract

Systems and methods to manage and efficiently perform authorization of multiple proxy clients are disclosed. Furthermore, systems and methods to measure and check whether the web traffic of one or more client devices has reached a permissible limit of web traffic assigned by the proxy service provider. Specifically, a proxy is configured to gather and save authorization information of one or more clients within its memory. Therefore, the proxy server can verify and authorize one or more clients by utilizing the data from its memory. Furthermore, the proxy is configured to measure and report the utilized web traffic of one or more client devices to a messaging platform. In another aspect, systems and methods to check whether one or more client devices have reached a permissible amount of web traffic limit are disclosed.

IPC Classes  ?

  • H04L 29/06 - Communication control; Communication processing characterised by a protocol
  • H04L 9/40 - Network security protocols

73.

Dynamic optimization of request parameters for proxy server

      
Application Number 17660344
Grant Number 11470174
Status In Force
Filing Date 2022-04-22
First Publication Date 2022-08-04
Grant Date 2022-10-11
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Juravicius, Martynas
  • Stalioraitis, Giedrius

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a more complex request for a web crawler to be defined in order to receive more specific data. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include the following steps: checking the parameters of a request received from a User's Device, adjusting the request parameters according to pre-established Scraping logic, selecting a Proxy according to the criteria of the pre-established Scraping logic, sending the adjusted request to the Target through the selected Proxy, checking metadata received from the Target, and forwarding the data to the User's device.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 67/146 - Markers for unambiguous identification of a particular session, e.g. session cookie or URL-encoding
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

74.

Proxy selection by monitoring quality and available capacity

      
Application Number 17650959
Grant Number 11388253
Status In Force
Filing Date 2022-02-14
First Publication Date 2022-07-12
Grant Date 2022-07-12
Owner OXYLABS, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 9/40 - Network security protocols
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

75.

Exit node benchmark feature

      
Application Number 17656007
Grant Number 11412062
Status In Force
Filing Date 2022-03-23
First Publication Date 2022-07-07
Grant Date 2022-08-09
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for effectively managing exit nodes are provided. The exemplary systems and methods use a Supernode to examine an Exit Node through sending and receiving a request to a Target. Information about the exit node is then stored into the Supernode. According to the information provided from the Supernode, the Exit Nodes Database systemizes the proxies according to availability and provides available exit nodes to a User Device.

IPC Classes  ?

  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 12/26 - Monitoring arrangements; Testing arrangements
  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 43/50 - Testing arrangements

76.

Advanced response processing in web data collection

      
Application Number 17359082
Grant Number 11379542
Status In Force
Filing Date 2021-06-25
First Publication Date 2022-07-05
Grant Date 2022-07-05
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Kuksta, Andrius

Abstract

ADVANCED RESPONSE PROCESSING IN WEB DATA COLLECTION discloses processor-implemented apparatuses, methods, and systems of processing unstructured raw HTML responses collected in the context of a data collection service, the method comprising, in one embodiment, receiving raw unstructured HTML documents and extracting text data with associated meta information that may comprise style and formatting information. In some embodiments data field tags and values may be assigned to the text blocks extracted, classifying the data based on the processing of Machine Learning algorithms. Additionally, blocks of extracted data may be grouped and re-grouped together and presented as a single data point. In another embodiment the system may aggregate and present the text data with the associated meta information in a structured format. In certain embodiments the Machine Learning model may be a model trained on a pre-created training data set labeled manually or in an automatic fashion.

IPC Classes  ?

  • G06N 20/00 - Machine learning
  • G06F 16/953 - Querying, e.g. by the use of web search engines
  • G06K 9/62 - Methods or arrangements for recognition using electronic means
  • G06V 10/46 - Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]Salient regional features

77.

Methods and systems for implementing a regionally contiguous proxy service

      
Application Number 17455256
Grant Number 11381667
Status In Force
Filing Date 2021-11-17
First Publication Date 2022-07-05
Grant Date 2022-07-05
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/561 - Adding application-functional data or data for application control, e.g. adding metadata
  • H04L 67/306 - User profiles
  • H04L 43/0852 - Delays
  • H04L 61/4511 - Network directoriesName-to-address mapping using standardised directoriesNetwork directoriesName-to-address mapping using standardised directory access protocols using domain name system [DNS]
  • H04L 67/01 - Protocols
  • H04L 67/52 - Network services specially adapted for the location of the user terminal

78.

Regulation methods for proxy services

      
Application Number 17652282
Grant Number 11381666
Status In Force
Filing Date 2022-02-24
First Publication Date 2022-07-05
Grant Date 2022-07-05
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Stalioraitis, Giedrius
  • Balkauskas, Ovidijus

Abstract

Systems and methods to manage and regulate the requests of multiple proxy clients are disclosed. In one aspect, the system and methods disclosed herein aids in configuring proxy server(s) with a rate-limit functionality. Configuration of the rate-limit functionality may be realized by, but not limited to, installing configuration file(s) and/or software application(s) on the proxy server(s). The configuration provides information about the list of restricted and unrestricted domains and their respective request limit specification in a given time frame. Therefore, each time before a proxy server forwards the clients' requests to a target domain, the proxy server checks and ensures that the request count to the particular target domain is well within the limit specified in the request limit specification. Thus, the embodiments described herein aid in preventing the IP addresses of proxy service providers from being blocked or denied from the target websites.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 67/56 - Provisioning of proxy services

79.

Throttling client requests for web scraping

      
Application Number 17373287
Grant Number 11372937
Status In Force
Filing Date 2021-07-12
First Publication Date 2022-06-28
Grant Date 2022-06-28
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

80.

Curating proxy server pools

      
Application Number 17654083
Grant Number 11637902
Status In Force
Filing Date 2022-03-09
First Publication Date 2022-06-23
Grant Date 2023-04-25
Owner Oxylabs, UAB (Lithuania)
Inventor
  • Vasiliauskas, Justas
  • Gervinskas, Justas

Abstract

A system and method of forming proxy server pools is provided. The method comprises several steps, such as requesting a pool to execute the user's request and retrieving an initial group. The system checks the service history of an initial group, including whether any of the proxy servers in an initial group are exclusive to existing pools. The exclusive proxy servers in an initial group with eligible proxy servers are replaced when needed and new proxy server pools are formed. The system also records the service history of proxy servers and pools before and after the pools are created. The method can also involve predicting the pool health in relation with the thresholds foreseen and replacing the proxy servers below the threshold.

IPC Classes  ?

81.

Adaptive data collection optimization

      
Application Number 17454074
Grant Number 11314833
Status In Force
Filing Date 2021-11-09
First Publication Date 2022-04-26
Grant Date 2022-04-26
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Bulba, Erikas
  • Briliauskas, Mantas

Abstract

Systems and methods to intelligently optimize data collection requests are disclosed. In one embodiment, systems are configured to identify and select a complete set of suitable parameters to execute the data collection requests. In another embodiment, systems are configured to identify and select a partial set of suitable parameters to execute the data collection requests. The present embodiments can implement machine learning algorithms to identify and select the suitable parameters according to the nature of the data collection requests and the targets. Moreover, the embodiments provide systems and methods to generate feedback data based upon the effectiveness of the data collection parameters. Furthermore, the embodiments provide systems and methods to score the set of suitable parameters based on the feedback data and the overall cost, which are then stored in an internal database.

IPC Classes  ?

  • G06F 16/955 - Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

82.

OPTIMIZING SCRAPING REQUESTS THROUGH BROWSING PROFILES

      
Document Number 03190466
Status Pending
Filing Date 2021-04-23
Open to Public Date 2022-04-07
Owner OXYLABS, UAB (Lithuania)
Inventor Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a request for a web crawler to be enriched with a customized browsing profile in order to be categorized as an organic human user to obtain targeted content. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include at least some of the following exemplary steps: receiving and examining the parameters of a request received from a User's Device, enriching the request parameters with a pre-established browsing profile, sending the enriched request to a Target through the selected Proxy, receiving a response from the Target, dissecting the response's metadata that is appropriate for updating the browsing profile utilized for the request, and forwarding the data to the User's device pursuant to the examination of the response obtained from the Target system.

IPC Classes  ?

  • G06F 16/9535 - Search customisation based on user profiles and personalisation

83.

Proxy selection by monitoring quality and available capacity

      
Application Number 17451878
Grant Number 11297158
Status In Force
Filing Date 2021-10-22
First Publication Date 2022-04-05
Grant Date 2022-04-05
Owner OXYLABS, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 29/06 - Communication control; Communication processing characterised by a protocol
  • H04L 43/10 - Active monitoring, e.g. heartbeat, ping or trace-route

84.

Regulation methods for proxy services

      
Application Number 17491136
Grant Number 11297152
Status In Force
Filing Date 2021-09-30
First Publication Date 2022-04-05
Grant Date 2022-04-05
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Stalioraitis, Giedrius
  • Balkauskas, Ovidijus

Abstract

Systems and methods to manage and regulate the requests of multiple proxy clients are disclosed. In one aspect, the system and methods disclosed herein aids in configuring proxy server(s) with a rate-limit functionality. Configuration of the rate-limit functionality may be realized by, but not limited to, installing configuration file(s) and/or software application(s) on the proxy server(s). The configuration provides information about the list of restricted and unrestricted domains and their respective request limit specification in a given time frame. Therefore, each time before a proxy server forwards the clients' requests to a target domain, the proxy server checks and ensures that the request count to the particular target domain is well within the limit specified in the request limit specification. Thus, the embodiments described herein aid in preventing the IP addresses of proxy service providers from being blocked or denied from the target websites.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 67/56 - Provisioning of proxy services
  • H04L 47/25 - Flow controlCongestion control with rate being modified by the source upon detecting a change of network conditions
  • H04L 67/01 - Protocols

85.

Optimizing scraping requests through browsing profiles

      
Application Number 17037429
Grant Number 11615157
Status In Force
Filing Date 2020-09-29
First Publication Date 2022-03-31
Grant Date 2023-03-28
Owner Oxylabs, UAB (Lithuania)
Inventor Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a request for a web crawler to be enriched with a customized browsing profile in order to be categorized as an organic human user to obtain targeted content. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include at least some of the following exemplary steps: receiving and examining the parameters of a request received from a User's Device, enriching the request parameters with a pre-established browsing profile, sending the enriched request to a Target through the selected Proxy, receiving a response from the Target, dissecting the response's metadata that is appropriate for updating the browsing profile utilized for the request, and forwarding the data to the User's device pursuant to the examination of the response obtained from the Target system.

IPC Classes  ?

86.

Direct leg access for proxy web scraping

      
Application Number 17373570
Grant Number 11281730
Status In Force
Filing Date 2021-07-12
First Publication Date 2022-03-22
Grant Date 2022-03-22
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

87.

DYNAMIC OPTIMIZATION OF REQUEST PARAMETERS FOR PROXY SERVER

      
Document Number 03190449
Status Pending
Filing Date 2021-04-07
Open to Public Date 2022-03-17
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a more complex request for a web crawler to be defined in order to receive more specific data. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include the following steps: checking the parameters of a request received from a User's Device, adjusting the request parameters according to pre- established Scraping logic, selecting a Proxy according to the criteria of the pre-established Scraping logic, sending the adjusted request to the Target through the selected Proxy, checking metadata received from the Target, and forwarding the data to the User's device.

IPC Classes  ?

  • G06F 16/951 - IndexingWeb crawling techniques
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/562 - Brokering proxy services
  • H04L 67/563 - Data redirection of data network streams
  • H04L 69/22 - Parsing or analysis of headers

88.

Dynamic optimization of request parameters for proxy server

      
Application Number 17305123
Grant Number 11343342
Status In Force
Filing Date 2021-06-30
First Publication Date 2022-03-17
Grant Date 2022-05-24
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Juravicius, Martynas
  • Stalioraitis, Giedrius

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a more complex request for a web crawler to be defined in order to receive more specific data. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include the following steps: checking the parameters of a request received from a User's Device, adjusting the request parameters according to pre-established Scraping logic, selecting a Proxy according to the criteria of the pre-established Scraping logic, sending the adjusted request to the Target through the selected Proxy, checking metadata received from the Target, and forwarding the data to the User's device.

IPC Classes  ?

  • H04L 67/56 - Provisioning of proxy services
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/146 - Markers for unambiguous identification of a particular session, e.g. session cookie or URL-encoding

89.

Curating proxy server pools

      
Application Number 17303616
Grant Number 11310336
Status In Force
Filing Date 2021-06-03
First Publication Date 2022-03-03
Grant Date 2022-04-19
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vasiliauskas, Justas
  • Gervinskas, Justas

Abstract

A system and method of forming proxy server pools is provided. The method comprises several steps, such as requesting a pool to execute the user's request and retrieving an initial group. The system checks the service history of an initial group, including whether any of the proxy servers in an initial group are exclusive to existing pools. The exclusive proxy servers in an initial group with eligible proxy servers are replaced when needed and new proxy server pools are formed. The system also records the service history of proxy servers and pools before and after the pools are created. The method can also involve predicting the pool health in relation with the thresholds foreseen and replacing the proxy servers below the threshold.

IPC Classes  ?

  • H04L 67/2866 - ArchitecturesArrangements
  • H04L 29/06 - Communication control; Communication processing characterised by a protocol
  • H04L 41/5019 - Ensuring fulfilment of SLA
  • H04L 67/562 - Brokering proxy services
  • G06N 5/00 - Computing arrangements using knowledge-based models

90.

Curating proxy server pools

      
Application Number 17006499
Grant Number 11463536
Status In Force
Filing Date 2020-08-28
First Publication Date 2022-03-03
Grant Date 2022-10-04
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Vasiliauskas, Justas
  • Gervinskas, Justas

Abstract

A system and method of forming proxy server pools is provided. The method comprises several steps, such as requesting a pool to execute the user's request and retrieving an initial group. The system checks the service history of an initial group, including whether any of the proxy servers in an initial group are exclusive to existing pools. The exclusive proxy servers in an initial group with eligible proxy servers are replaced when needed and new proxy server pools are formed. The system also records the service history of proxy servers and pools before and after the pools are created. The method can also involve predicting the pool health in relation with the thresholds foreseen and replacing the proxy servers below the threshold.

IPC Classes  ?

91.

EXIT NODE BENCHMARK FEATURE

      
Document Number 03173869
Status Pending
Filing Date 2021-04-09
Open to Public Date 2021-12-30
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for effectively managing exit nodes are provided. The exemplary systems and methods use a Supernode to examine an Exit Node through sending and receiving a request to a Target. Information about the exit node is then stored into the Supernode. According to the information provided from the Supernode, the Exit Nodes Database systemizes the proxies according to availability and provides available exit nodes to a User Device.

IPC Classes  ?

  • H04L 43/50 - Testing arrangements
  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 67/56 - Provisioning of proxy services

92.

Exit node benchmark feature

      
Application Number 17402748
Grant Number 11316948
Status In Force
Filing Date 2021-08-16
First Publication Date 2021-12-30
Grant Date 2022-04-26
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for effectively managing exit nodes are provided. The exemplary systems and methods use a Supernode to examine an Exit Node through sending and receiving a request to a Target. Information about the exit node is then stored into the Supernode. According to the information provided from the Supernode, the Exit Nodes Database systemizes the proxies according to availability and provides available exit nodes to a User Device.

IPC Classes  ?

  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 12/26 - Monitoring arrangements; Testing arrangements
  • H04L 67/288 - Distributed intermediate devices, i.e. intermediate devices for interaction with other intermediate devices on the same level
  • H04L 43/50 - Testing arrangements

93.

Methods and systems for implementing a regionally contiguous proxy service

      
Application Number 17207198
Grant Number 11212354
Status In Force
Filing Date 2021-03-19
First Publication Date 2021-12-28
Grant Date 2021-12-28
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Dzekunskas, Karolis

Abstract

System and method for efficiently implementing scalable, highly efficient decentralized proxy services through proxy infrastructures situated in different geo-locations. In one aspect, the systems and methods enable users from any geographical location to send requests to the geographically closest proxy infrastructure. One exemplary method described allows proxy infrastructures to gather, classify, and store metadata of exit nodes in its internal database. In another aspect, systems and methods described herein enable proxy infrastructures to select metadata of exit nodes from its internal database and forward requests from a user device to respective proxy servers or proxy supernodes to which the selected exit nodes are connected.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 29/12 - Arrangements, apparatus, circuits or systems, not covered by a single one of groups characterised by the data terminal
  • H04L 29/06 - Communication control; Communication processing characterised by a protocol
  • H04L 12/26 - Monitoring arrangements; Testing arrangements

94.

Token-based authentication for a proxy web scraping service

      
Application Number 17373608
Grant Number 11204971
Status In Force
Filing Date 2021-07-12
First Publication Date 2021-12-21
Grant Date 2021-12-21
Owner
  • OXYLABS, UAB (Lithuania)
  • TESO LT, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Petruskevicius, Arnas
  • Stalioraitis, Giedrius
  • Juravicius, Martynas
  • Stankevicius, Rimantas

Abstract

Embodiments disclose a system that allows for improved generation of web requests for scraping that, because of the nature of the requests and time and manner they are sent out, appear more organic, as in human generated, than conventional automated scraping systems. The system then manages how a client request to scrape a target website is made to the site, masking the request in a manner that makes it appear to the Web server as if the request is not generated by an automated system. In this way, by appearing more organic, Web servers may be less likely to block requests from the disclosed system or may take longer to block requests from the disclosed system. By avoiding Web servers blocking requests and extending the lifetime of IP proxies before they are blocked, embodiments can use a limited IP proxy address space more efficiently.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • G06F 16/951 - IndexingWeb crawling techniques
  • H04L 29/06 - Communication control; Communication processing characterised by a protocol

95.

Proxy scraper detector

      
Application Number 17136879
Grant Number 11196712
Status In Force
Filing Date 2020-12-29
First Publication Date 2021-12-07
Grant Date 2021-12-07
Owner OXYLABS, UAB (Lithuania)
Inventor Norbutas, Emanuelis

Abstract

The method and system detects if proxies are used by a user from a web server's side. The method and system uses HTTP/2 and HTTP/3 protocols and, more precisely, the ping frames to test the round trip time of messages between a web server and a user. At the same time, a web server uses an Internet Control Message Protocol echo requests to measure the round trip time to an IP address. A web server can then compare, aggregate, and analyze different round trip times and determine if they are coming from different sources, i.e. if a user is using a proxy server. A web server can make decisions based on the comparison of round trip times. For example, a difference in a single user's round trip times may trigger a restrictive user policy at the web server's end and a web server can decide to return the requested content, return an error message, or ban them and similarly limit services.

IPC Classes  ?

  • H04L 29/06 - Communication control; Communication processing characterised by a protocol
  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 12/26 - Monitoring arrangements; Testing arrangements

96.

Proxy server synchronizer

      
Application Number 17136890
Grant Number 11196833
Status In Force
Filing Date 2020-12-29
First Publication Date 2021-12-07
Grant Date 2021-12-07
Owner OXYLABS, UAB (Lithuania)
Inventor Norbutas, Emanuelis

Abstract

Systems and methods of web crawling/scraping process implementation are extended and target the web crawling process by submitting a request by a last-mile proxy to a web target. The systems and methods allow a request for a web crawler to be directed toward the target content platform through a proxy, or a plurality of proxies, for the purpose of optimizing the processing of the request. In at least one aspect, the systems and methods disclosed mitigate the potential for a negative evaluation of the requests by the content platform targeted through introducing the transfer of the execution of the steps within a scraping flow within the last-mile proxy system, thus aligning both network and application layer responses to the tests described.

IPC Classes  ?

  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure

97.

Proxy selection by monitoring quality and available capacity

      
Application Number 17217856
Grant Number 11184458
Status In Force
Filing Date 2021-03-30
First Publication Date 2021-11-23
Grant Date 2021-11-23
Owner OXYLABS, UAB (Lithuania)
Inventor Suckel, Vitalij

Abstract

Empirical data of exit nodes are continuously monitored and each exit node's overall performance and available capacity are calculated. The empirical data can include monitoring the number of concurrent requests currently being executed by each exit node and the disconnection chronology of each exit node. Further, each exit node is tested by benchmark requests and ping messages and each exit node's quality rate is calculated. Additionally, systems and methods are provided to select an exit node with the highest quality and available capacity value, from a particular pool to route the user request.

IPC Classes  ?

  • G06F 15/173 - Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star or snowflake
  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 12/26 - Monitoring arrangements; Testing arrangements
  • H04L 29/06 - Communication control; Communication processing characterised by a protocol

98.

Dynamic optimization of request parameters for proxy server

      
Application Number 17185756
Grant Number 11140235
Status In Force
Filing Date 2021-02-25
First Publication Date 2021-10-05
Grant Date 2021-10-05
Owner
  • OXYLABS, UAB (Lithuania)
  • OXYLABS, UAB (Lithuania)
Inventor
  • Vilcinskas, Eivydas
  • Juravicius, Martynas
  • Stalioraitis, Giedrius

Abstract

Systems and methods of task implementation are extended as provided herein and target the web crawling process through a step of submitting a request by a customer to a web crawler. The systems and methods allow a more complex request for a web crawler to be defined in order to receive more specific data. In one aspect, a method for data extraction and gathering from a Network by a Service provider infrastructure include the following steps: checking the parameters of a request received from a User's Device, adjusting the request parameters according to pre-established Scraping logic, selecting a Proxy according to the criteria of the pre-established Scraping logic, sending the adjusted request to the Target through the selected Proxy, checking metadata received from the Target, and forwarding the data to the User's device.

IPC Classes  ?

  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure

99.

Exit node benchmark feature

      
Application Number 17098167
Grant Number 11140238
Status In Force
Filing Date 2020-11-13
First Publication Date 2021-10-05
Grant Date 2021-10-05
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Pilkauskas, Valdas
  • Kozlovski, Miroslav

Abstract

Systems and methods for effectively managing exit nodes are provided. The exemplary systems and methods use a Supernode to examine an Exit Node through sending and receiving a request to a Target. Information about the exit node is then stored into the Supernode. According to the information provided from the Supernode, the Exit Nodes Database systemizes the proxies according to availability and provides available exit nodes to a User Device.

IPC Classes  ?

  • G06F 15/16 - Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
  • H04L 29/08 - Transmission control procedure, e.g. data link level control procedure
  • H04L 12/26 - Monitoring arrangements; Testing arrangements

100.

SMART PROXY ROTATOR

      
Document Number 03139804
Status In Force
Filing Date 2020-06-08
Open to Public Date 2021-04-08
Grant Date 2024-04-23
Owner OXYLABS, UAB (Lithuania)
Inventor
  • Juravicius, Martynas
  • Vilcinskas, Eivydas

Abstract

The task, logic of HTTP/HTTPS session statistics interception and collection is moved to the client side instead of the proxy layer. Encrypted HTTPS tunnel is terminated at the client end, making the actual content or data in transit invisible to both proxies and the smart proxy rotator (SPR). Client's scraping software has a plug-in installed that expands its functionality. HTTP/HTTPS session quality metrics are intercepted and collected at the client side, then sent to the SPR. Proxy usage mark "can be used" is obtained from the SPR for the currently analyzed proxy, based on the results of metrics analysis.

IPC Classes  ?

  • H04L 12/16 - Arrangements for providing special services to substations
  • H04L 67/02 - Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
  • H04L 67/56 - Provisioning of proxy services
  1     2        Next Page