An automotive assistant that receives a prompt with a succession of tasks to be carried out includes a first level to interact with an occupant and a second level that interacts with the first level and with external applications. The second level has domain-specific delegees, each of which carries out one of the tasks using one of the applications. A top-level prompt builder uses the context information and the query to build a succession of top-level prompts, each of which is directed to causing execution of a corresponding one of the tasks by an appropriate one of the delegees. It does so by prompting a top-level model to provide a corresponding succession of outputs. A top-level query builder uses the succession of outputs to build a corresponding succession of second-level queries, each of which is directed to a corresponding one of the domain-specific delegees.
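The two-level flow described in this abstract can be illustrated with a minimal Python sketch. It assumes the tasks have already been separated out of the occupant's prompt, and every name in it (`build_top_level_prompts`, `toy_top_level_model`, `DELEGEES`, the `domain|text` output convention) is hypothetical and not taken from the filing.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class SecondLevelQuery:
    domain: str  # which domain-specific delegee should handle the task
    text: str    # natural-language query for that delegee


def build_top_level_prompts(context: str, query: str, tasks: List[str]) -> List[str]:
    # One top-level prompt per task, each combining context, query and task.
    return [f"Context: {context}\nQuery: {query}\nTask: {task}" for task in tasks]


def toy_top_level_model(prompt: str) -> str:
    # Hypothetical stand-in for the top-level model: routes on a keyword
    # and answers in an assumed "domain|text" convention.
    task = prompt.rsplit("Task: ", 1)[1]
    domain = "navigation" if "route" in task else "media"
    return f"{domain}|{task}"


def build_second_level_queries(outputs: List[str]) -> List[SecondLevelQuery]:
    # Turn each model output into a query aimed at exactly one delegee.
    queries = []
    for out in outputs:
        domain, text = out.split("|", 1)
        queries.append(SecondLevelQuery(domain, text))
    return queries


# Hypothetical delegees; real ones would call external applications.
DELEGEES: Dict[str, Callable[[str], str]] = {
    "navigation": lambda text: f"NAV: {text}",
    "media": lambda text: f"MEDIA: {text}",
}


def run(context: str, query: str, tasks: List[str],
        model: Callable[[str], str],
        delegees: Dict[str, Callable[[str], str]]) -> List[str]:
    prompts = build_top_level_prompts(context, query, tasks)
    outputs = [model(p) for p in prompts]  # succession of outputs
    queries = build_second_level_queries(outputs)
    return [delegees[q.domain](q.text) for q in queries]
```

Calling `run("driver alone", "plan route home and play jazz", ["plan route home", "play jazz"], toy_top_level_model, DELEGEES)` sends the first task to the navigation delegee and the second to the media delegee, in order.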
2.
AUTOMOTIVE ASSISTANT WITH HIERARCHY HAVING A BACKBONE AND DOMAIN-SPECIFIC DELEGEES
An automotive assistant in an infotainment system of a vehicle includes a hierarchy that receives a top-level query via a speech interface and that provides a top-level response to the query. The hierarchy includes a top-level agent and a first and second domain with a related domain-specific query. Both domains are queried using prompts comprising natural language. The top-level response is formulated based on first and second domain-specific activities.
B60K 35/26 - Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor using acoustic output
A method for providing a vehicle with a command system that responds to natural-language commands in a target language includes receiving source training-data used for training in a source language, generating target training-data based on the source training-data, and using that target training-data to train the command system in the target language. Generating the target training-data includes generating rewritten source-sentences having concept tags derived from source annotations and using a translation model to translate these into corresponding target sentences, thus generating pairs that each comprise a rewritten source-sentence and a target sentence. A first subset of these pairs is used to fine-tune the translation model. The fine-tuned model then translates the pairs in a second subset of pairs to generate the target training-data based on the source training-data.
A method for vehicle occupant input includes acquiring vehicle occupant data from a plurality of occupants of a vehicle, where the vehicle occupant data includes acoustic spatial data and biometric data including at least voice characteristic data. The method further includes associating one or more positions of a plurality of positions in the vehicle with occupant information. The occupant information associated with a position includes the biometric data for an occupant at the position based on the vehicle occupant data acquired. The method also includes processing a first audio input from a first occupant of the vehicle using the voice characteristic data associated with a first position of the first occupant in the vehicle.
A method includes the use of a source mapper that is configured to operate in a source vehicle to enable training of a target mapper that is configured to operate in a target vehicle. The source mapper maps an utterance into a source command that controls a source-vehicle device in the source vehicle and the target mapper maps the same utterance into a target command that controls a target-vehicle device in the target vehicle. The source command and target command are commands from a source command-set and a target command-set, respectively, with the target command-set differing from the source command-set.
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
G10L 15/183 - Speech classification or search using natural language modelling using context dependencies, e.g. language models
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
6.
TIME-VARYING COUNTENANCE ASSESSMENT FOR AFFECT STATE ESTIMATION
A method includes generating an embedding vector for each image among a plurality of images of an occupant in a vehicle. The plurality of images define an image sequence. The method further includes defining an ordered sequence of the embedding vectors, and outputting an emotional state of the occupant based at least in part on an assessment of the ordered sequence of the embedding vectors using a neural network. The ordered sequence is indicative of the temporal sequence of the plurality of images.
G06V 20/59 - Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
G06V 40/16 - Human faces, e.g. facial parts, sketches or expressions
G06V 10/62 - Extraction of image or video features relating to a temporal dimension, e.g. time-based feature extraction; Pattern tracking
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
A system includes a processor and a non-transitory computer readable medium. The non-transitory computer readable medium includes programming instructions that, when executed by the processor, cause the processor to operate a large language model and generate a prompt chain using emotional state information that provides an emotional state of a driver. The prompt chain includes a sequence of parameterized prompts, where the sequence of parameterized prompts includes a first parameterized prompt that incorporates information gleaned from a response by the large language model to a second parameterized prompt that occurred prior to the first parameterized prompt. The programming instructions further cause the processor to output, by the large language model, a candidate action to be undertaken to regulate the emotional state of the driver based on the prompt chain and context information, and cause the candidate action to be performed in response to receiving approval from the driver.
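A prompt chain of this kind can be sketched in a few lines of Python. The sketch assumes each parameterized prompt is a format string with `{emotion}`, `{context}` and `{previous}` placeholders, and it treats the large language model as any callable from prompt text to response text. The function names are hypothetical, not from the filing.

```python
from typing import Callable, List


def run_prompt_chain(templates: List[str], llm: Callable[[str], str],
                     emotional_state: str, context: str) -> str:
    # Each prompt is filled in with the emotional state, the context and the
    # model's response to the previous prompt in the chain.
    previous = ""
    for template in templates:
        prompt = template.format(emotion=emotional_state,
                                 context=context,
                                 previous=previous)
        previous = llm(prompt)
    # The response to the last prompt is treated as the candidate action.
    return previous


def act_if_approved(candidate: str, approved: bool,
                    perform: Callable[[str], None]) -> bool:
    # The candidate action is performed only after the driver approves it.
    if approved:
        perform(candidate)
        return True
    return False
```

With templates such as `"Driver seems {emotion}. Context: {context}. What might help?"` followed by `"Given the idea '{previous}', propose one action."`, the second prompt carries the model's answer to the first, which is the dependency the abstract describes.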
The techniques described herein relate to systems and methods for generating a dialogue between at least a first and second machine learning (ML) enabled agent and presenting the dialogue to a user. An example method includes receiving, from the user, input indicative of a request to generate a dialogue about a topic, accessing reference data related to the topic, generating a first prompt, processing the first prompt with the first agent to generate first natural language text (NLT) responsive to the first prompt and in accordance with the first idiolect, providing the first NLT to both the second agent and to the user, generating, using the first NLT, a second prompt, processing the second prompt with the second agent to generate second NLT responsive to the second prompt and in accordance with the second idiolect, and providing the second NLT to both the first agent and to the user.
A method for causing an enhanced announcement to sound in a vehicle may include receiving an utterance at a neural network, the utterance including a phrase to be audibly played via a vehicle virtual assistant, transforming the utterance into a humanized utterance reflective of a human-like voice, and generating an enhanced announcement from the humanized utterance by applying at least one effect to at least a portion of the humanized utterance to decrease the human-like voice of at least that portion.
A method comprising causing a voice assistant and a recommendation engine that are executing in an infotainment system of a vehicle to cooperate in processing a vehicle occupant's acceptance of a recommendation proposed by the recommendation engine by having an interface to enable the recommendation engine to provide recommendation context to the voice assistant to enable the voice assistant to resolve an ambiguity in the occupant's acceptance of the recommendation.
An apparatus for interacting with an occupant in a vehicle includes an infotainment system that has been integrated into the vehicle, an automotive assistant that is configured to execute in the infotainment system, a speech interface that is configured to receive, from the occupant, an original utterance that is to be processed by the automotive assistant, and a classifier that determines that the original utterance has either anaphora or ellipsis. The apparatus further includes a model that provides a response to the occupant based on a prompt that includes the original utterance and that has been augmented by a function specification selected based at least in part on a rewritten utterance that has been derived from the original utterance.
A voice converter includes a first stage that receives an input signal and a second stage that provides an output signal. The input signal comprises content and first speaker-information. The output signal comprises the same content but with second speaker-information having supplanted the first speaker-information. The stages are trained independently of each other with the first having been trained using a training dataset that comprises utterances from other than the target speaker and the second having been trained using a tuning dataset that comprises utterances that consist of the second speaker-information.
G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
G10L 25/75 - Speech or voice analysis techniques not restricted to a single one of groups for modelling vocal tract parameters
G10L 21/013 - Adapting to target pitch
An apparatus for assessing affect state of an occupant of a vehicle includes a sensor that collects information indicative of said occupant's affect display, an enrollment database that comprises affect information derived from said occupant, and an infotainment system that executes an affect assessor.
B60W 40/08 - Estimation or calculation of driving parameters for road vehicle drive control systems not related to the control of a particular sub-unit related to drivers or passengers
B60W 50/14 - Means for informing the driver, warning the driver or prompting a driver intervention
B60W 50/16 - Tactile feedback to the driver, e.g. vibration or force feedback to the driver on the steering wheel or the accelerator pedal
G06V 20/59 - Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
14.
DETECTING ACCIDENTAL ACTIVATION OF SPEECH INTERFACE
A replay detector executing in a vehicle prevents a speech interface from acting on audio inputs received from a microphone when those audio inputs are from a first class of audio inputs and permits the speech interface to act on audio inputs from a second class of audio inputs. Audio inputs from the first class result from technically-produced acoustic signals that have been produced in an environment of the microphone. Audio inputs from the second class result from acoustic signals that have been produced by at least one person in that environment.
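The gating behaviour can be sketched as follows. The abstract does not say how the two classes are told apart, so the classifier here is a hypothetical placeholder that reads a flag from a dictionary. A real detector would analyze the audio signal itself.

```python
from enum import Enum
from typing import Callable, Optional


class AudioClass(Enum):
    TECHNICAL = "technical"  # first class: produced by a device, e.g. a loudspeaker
    HUMAN = "human"          # second class: produced by a person in the cabin


def toy_classifier(audio: dict) -> AudioClass:
    # Hypothetical stand-in for the replay detector's decision.
    if audio.get("from_loudspeaker"):
        return AudioClass.TECHNICAL
    return AudioClass.HUMAN


def gate(audio: dict,
         classify: Callable[[dict], AudioClass],
         speech_interface: Callable[[dict], str]) -> Optional[str]:
    # Block first-class inputs; pass second-class inputs to the speech interface.
    if classify(audio) is AudioClass.TECHNICAL:
        return None
    return speech_interface(audio)
```

Placing the gate in front of the speech interface, rather than inside it, keeps the interface unchanged and lets the detector be swapped out independently.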
An apparatus for reducing occurrences of out-of-domain utterances resulting from static intervals within an utterance that is made by a user within a vehicle includes an automotive assistant for receiving an utterance from a user. The automotive assistant includes a streaming-language processor to process the utterance in real time as it is being received from the user via a microphone in the vehicle. The utterance includes a growth interval, a static interval, and a transition from the growth interval to the static interval. A hesitation model is configured to interact with the streaming-language processor to determine whether a static interval represents either an intent to end the utterance or an intent to begin a new growth interval after the static interval. The streaming-language processor is configured to respond to detection of the latter intent by promoting resolution of the utterance based on information from the hesitation model and an intent-prediction model.
Control over speaking style of a text-to-speech (TTS) system is provided without necessarily requiring that the training of the TTS conversion process (e.g., the ANN used for the conversion) take into account the speaking styles of the training data. For example, the TTS system may allow adjustment of characteristics of speaking styles, such as speed, perceivable degree of “kindness”, average pitch, pitch variation, and duration of pauses. In some examples, a voice designer may have a number of independent controls that vary corresponding characteristics without necessarily varying others. Once the designer has configured a desired overall speaking style based on those controllable characteristics, the TTS system can be configured to use that speaking style for deployments of the TTS system. For example, the TTS system may be used for audio output in a voice assistant, for instance, for an in-vehicle voice assistant.
G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser
G06F 3/0484 - Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
A vehicle includes a cabin, an internal-loudspeaker set, an external-microphone set, and a signal processor that filters a raw audio signal that has been received by the external-microphone set and broadcasts the resulting filtered audio signal into the cabin using the internal-loudspeaker set.
G10K 11/178 - Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
H04N 7/18 - Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
H04N 23/62 - Control of parameters via user interfaces
H04N 23/695 - Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
An automotive assistant that executes in an infotainment system of a vehicle includes an arbitrator that is configured to receive the audio input provided by the occupant and, based at least in part on the audio input, to output a member-selection signal that selects a domain-specific member from a federation of domain-specific members. The automotive assistant is further configured to receive content from the selected domain-specific member and to provide audio output that responds to the audio input provided by the occupant. This audio output is based at least in part on the content from the selected domain-specific member of the federation of domain-specific members.
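A minimal sketch of the arbitration step follows. It assumes the audio input has already been transcribed to text and that selection is done by simple keyword matching. Both are illustrative simplifications, and the names `arbitrate`, `respond`, `KEYWORDS` and the `fallback` member are hypothetical.

```python
from typing import Callable, Dict, Tuple

Federation = Dict[str, Callable[[str], str]]

# Hypothetical routing table: member name -> trigger words.
KEYWORDS: Dict[str, Tuple[str, ...]] = {
    "weather": ("rain", "forecast", "weather"),
    "music": ("play", "song"),
}


def arbitrate(utterance: str, keywords: Dict[str, Tuple[str, ...]]) -> str:
    # Output a member-selection signal: the name of one federation member.
    words = utterance.lower().split()
    for member, triggers in keywords.items():
        if any(trigger in words for trigger in triggers):
            return member
    return "fallback"


def respond(utterance: str, federation: Federation,
            keywords: Dict[str, Tuple[str, ...]]) -> str:
    member = arbitrate(utterance, keywords)
    content = federation[member](utterance)  # content from the selected member
    # The response is based at least in part on that member's content.
    return f"[{member}] {content}"
```

A federation here is just a dictionary of callables, so adding a domain means adding one entry to the federation and one to the routing table.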
An apparatus for providing information services to a user in a vehicle includes an automotive concierge, a prompter, and an interaction mode. The automotive concierge is hosted by an infotainment system in the vehicle and engages in an interaction with the user based on context. The prompter generates a prompt for a model in response to an instruction from the automotive concierge, the instruction being based on the context information and the prompt being selected to cause the model to generate content that invites a choice from the user. The interaction mode delivers the content to the user and provides information from the user to the automotive concierge.
In an infotainment system of a vehicle, a mass storage unit stores map data and context data. The map data comprises geographic information about a geographic area around the vehicle and the context data comprises location-dependent context information about the geographic area for use when engaging in speech interaction with an occupant of the vehicle. The context data includes subsets of context data, each of which is pertinent to a different geographic area. A memory manager copies these subsets of context data into a dynamic memory based on the vehicle's movement.
B60K 35/26 - Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor using acoustic output
G01C 21/00 - Navigation; Navigational instruments not provided for in groups
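The memory manager in the map-and-context-data entry above can be sketched as a tile cache. The sketch assumes the context data is partitioned into square tiles keyed by integer grid coordinates, and that subsets outside a neighbourhood of the vehicle are evicted from dynamic memory. Neither detail is stated in the abstract, and the class and parameter names are hypothetical.

```python
from typing import Dict, Tuple

Tile = Tuple[int, int]


class ContextMemoryManager:
    def __init__(self, storage: Dict[Tile, dict],
                 tile_size_km: float = 10.0, radius: int = 1) -> None:
        self.storage = storage            # stands in for the mass storage unit
        self.tile_size_km = tile_size_km  # assumed edge length of one tile
        self.radius = radius              # tiles kept on each side of the vehicle
        self.dynamic: Dict[Tile, dict] = {}  # stands in for dynamic memory

    def tile_of(self, x_km: float, y_km: float) -> Tile:
        return (int(x_km // self.tile_size_km), int(y_km // self.tile_size_km))

    def update(self, x_km: float, y_km: float) -> None:
        # Called as the vehicle moves; keeps nearby context subsets loaded.
        cx, cy = self.tile_of(x_km, y_km)
        r = self.radius
        wanted = {(cx + dx, cy + dy)
                  for dx in range(-r, r + 1)
                  for dy in range(-r, r + 1)}
        for tile in list(self.dynamic):
            if tile not in wanted:
                del self.dynamic[tile]  # evict subsets that are now far away
        for tile in wanted:
            if tile in self.storage and tile not in self.dynamic:
                self.dynamic[tile] = self.storage[tile]  # copy into dynamic memory
```

Calling `update` with each new position keeps only the subsets around the vehicle resident, which bounds memory use regardless of how much context data sits in storage.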
An apparatus for providing information services to a user in a vehicle includes an automotive concierge, a prompter, and an interaction mode. The automotive concierge is hosted by an infotainment system in the vehicle and engages in an interaction with the user based on context. The prompter generates a prompt for a model in response to an instruction from the automotive concierge, the instruction being based on the context information and the prompt being selected to cause the model to generate content that invites a choice from the user. The interaction mode delivers the content to the user and provides information from the user to the automotive concierge.
B60K 35/26 - Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor using acoustic output
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods & services
Downloadable computer software and downloadable computer
software platforms for use in and with mobile devices and
vehicles for enabling operation, control and performance of
vehicle systems; downloadable computer software and
downloadable computer software platforms for use in and with
mobile devices and vehicles for enabling operation and
control of mobile device and vehicle functions based on user
commands; downloadable artificial intelligence software for
enabling user interaction with vehicles; downloadable
computer software for understanding user preferences;
downloadable computer software for speech recognition and
natural language understanding; downloadable computer
software for gaze and gesture detection in or associated
with vehicles; downloadable computer software for
authentication and identification of individuals;
downloadable computer software for reading and translating
handwriting and converting text into speech; downloadable
computer software for speech signal enhancement;
downloadable computer software and downloadable computer
software platforms for connecting vehicles with one or more
computing devices; downloadable computer software for
connecting, operating, and managing networked vehicles;
software for vehicle navigation; downloadable computer
software for vehicle operation, control and user interaction
with vehicles; downloadable computer software for use in the
operation and control of autonomous-driving vehicles.
Providing temporary use of non-downloadable computer
software for operating voice recognition and voice-activated
personal assistance programs; providing temporary use of
non-downloadable computer software for enabling hands-free
operation of computing devices using voice activation and
voice recognition; software as a service (SaaS) services
featuring software using artificial intelligence technology
that enables users to use a voice activated virtual
assistant; software as a service (SaaS) services featuring
software using artificial intelligence technology, namely, a
digital assistant featuring speech recognition software;
software as a service (SaaS) services featuring software
applications for computer understanding, recognition, and
processing of natural language; software as a service (SaaS)
services featuring software applications for programming and
controlling communication with voice assistants,
drive-assistants, and smart assistants; software as a
service (SaaS) services featuring software applications for
recognizing, authenticating, and verifying the identity of a
speaker; software as a service (SaaS) services featuring
software applications for the deployment of conversational
Artificial Intelligence (AI) technology; software as a
service (SaaS) services featuring software applications for
use in and with mobile devices and vehicles for enabling
operation, control and performance of vehicle systems;
software as a service (SaaS) services featuring software
applications for use in and with mobile devices and vehicles
for enabling operation and control of mobile device and
vehicle functions based on user commands; software as a
service (SaaS) services featuring software applications
including artificial intelligence software for enabling user
interaction with vehicles; software as a service (SaaS)
services featuring software applications for understanding
user preferences; software as a service (SaaS) services
featuring software applications for speech recognition and
natural language understanding; software as a service (SaaS)
services featuring software applications for gaze and
gesture detection in or associated with vehicles; software
as a service (SaaS) services featuring software applications
for authentication and identification of individuals;
software as a service (SaaS) services featuring software
applications for reading and translating handwriting and
converting text into speech; software as a service (SaaS)
services featuring software applications for speech signal
enhancement; software as a service (SaaS) services featuring
software applications for connecting vehicles with one or
more computing devices; software as a service (SaaS)
services featuring software applications for connecting,
operating, and managing networked vehicles; software as a
service (SaaS) services featuring software applications for
vehicle navigation; software as a service (SaaS) services
featuring software applications for vehicle operation,
control and user interaction with vehicles; software as a
service (SaaS) services featuring software applications for
use in the operation and control of autonomous-driving
vehicles; software as a service (SaaS) services for
developing, deploying, maintaining, administering, managing,
training, validating, configuring, monitoring, using,
querying, and auditing artificial intelligence software
including machine learning applications consisting of large
language model applications; providing temporary use of
online non-downloadable software for developing, deploying,
maintaining, administering, managing, training, validating,
configuring, monitoring, using, querying, and auditing
artificial intelligence software in the nature of machine
learning applications consisting of large language model
applications; platform as a service (PaaS) featuring
computer software for enabling secure transmission of
digital information and data to and from artificial
intelligence software including machine learning models
consisting of large language models, for accelerating
training and development of large language models, for
administering large language models, for integrating with
other software systems to share data, and for improving the
quality of machine learning model in the nature of large
language model responses; platform as a service (PaaS)
featuring computer software for developing, deploying,
maintaining, managing, training, validating, configuring,
monitoring, querying, and auditing large language model
applications; software as a service (SAAS) services
featuring software for enterprise-grade, large-language
model hosting and fine-tuning on the cloud.
23.
AUTOMOTIVE INFOTAINMENT SYSTEM WITH SPATIALLY-COGNIZANT APPLICATIONS THAT INTERACT WITH A SPEECH INTERFACE
An automotive processing unit includes an infotainment system having a speech interface, an application suite comprising one or more spatially-cognizant applications, and an automotive assistant that is configured to execute one or more of the spatially-cognizant applications. The speech interface is configured to receive a navigation announcement from a navigator and a touring announcement from one of the spatially-cognizant applications and, in response, to cause a spoken announcement to be made audible in a vehicle's cabin through a loudspeaker. The spoken announcement comprises content from at least one of the touring announcement and the navigation announcement.
A system for interactive and iterative media generation may include loudspeakers configured to play back audio signals into an environment, the audio signals including karaoke content; at least one microphone configured to receive microphone signals indicative of sound in the environment; and a processor programmed to receive a first microphone signal from the at least one microphone, the first microphone signal including a first user sound and karaoke content, instruct the loudspeakers to play back the first microphone signal, receive a second microphone signal from the at least one microphone, the second microphone signal including the first user sound of the first microphone signal and a second user sound, and transmit the second microphone signal, including the first and second microphone signals and the karaoke content, as an instance of iteratively-generated media content.
A vehicle system for classifying a spoken utterance within a vehicle cabin as one of system-directed and non-system-directed may include at least one microphone configured to detect at least one audio signal from at least one occupant of a vehicle, and a processor programmed to receive the at least one audio signal including at least one acoustic utterance, determine a number of vehicle occupants based at least in part on the at least one signal, determine a probability that the utterance is system-directed based at least in part on the utterance and the number of vehicle occupants, determine a classification threshold based at least in part on the number of vehicle occupants, and compare the classification threshold to the probability to determine whether the at least one acoustic utterance is one of a system-directed utterance and a non-system-directed utterance.
B60R 16/037 - Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric for occupant comfort
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
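The occupant-dependent threshold in the utterance-classification entry above can be sketched as below. The abstract says only that the threshold depends on the number of occupants. The assumption that it rises with each additional occupant (since speech is more likely to be addressed to another person), as well as the specific values of `base`, `step` and `cap`, are illustrative.

```python
def classification_threshold(num_occupants: int, base: float = 0.5,
                             step: float = 0.1, cap: float = 0.9) -> float:
    # Assumed rule: each occupant beyond the first raises the threshold.
    extra = max(num_occupants - 1, 0)
    return min(base + step * extra, cap)


def is_system_directed(probability: float, num_occupants: int) -> bool:
    # Compare the probability that the utterance is system-directed
    # with the occupant-dependent threshold.
    return probability >= classification_threshold(num_occupants)
```

Under these assumed values, a probability of 0.6 is enough when the driver is alone but not when three occupants are present.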
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods & services
Downloadable computer software and downloadable computer
software platforms for use in and with mobile devices and
vehicles for enabling operation, control and performance of
vehicle systems; downloadable computer software and
downloadable computer software platforms for use in and with
mobile devices and vehicles for enabling operation and
control of mobile device and vehicle functions based on user
commands; downloadable artificial intelligence software for
enabling user interaction with vehicles; downloadable
computer software for understanding user preferences;
downloadable computer software for speech recognition and
natural language understanding; downloadable computer
software for gaze and gesture detection in or associated
with vehicles; downloadable computer software for
authentication and identification of individuals;
downloadable computer software for reading and translating
handwriting and converting text into speech; downloadable
computer software for speech signal enhancement;
downloadable computer software and downloadable computer
software platforms for connecting vehicles with one or more
computing devices; downloadable computer software for
connecting, operating, and managing networked vehicles;
software for vehicle navigation; downloadable computer
software for vehicle operation, control and user interaction
with vehicles; downloadable computer software for use in the
operation and control of autonomous-driving vehicles.
Providing temporary use of non-downloadable computer
software for operating voice recognition and voice-activated
personal assistance programs; providing temporary use of
non-downloadable computer software for enabling hands-free
operation of computing devices using voice activation and
voice recognition; software as a service (SaaS) services
featuring software using artificial intelligence technology
that enables users to use a voice activated virtual
assistant; software as a service (SaaS) services featuring
software using artificial intelligence technology, namely, a
digital assistant featuring speech recognition software;
software as a service (SaaS) services featuring software
applications for computer understanding, recognition, and
processing of natural language; software as a service (SaaS)
services featuring software applications for programming and
controlling communication with voice assistants,
drive-assistants, and smart assistants; software as a
service (SaaS) services featuring software applications for
recognizing, authenticating, and verifying the identity of a
speaker; software as a service (SaaS) services featuring
software applications for the deployment of conversational
artificial intelligence (AI) technology; software as a
service (SaaS) services featuring software applications for
use in and with mobile devices and vehicles for enabling
operation, control and performance of vehicle systems;
software as a service (SaaS) services featuring software
applications for use in and with mobile devices and vehicles
for enabling operation and control of mobile device and
vehicle functions based on user commands; software as a
service (SaaS) services featuring software applications
including artificial intelligence software for enabling user
interaction with vehicles; software as a service (SaaS)
services featuring software applications for understanding
user preferences; software as a service (SaaS) services
featuring software applications for speech recognition and
natural language understanding; software as a service (SaaS)
services featuring software applications for gaze and
gesture detection in or associated with vehicles; software
as a service (SaaS) services featuring software applications
for authentication and identification of individuals;
software as a service (SaaS) services featuring software
applications for reading and translating handwriting and
converting text into speech; software as a service (SaaS)
services featuring software applications for speech signal
enhancement; software as a service (SaaS) services featuring
software applications for connecting vehicles with one or
more computing devices; software as a service (SaaS)
services featuring software applications for connecting,
operating, and managing networked vehicles; software as a
service (SaaS) services featuring software applications for
vehicle navigation; software as a service (SaaS) services
featuring software applications for vehicle operation,
control and user interaction with vehicles; software as a
service (SaaS) services featuring software applications for
use in the operation and control of autonomous-driving
vehicles; software as a service (SaaS) services for
developing, deploying, maintaining, administering, managing,
training, validating, configuring, monitoring, using,
querying, and auditing artificial intelligence software
including machine learning applications consisting of large
language model applications; providing temporary use of
online non-downloadable software for developing, deploying,
maintaining, administering, managing, training, validating,
configuring, monitoring, using, querying, and auditing
artificial intelligence software in the nature of machine
learning applications consisting of large language model
applications; platform as a service (PaaS) featuring
computer software for enabling secure transmission of
digital information and data to and from artificial
intelligence software including machine learning models
consisting of large language models, for accelerating
training and development of large language models, for
administering large language models, for integrating with
other software systems to share data, and for improving the
quality of machine learning model in the nature of large
language model responses; platform as a service (PaaS)
featuring computer software for developing, deploying,
maintaining, managing, training, validating, configuring,
monitoring, querying, and auditing large language model
applications; software as a service (SAAS) services
featuring software for enterprise-grade, large-language
model hosting and fine-tuning on the cloud.
27.
EMERGENT DRIVING ASSISTANT BASED ON EMOTION AND GAZE
An emergent driving assistant for a vehicle includes control circuitry that receives a constraint signal, a feature signal, and a motion signal. The feature signal is in either a first state or a second state. The first state indicates that the driver exhibits a non-neutral emotional state or a non-neutral gaze state. The second state indicates that the driver exhibits neutral emotional and gaze states. The constraint signal indicates a constraint on motion of the vehicle on a section of a road. The motion signal indicates the vehicle's motion. The control circuitry determines, based on the constraint and motion signals, whether the vehicle is violating the constraint. It then relieves the driver from complete control over a vehicle subsystem when the feature signal is in the first state and the vehicle is violating the constraint.
A61B 5/18 - Devices for carrying out aptitude tests for vehicle drivers
G06V 20/58 - Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads
G06V 20/59 - Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
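The decision rule in the abstract above can be sketched as follows. This is only an illustrative sketch: the names FeatureState, Signals, violates_constraint and should_relieve_driver are hypothetical, and the constraint is assumed to be a speed limit, which the abstract does not specify.

```python
from dataclasses import dataclass
from enum import Enum


class FeatureState(Enum):
    NON_NEUTRAL = 1  # first state: non-neutral emotional state or gaze state
    NEUTRAL = 2      # second state: neutral emotional and gaze states


@dataclass
class Signals:
    feature: FeatureState
    speed_limit_kph: float    # assumed form of the constraint signal
    vehicle_speed_kph: float  # assumed form of the motion signal


def violates_constraint(s: Signals) -> bool:
    """Compare the motion signal against the constraint signal."""
    return s.vehicle_speed_kph > s.speed_limit_kph


def should_relieve_driver(s: Signals) -> bool:
    """True only when the feature signal is in the first state
    and the vehicle is violating the constraint."""
    return s.feature is FeatureState.NON_NEUTRAL and violates_constraint(s)
```

Under these assumptions, a distracted driver travelling at 70 km/h in a 50 km/h section would trigger intervention, while a neutral driver at the same speed would not.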
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods and services
(1) Downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; downloadable artificial intelligence software for enabling user interaction with vehicles; downloadable computer software for understanding user preferences; downloadable computer software for speech recognition and natural language understanding; downloadable computer software for gaze and gesture detection in or associated with vehicles; downloadable computer software for authentication and identification of individuals; downloadable computer software for reading and translating handwriting and converting text into speech; downloadable computer software for speech signal enhancement; downloadable computer software and downloadable computer software platforms for connecting vehicles with one or more computing devices; downloadable computer software for connecting, operating, and managing networked vehicles software for vehicle navigation; downloadable computer software for vehicle operation, control and user interaction with vehicles; downloadable computer software for use in the operation and control of autonomous-driving vehicles. 
(1) Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring software using artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker; software as a service (SaaS) services featuring software applications for the deployment of conversational Artificial Intelligence (AI) technology; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software applications for understanding user preferences; software as a service (SaaS) services featuring software applications for 
speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech signal enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles; software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications consisting of large language model applications; providing temporary use of online non-downloadable software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the nature of machine learning applications consisting of large language model applications; platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software including 
machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the nature of large language model responses; platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model applications; software as a service (SAAS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud.
29.
ARTIFICIALLY INTELLIGENT COMPANION FOR ENGAGING WITH A PERSON IN A MOTOR VEHICLE
An artificially intelligent companion engages in a conversation with a person in a vehicle using a chatbot and a prompter that provides a prompt to the chatbot. A context source provides context to the chatbot.
G10L 15/183 - Speech classification or search using natural language modelling using context dependencies, e.g. language models
G06V 20/59 - Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
G10L 15/18 - Speech classification or search using natural language modelling
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use for comparison or discrimination for estimating an emotional state
30.
Multimodal sensor fusion for building virtual gamepad for in-car games
A vehicle gaming system for a vehicle includes one or more computing devices configured to obtain body gesture data indicative of one or more recognized poses of a user of a video game; obtain speech data indicative of one or more recognized words spoken by the user in a passenger cabin of the vehicle; generate a multimodal gaming input by combining the body gesture data and the speech data; identify one or more game commands for the video game being played from a game command datastore based on the multimodal gaming input; and transmit the one or more game commands to a remote gaming server to obtain gaming codes associated with the one or more game commands to cause the video game to perform an action associated with the one or more game commands.
A63F 13/213 - Input arrangements for video game devices characterised by their sensors, purposes or types comprising photodetecting means, e.g. cameras, photodiodes or infrared cells
A63F 13/215 - Input arrangements for video game devices characterised by their sensors, purposes or types comprising means for detecting acoustic signals, e.g. using a microphone
A63F 13/803 - Driving vehicles or craft, e.g. cars, airplanes, ships, robots or tanks
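The combination of gesture and speech data described in the abstract above might look like the following minimal sketch. The datastore contents, the pose and word labels, and the function names are all hypothetical, and the step that transmits commands to a remote gaming server is deliberately left out.

```python
# Hypothetical game command datastore keyed by (recognized pose, spoken word).
GAME_COMMANDS = {
    ("lean_left", "turn"): "STEER_LEFT",
    ("lean_right", "turn"): "STEER_RIGHT",
    ("raise_hand", "boost"): "NITRO",
}


def build_multimodal_input(poses, words):
    """Combine body gesture data and speech data into (pose, word) pairs."""
    return [(pose, word) for pose in poses for word in words]


def identify_commands(poses, words):
    """Look up each multimodal pair in the datastore, keeping first-seen order."""
    commands = []
    for key in build_multimodal_input(poses, words):
        command = GAME_COMMANDS.get(key)
        if command is not None and command not in commands:
            commands.append(command)
    return commands
```

Pairing every pose with every word is the simplest possible fusion rule; a real system would presumably also align the two modalities in time.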
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods and services
Downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; downloadable artificial intelligence software for enabling user interaction with vehicles; downloadable computer software for understanding user preferences; downloadable computer software for speech recognition and natural language understanding; downloadable computer software for gaze and gesture detection in or associated with vehicles; downloadable computer software for authentication and identification of individuals; downloadable computer software for reading and translating handwriting and converting text into speech; downloadable computer software for speech signal enhancement; downloadable computer software and downloadable computer software platforms for connecting vehicles with one or more computing devices; downloadable computer software for connecting, operating, and managing networked vehicles software for vehicle navigation; downloadable computer software for vehicle operation, control and user interaction with vehicles; downloadable computer software for use in the operation and control of autonomous-driving vehicles Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring software using 
artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker; software as a service (SaaS) services featuring software applications for the deployment of conversational Artificial Intelligence (AI) technology; Software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software applications for understanding user preferences; software as a service (SaaS) services featuring software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech 
signal enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles; Software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications consisting of large language model applications; providing temporary use of online non-downloadable software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the nature of machine learning applications consisting of large language model applications; Platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software including machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the nature of large language model responses; Platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model 
applications; Software as a service (SAAS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud
32.
Adaptation and training of neural speech synthesis
Disclosed are systems, methods and other implementations for speech generation, including a method that includes obtaining a speech sample for a target speaker, processing, using a trained encoder, the speech sample to produce a parametric representation of the speech sample for the target speaker, receiving configuration data for a speech synthesis system that accepts as an input the parametric representation, and adapting the configuration data for the speech synthesis system according to an input comprising the parametric representation, and a time-domain representation for the speech sample, to generate adapted configuration data for the speech synthesis system. The method further includes causing configuration of the speech synthesis system according to the adapted configuration data, with the speech synthesis system being implemented to generate synthesized speech output data with estimated voice and time-domain speech characteristics approximating actual voice and time-domain speech characteristics for the target speaker.
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of the groups, characterised by the type of extracted parameters, the extracted parameters being spectral information of each sub-band
G10L 25/60 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
33.
Selective Disregard of Speech by an Automotive Assistant
A method comprising causing an automotive assistant in a vehicle to disregard an utterance made by an occupant of that vehicle based on a sightline of the occupant.
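One way to picture a sightline-based rule is sketched below. The abstract does not say how the sightline is used, so the rule here is an assumption made purely for illustration: the utterance is disregarded when the occupant's sightline points at another occupant within a hypothetical angular threshold. All names and the 15-degree default are hypothetical.

```python
import math


def angle_between_deg(a, b):
    """Angle in degrees between two 3-D direction vectors."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("direction vectors must be non-zero")
    cosine = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    # Clamp to guard against floating-point drift outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine))))


def should_disregard(sightline, direction_to_other_occupant, threshold_deg=15.0):
    """Assumed rule: ignore the utterance if the speaker is looking at
    another occupant, i.e. is probably talking to that person."""
    return angle_between_deg(sightline, direction_to_other_occupant) <= threshold_deg
```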
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods and services
(1) Downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; downloadable artificial intelligence software for enabling user interaction with vehicles; downloadable computer software for understanding user preferences; downloadable computer software for speech recognition and natural language understanding; downloadable computer software for gaze and gesture detection in or associated with vehicles; downloadable computer software for authentication and identification of individuals; downloadable computer software for reading and translating handwriting and converting text into speech; downloadable computer software for speech signal enhancement; downloadable computer software and downloadable computer software platforms for connecting vehicles with one or more computing devices; downloadable computer software for connecting, operating, and managing networked vehicles software for vehicle navigation; downloadable computer software for vehicle operation, control and user interaction with vehicles; downloadable computer software for use in the operation and control of autonomous-driving vehicles. 
(1) Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring software using artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker; software as a service (SaaS) services featuring software applications for the deployment of conversational artificial intelligence (AI) technology; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software applications for understanding user preferences; software as a service (SaaS) services featuring software applications for 
speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech signal enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles; software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications consisting of large language model applications; providing temporary use of online non-downloadable software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the nature of machine learning applications consisting of large language model applications; platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software including 
machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the nature of large language model responses; platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model applications; software as a service (SAAS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud.
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods and services
Downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; downloadable artificial intelligence software for enabling user interaction with vehicles; downloadable computer software for understanding user preferences; downloadable computer software for speech recognition and natural language understanding; downloadable computer software for gaze and gesture detection in or associated with vehicles; downloadable computer software for authentication and identification of individuals; downloadable computer software for reading and translating handwriting and converting text into speech; downloadable computer software for speech signal enhancement; downloadable computer software and downloadable computer software platforms for connecting vehicles with one or more computing devices; downloadable computer software for connecting, operating, and managing networked vehicles software for vehicle navigation; downloadable computer software for vehicle operation, control and user interaction with vehicles; downloadable computer software for use in the operation and control of autonomous-driving vehicles Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring software using 
artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker; software as a service (SaaS) services featuring software applications for the deployment of conversational Artificial Intelligence (AI) technology; Software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software applications for understanding user preferences; software as a service (SaaS) services featuring software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech 
signal enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles; Software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications consisting of large language model applications; providing temporary use of online non-downloadable software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the nature of machine learning applications consisting of large language model applications; Platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software including machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the nature of large language model responses; Platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model 
applications; Software as a service (SaaS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods and services
Downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; downloadable computer software and downloadable computer software platforms for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; downloadable artificial intelligence software for enabling user interaction with vehicles; downloadable computer software for understanding user preferences; downloadable computer software for speech recognition and natural language understanding; downloadable computer software for gaze and gesture detection in or associated with vehicles; downloadable computer software for authentication and identification of individuals; downloadable computer software for reading and translating handwriting and converting text into speech; downloadable computer software for speech signal enhancement; downloadable computer software and downloadable computer software platforms for connecting vehicles with one or more computing devices; downloadable computer software for connecting, operating, and managing networked vehicles; downloadable computer software for vehicle navigation; downloadable computer software for vehicle operation, control and user interaction with vehicles; downloadable computer software for use in the operation and control of autonomous-driving vehicles. Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring software using 
artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker; software as a service (SaaS) services featuring software applications for the deployment of conversational Artificial Intelligence (AI) technology; Software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software applications for understanding user preferences; software as a service (SaaS) services featuring software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech 
signal enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles; Software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications consisting of large language model applications; providing temporary use of online non-downloadable software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the nature of machine learning applications consisting of large language model applications; Platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software including machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the nature of large language model responses; Platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model 
applications; Software as a service (SaaS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud
A method for providing a driver with a warning includes determining that a vehicle is being operated in a manner that fails to comply with a constraint imposed on motion of vehicles on a section of a road, determining that a driver of the vehicle is gazing in a non-neutral direction, selecting a level of obtrusiveness for a warning message based on that determination, and outputting the warning message at that level.
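The selection step can be illustrated with a minimal sketch. The `Gaze` enum, the level names, and the rule that a non-neutral gaze calls for a more obtrusive warning are assumptions made for this example; the abstract does not specify them.

```python
from enum import Enum
from typing import Optional


class Gaze(Enum):
    NEUTRAL = "neutral"          # assumed to mean "looking at the road ahead"
    NON_NEUTRAL = "non_neutral"  # looking anywhere else


def select_obtrusiveness(violates_constraint: bool, gaze: Gaze) -> Optional[str]:
    """Return a hypothetical warning level, or None when no warning is needed."""
    if not violates_constraint:
        return None
    # Assumption: a driver who is looking away needs a more obtrusive warning.
    return "audible_alert" if gaze is Gaze.NON_NEUTRAL else "visual_hint"


print(select_obtrusiveness(True, Gaze.NON_NEUTRAL))  # audible_alert
```

A real system would derive both inputs from sensors (for example map data and a driver-facing camera); here they are passed in directly to keep the decision logic visible.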
A method for applying a watermark signal to a speech signal to prevent unauthorized use of speech signals may include receiving an original speech signal; determining a corresponding spectrogram of the original speech signal; selecting a phase sequence of fixed frame length and uniform distribution; and generating an encoded watermark signal based on the corresponding spectrogram and phase sequence.
A method for providing a driver with a warning includes determining that a vehicle is being operated in a manner that fails to comply with a constraint imposed on motion of vehicles on a section of a road, determining that a driver of the vehicle exhibits a non-neutral sentiment, selecting a level of obtrusiveness for a warning message based on that determination, and outputting the warning message at that level.
B60W 50/14 - Means for informing the driver, warning the driver or prompting a driver intervention
A61B 5/18 - Devices for carrying out ability tests for vehicle drivers
B60W 40/08 - Estimation or calculation of driving parameters for road vehicle drive control systems not related to the control of a particular sub-unit related to drivers or passengers
G06V 20/59 - Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for estimating an emotional state
A method includes customizing interaction of a person with a vehicle from a fleet of vehicles by receiving, from an access controller in the vehicle, biometric data, the biometric data having been acquired from the person; using the biometric data, retrieving a profile for the person; and providing the profile to the vehicle. This enables the vehicle to transition into, or be reconfigured into, a state that enables the person to interact with the vehicle in a manner based on the profile.
42 - Scientific, technological and industrial services, research and design
Goods and services
Providing temporary use of non-downloadable computer
software for operating voice recognition and voice-activated
personal assistance programs; providing temporary use of
non-downloadable computer software for enabling hands-free
operation of computing devices using voice activation and
voice recognition; software as a service (SaaS) services
featuring artificial intelligence technology that enables
users to use a voice activated virtual assistant; software
as a service (SaaS) services featuring artificial
intelligence technology, namely, a digital assistant
featuring speech recognition software; software as a service
(SaaS) services featuring software applications for computer
understanding, recognition, and processing of natural
language; software as a service (SaaS) services featuring
software applications for programming and controlling
communication with voice assistants, drive-assistants, and
smart assistants; software as a service (SaaS) services
featuring software applications for recognizing,
authenticating, and verifying the identity of a speaker;
software as a service (SaaS) services featuring software
applications for the deployment of conversational Artificial
Intelligence (AI) technology; Software as a service (SaaS)
services featuring software applications for use in and with
mobile devices and vehicles for enabling operation, control
and performance of vehicle systems; software as a service
(SaaS) services featuring software applications for use in
and with mobile devices and vehicles for enabling operation
and control of mobile device and vehicle functions based on
user commands; software as a service (SaaS) services
featuring software applications including artificial
intelligence software for enabling user interaction with
vehicles; software as a service (SaaS) services featuring
software applications for understanding user preferences;
software as a service (SaaS) services featuring software
applications for speech recognition and natural language
understanding; software as a service (SaaS) services
featuring software applications for gaze and gesture
detection in or associated with vehicles; software as a
service (SaaS) services featuring software applications for
authentication and identification of individuals; software
as a service (SaaS) services featuring software applications
for reading and translating handwriting and converting text
into speech; software as a service (SaaS) services featuring
software applications for speech signal enhancement;
software as a service (SaaS) services featuring software
applications for connecting vehicles with one or more
computing devices; software as a service (SaaS) services
featuring software applications for connecting, operating,
and managing networked vehicles; software as a service
(SaaS) services featuring software applications for vehicle
navigation; software as a service (SaaS) services featuring
software applications for vehicle operation, control and
user interaction with vehicles; software as a service (SaaS)
services featuring software applications for use in the
operation and control of autonomous-driving vehicles;
software as a service (SaaS) services for developing,
deploying, maintaining, administering, managing, training,
validating, configuring, monitoring, using, querying, and
auditing artificial intelligence software including machine
learning applications consisting of large language model
applications; providing temporary use of online
non-downloadable software for developing, deploying,
maintaining, administering, managing, training, validating,
configuring, monitoring, using, querying, and auditing
artificial intelligence software in the nature of machine
learning applications consisting of large language model
applications; platform as a service (PaaS) featuring
computer software for enabling secure transmission of
digital information and data to and from artificial
intelligence software including machine learning models
consisting of large language models, for accelerating
training and development of large language models, for
administering large language models, for integrating with
other software systems to share data, and for improving the
quality of machine learning model in the nature of large
language model responses; platform as a service (PaaS)
featuring computer software for developing, deploying,
maintaining, managing, training, validating, configuring,
monitoring, querying, and auditing large language model
applications; software as a service (SaaS) services
featuring software for enterprise-grade, large-language
model hosting and fine-tuning on the cloud.
A method for providing communication between an intravehicular conferee who is in a vehicle and first and second extravehicular conferees who are outside the vehicle includes causing speech by the first extravehicular conferee to originate from a first zone in the vehicle and causing speech by the second extravehicular conferee to originate from a second zone in the vehicle. The first and second zones are volumes of space in a cabin of the vehicle.
A method includes providing interaction packages for consumption by an application that engages in speech interaction with a human client in an environment. The interaction packages include a speaker event and a scene event, both of which have been tagged with timing information. The method includes continuously listening to the environment to obtain a stream of audio data, partitioning it into audio segments and using those audio segments to obtain the scene events and the speaker events for the interaction packages.
A tunable zone detection approach makes use of multiple microphones in a fixed configuration in an environment. There are multiple zones in the environment and one or more predetermined positions in each zone. Predetermined transfer functions between the positions and the microphones are used to determine beamformed energies for each of the positions based on received microphone signals. These beamformed energies may be computed using normalization of correlations between microphones. The beamformed energies are processed using a tunable transformation to determine whether an acoustic source is in a particular zone, thereby enabling adjustment of the detection approach to situations including variation in acoustics of the environment.
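The zone decision can be sketched for a single frequency bin as follows. This is a minimal illustration under stated assumptions: the normalization shown (a coherence-like ratio), the softmax used as the tunable transformation, and the names `gain` and `threshold` are all choices made for the example, not details taken from the abstract.

```python
import math


def beamformed_energy(h, x):
    """Normalized energy of microphone snapshot x steered by transfer vector h."""
    inner = sum(hk.conjugate() * xk for hk, xk in zip(h, x))
    norm = sum(abs(hk) ** 2 for hk in h) * sum(abs(xk) ** 2 for xk in x)
    return abs(inner) ** 2 / norm if norm > 0 else 0.0


def detect_zone(transfer, x, gain=10.0, threshold=0.8):
    """Return the zone judged to contain the source, or None if undecided.

    transfer maps each zone to the transfer vectors of its predetermined
    positions. gain and threshold are the (hypothetical) tuning knobs.
    """
    # Best beamformed energy over the positions in each zone.
    energies = {
        zone: max(beamformed_energy(h, x) for h in vectors)
        for zone, vectors in transfer.items()
    }
    # Tunable transformation: softmax with adjustable sharpness.
    weights = {zone: math.exp(gain * e) for zone, e in energies.items()}
    total = sum(weights.values())
    best = max(weights, key=weights.get)
    return best if weights[best] / total >= threshold else None


# Two microphones, one position per zone (made-up transfer functions).
transfer = {"driver": [[1 + 0j, 1j]], "passenger": [[1 + 0j, -1j]]}
print(detect_zone(transfer, [2 + 0j, 2j]))  # driver
```

Raising `gain` makes the decision sharper, while raising `threshold` makes the detector abstain more often, which is one way such a scheme could be adapted to cabins with different acoustics.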
A method includes receiving a representation of a spoken utterance, processing the representation of the spoken utterance to identify, from a number of candidate domains, a request and a serving domain, and routing the request to a personal assistant based on the request and the serving domain. Identification of the serving domain is based on one or more of a contextual state, a behavior profile of a speaker of the utterance, and a semantic content of the utterance.
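A toy version of the routing step is sketched below. The keyword sets, the scoring weights, and the assistant names are hypothetical; the abstract only says that the serving domain is identified from contextual state, a behavior profile, and semantic content.

```python
def pick_serving_domain(utterance, candidates, context_domain=None, profile_prior=None):
    """Score each candidate domain and return the best one."""
    profile_prior = profile_prior or {}
    words = set(utterance.lower().split())
    scores = {}
    for domain, keywords in candidates.items():
        score = float(len(words & keywords))          # semantic content
        if domain == context_domain:
            score += 0.5                              # contextual state
        score += profile_prior.get(domain, 0.0)       # behavior profile
        scores[domain] = score
    return max(scores, key=scores.get)


def route(utterance, candidates, assistants, **signals):
    """Return the assistant that should serve the request, plus the domain."""
    domain = pick_serving_domain(utterance, candidates, **signals)
    return assistants[domain], domain


candidates = {
    "navigation": {"navigate", "route", "directions"},
    "media": {"play", "song", "music"},
}
assistants = {"navigation": "nav_assistant", "media": "media_assistant"}

print(route("play my favorite song", candidates, assistants))
```

A production system would replace the keyword overlap with a trained classifier; the point here is only that the three signals combine into one score per domain.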
Each vehicle in a fleet has an automotive assistant and an external speech interface. A person who is authorized to interact with an automotive assistant of any vehicle from that fleet is detected as being outside a vehicle from that fleet. After having determined that the person is indeed authorized, an automotive assistant in that vehicle communicates with that person. It does so either by receiving an utterance by that person or by transmitting an utterance to that person.
A biometric authenticator for use by an application executing on a vehicle's infotainment system carries out a biometric authentication procedure that is tailored to dynamically varying context information that is obtained by vehicle sensors.
A method includes receiving information indicative of a location of a vehicle. The vehicle has an occupied-vehicle state that includes an occupant state. This occupant state represents the state of one or more occupants within the vehicle. The method further includes receiving information indicative of this occupant state. The information indicative of the occupant state results from an observation by a detector that is in communication with an infotainment system within the vehicle. The method continues with using both the information indicative of the occupant state and the information indicative of the location to select an advertisement from a database of advertisements. This selected advertisement is one that is ultimately for presentation to the occupant.
A contextual answering system for processing a user spoken utterance and providing a response to the user spoken utterance may include a vehicle head unit configured to receive microphone signals indicative of a user utterance; and a processor programmed to receive data indicative of a vehicle state, receive the user spoken utterance, perform semantic analysis on the user spoken utterance based at least in part on a context of the user spoken utterance and vehicle state, select a knowledge base as a source for information regarding the user spoken utterance based on the semantic analysis; and provide a response to the user spoken utterance from the selected knowledge base to the vehicle head unit.
42 - Scientific, technological and industrial services, research and design
Goods and services
(1) Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services featuring artificial intelligence technology, namely, a digital assistant featuring speech recognition software; software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of natural language; software as a service (SaaS) services featuring software applications for use in and with mobile devices and motor vehicles for programming and controlling communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring software applications for recognizing, authenticating, and verifying the identity of a speaker consisting of large language model applications; software as a service (SaaS) services featuring software applications for use in and with mobile devices and motor vehicles for the deployment of conversational artificial intelligence (AI) technology; software as a service (SaaS) services featuring software applications for use in and with mobile devices and motor vehicles for enabling operation, control and performance of motor vehicle systems; software as a service (SaaS) services featuring software applications for use in and with mobile devices and motor vehicles for enabling operation and control of mobile device and motor vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including artificial intelligence software for enabling user interaction with motor vehicles; software as a service 
(SaaS) services featuring software applications for understanding user preferences, namely, computer software that enables users to connect remotely with their vehicles to access vehicle information and control functions consisting of large language model applications, namely, software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for gaze and gesture detection in and associated with motor vehicles; software as a service (SaaS) services featuring software applications for authentication and identification of individuals consisting of large language model applications, namely, software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software applications for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for speech signal enhancement, namely, for reading and translating handwriting and converting text into speech; software as a service (SaaS) services featuring software applications for connecting motor vehicles with one or more computing devices in the nature of computers, laptop computers, tablet computers, smart phones, mobile phones and smart watches; software as a service (SaaS) services featuring software applications for connecting, operating, and managing networked motor vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a service (SaaS) services featuring software applications for motor vehicle operation, control and user interaction with vehicles; software as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving motor vehicles; software as a service 
(SaaS) services for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the field of machine learning applications consisting of large language model applications, namely, software applications for speech recognition and natural language understanding; providing temporary use of online non-downloadable computer software for developing, deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing artificial intelligence software in the field of machine learning applications consisting of large language model applications, namely, software applications for speech recognition and natural language understanding; platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and from artificial intelligence software in field of machine learning models consisting of large language models, for accelerating training and development of large language models, for administering large language models, for integrating with other software systems to share data, and for improving the quality of machine learning model in the field of large language model responses, namely, software applications for speech recognition and natural language understanding; platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating, configuring, monitoring, querying, and auditing large language model applications, namely, software applications for speech recognition and natural language understanding; software as a service (SaaS) services featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud, namely, software applications for speech recognition and natural language understanding.
51.
Visual Platforms for Configuring Audio Processing Operations
Disclosed are systems, methods, and other implementations, including a method for controlling a configurable audio processor, coupled via a plurality of transducers (such as the microphones 220A-C and/or the loudspeakers 224A-B of FIG. 2) to an acoustic environment, that includes determining a three-dimensional spatial variation in the acoustic environment of a processing characteristic of the audio processor based on configuration values for the audio processor, forming a three-dimensional image of the three-dimensional spatial variation of the processing characteristic, and providing the three-dimensional image for presentation to a user for controlling the configuration values.
A hybrid noise-reducer provides an output audio signal by carrying out noise reduction on an input audio signal over a desired range of frequencies. The desired range of frequencies consists of the union of a base range of frequencies and a remainder range of frequencies. The noise reducer includes first and second noise-reduction paths of different types. The first noise-reduction path relies on a dynamic neural network that has been trained using the base range of frequencies. The second noise-reduction path relies on a noise estimation module that uses an estimate of signal-to-noise ratio to identify noise within the remainder range.
G10L 21/0232 - Processing in the frequency domain
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
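The two-path structure of the hybrid noise-reducer described above can be sketched per frequency bin. This is an illustration only: `nn_gain` is a stand-in callable for the trained neural network, and the Wiener-style gain used on the remainder range is an assumed choice of SNR-based rule, not one named in the abstract.

```python
def snr_gain(power, noise_power):
    """Wiener-style gain from an estimated signal-to-noise ratio."""
    if noise_power <= 0:
        return 1.0
    snr = max(power / noise_power - 1.0, 0.0)
    return snr / (1.0 + snr)


def hybrid_noise_reduce(spectrum, noise_power, base_bins, nn_gain):
    """Apply the neural path on base_bins and the SNR path on all other bins."""
    output = []
    for k, value in enumerate(spectrum):
        if k in base_bins:
            gain = nn_gain(k, abs(value))                     # first path
        else:
            gain = snr_gain(abs(value) ** 2, noise_power[k])  # second path
        output.append(gain * value)
    return output


# Bins 0-1 form the base range; bins 2-3 form the remainder range.
spectrum = [1 + 0j, 2 + 0j, 2 + 0j, 0.5 + 0j]
noise_power = [1.0, 1.0, 1.0, 1.0]
placeholder_network = lambda k, magnitude: 0.5  # hypothetical model output

print(hybrid_noise_reduce(spectrum, noise_power, range(0, 2), placeholder_network))
```

Splitting by bin index keeps the two paths independent, so the network only ever sees the frequency range it was trained on.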
53.
Acoustic interference suppression through speaker-aware processing
Disclosed are systems, methods, and other implementations for acoustic interference suppression, including a method that includes obtaining a multi-source sound signal sample combining multiple sound components from a plurality of sound sources in a sound environment, with the plurality of sound sources including one or more interfering sound sources produced by one or more loudspeakers in the sound environment, determining interfering sound characteristics for one or more sound signals that correspond to the one or more interfering sound sources, and suppressing at least one of the multiple sound components associated with the determined interfering sound characteristics for at least one of the one or more sound signals.
A computer-implemented Karaoke system, which may be deployed in a vehicle for use by a driver and/or one or more passengers of the vehicle, adjusts relevant settings depending on the properties of the song, for instance as automatically determined by analysis of the audio signal of a song. In some examples, the system may dynamically remix original vocals or user-provided vocals depending on whether the user is singing.
A system interacts with an audio stream to obtain lyric information, control playback of the audio stream, and control aspects of the audio stream. In some instances, end users can request that the audio stream play with or without a lead vocal track. Obtaining lyric information includes receiving, via a text-to-speech module, an audio playback of the lyric information.
B60K 35/10 - Input arrangements, i.e. from user to vehicle, associated with vehicle functions or specially adapted therefor
B60K 35/26 - Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor using acoustic output
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 25/54 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for retrieval
Disclosed are systems, methods, and other implementations for noise suppression, including a method that includes obtaining a sound signal sample, determining a noise reduction profile, from a plurality of noise reduction profiles, for processing the obtained sound signal sample, and processing the sound signal sample with a machine learning system to produce a noise suppressed signal. The machine learning system implements (executes) a single machine learning model trained to controllably suppress noise in input sound signals according to the plurality of noise reduction profiles. The processing of the sound signal sample is performed according to the determined noise reduction profile.
G10L 21/0224 - Processing in the time domain
G10L 25/03 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
G10L 21/0364 - Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
H03G 9/02 - Combinations of two or more types of control, e.g. gain control and tone control in untuned amplifiers
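The analyzer and compressor of the deesser described above can be sketched for one time frame. The `sharpness` function here is a crude proxy (a normalized, frequency-weighted centroid of the spectral envelope), not the psychoacoustic sharpness model itself, and `threshold`, `max_cut`, and `sibilant_start` are hypothetical tuning parameters.

```python
def sharpness(envelope):
    """Crude sharpness proxy in [0, 1]: centroid of the band energies."""
    total = sum(envelope)
    if total <= 0 or len(envelope) < 2:
        return 0.0
    top = len(envelope) - 1
    return sum((k / top) * e for k, e in enumerate(envelope)) / total


def deesser_weights(envelope, threshold=0.6, max_cut=0.5, sibilant_start=0.5):
    """Return one spectral weight per band for a single time frame."""
    n = len(envelope)
    s = sharpness(envelope)
    if s <= threshold:
        return [1.0] * n  # no sibilant detected: leave the frame untouched
    # Attenuation grows with how far the sharpness exceeds the threshold.
    cut = (s - threshold) / (1.0 - threshold) * max_cut
    top = n - 1
    return [1.0 - cut if k / top >= sibilant_start else 1.0 for k in range(n)]


vowel_like = [4.0, 3.0, 1.0, 0.5, 0.2]     # energy concentrated in low bands
sibilant_like = [0.2, 0.3, 1.0, 3.0, 4.0]  # energy concentrated in high bands

print(deesser_weights(vowel_like))
print(deesser_weights(sibilant_like))
```

Multiplying each band of the frame by its weight gives the spectrally weighted output; only frames judged sharp are modified, and only in their upper bands.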
A voice-based system is configured to process commands in a flexible format, for example, in which a wake word does not necessarily have to occur at the beginning of an utterance. As in natural speech, the system being addressed may be named within or at the end of a spoken utterance rather than at the beginning, or depending on the context, may not be named at all.
G10L 15/197 - Probabilistic grammars, e.g. word n-grams
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
A method for applying a watermark signal to a speech signal to prevent unauthorized use of speech signals may include receiving an original speech signal; determining a corresponding spectrogram of the original speech signal; selecting a phase sequence of fixed frame length and uniform distribution; and generating an encoded watermark signal based on the corresponding spectrogram and phase sequence.
42 - Scientific, technological and industrial services, research and design
Goods and services
Providing temporary use of non-downloadable computer software for operating voice recognition and voice-activated personal
assistance programs; providing temporary use of non-downloadable computer software for enabling hands-free operation of
computing devices using voice activation and voice recognition; software as a service (SaaS) services featuring software using
artificial intelligence technology that enables users to use a voice activated virtual assistant; software as a service (SaaS) services
featuring software using artificial intelligence technology, namely, a digital assistant featuring speech recognition software;
software as a service (SaaS) services featuring software applications for computer understanding, recognition, and processing of
natural language; software as a service (SaaS) services featuring software applications for programming and controlling
communication with voice assistants, drive-assistants, and smart assistants; software as a service (SaaS) services featuring
software applications for recognizing, authenticating, and verifying the identity of a speaker consisting of large language model
applications; software as a service (SaaS) services featuring software applications for the deployment of conversational Artificial
Intelligence (AI) technology; Software as a service (SaaS) services featuring software applications for use in and with mobile
devices and vehicles for enabling operation, control and performance of vehicle systems; software as a service (SaaS) services
featuring software applications for use in and with mobile devices and vehicles for enabling operation and control of mobile device
and vehicle functions based on user commands; software as a service (SaaS) services featuring software applications including
artificial intelligence software for enabling user interaction with vehicles; software as a service (SaaS) services featuring software
applications for understanding user preferences consisting of large language model applications; software as a service (SaaS)
services featuring software applications for speech recognition and natural language understanding; software as a service (SaaS)
services featuring software applications for gaze and gesture detection in or associated with vehicles; software as a service
(SaaS) services featuring software applications for authentication and identification of individuals consisting of large language
model applications; software as a service (SaaS) services featuring software applications for reading and translating handwriting
and converting text into speech; software as a service (SaaS) services featuring software applications for speech signal
enhancement; software as a service (SaaS) services featuring software applications for connecting vehicles with one or more
computing devices; software as a service (SaaS) services featuring software applications for connecting, operating, and managing
networked vehicles; software as a service (SaaS) services featuring software applications for vehicle navigation; software as a
service (SaaS) services featuring software applications for vehicle operation, control and user interaction with vehicles; software
as a service (SaaS) services featuring software applications for use in the operation and control of autonomous-driving vehicles;
Software as a service (SaaS) services for developing, deploying, maintaining, administering, managing, training, validating,
configuring, monitoring, using, querying, and auditing artificial intelligence software including machine learning applications
consisting of large language model applications; providing temporary use of online non-downloadable software for developing,
deploying, maintaining, administering, managing, training, validating, configuring, monitoring, using, querying, and auditing
artificial intelligence software in the nature of machine learning applications consisting of large language model applications;
Platform as a service (PaaS) featuring computer software for enabling secure transmission of digital information and data to and
from artificial intelligence software including machine learning models consisting of large language models, for accelerating
training and development of large language models, for administering large language models, for integrating with other software
systems to share data, and for improving the quality of machine learning model in the nature of large language model responses;
Platform as a service (PaaS) featuring computer software for developing, deploying, maintaining, managing, training, validating,
configuring, monitoring, querying, and auditing large language model applications; Software as a service (SAAS) services
featuring software for enterprise-grade, large-language model hosting and fine-tuning on the cloud
62.
COLLABORATION BETWEEN A RECOMMENDATION ENGINE AND A VOICE ASSISTANT
A method comprising causing a voice assistant and a recommendation engine that are executing in an infotainment system of a vehicle to cooperate in processing a vehicle occupant's acceptance of a recommendation proposed by the recommendation engine by having an interface to enable the recommendation engine to provide recommendation context to the voice assistant to enable the voice assistant to resolve an ambiguity in the occupant's acceptance of the recommendation.
B60R 16/037 - Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for, electric, for occupant comfort
42 - Scientific, technological and industrial services, research and design
Goods and services
Providing temporary use of non-downloadable computer
software for operating voice recognition and voice-activated
personal assistance programs; providing temporary use of
non-downloadable computer software for enabling hands-free
operation of computing devices using voice activation and
voice recognition; software as a service (SaaS) services
featuring artificial intelligence technology that enables
users to use a voice activated virtual assistant; software
as a service (SaaS) services featuring artificial
intelligence technology, namely, a digital assistant
featuring speech recognition software; software as a service
(SaaS) services featuring software applications for computer
understanding, recognition, and processing of natural
language; software as a service (SaaS) services featuring
software applications for programming and controlling
communication with voice assistants, drive-assistants, and
smart assistants; software as a service (SaaS) services
featuring software applications for recognizing,
authenticating, and verifying the identity of a speaker;
software as a service (SaaS) services featuring software
applications for the deployment of conversational Artificial
Intelligence (AI) technology; Software as a service (SaaS)
services featuring software applications for use in and with
mobile devices and vehicles for enabling operation, control
and performance of vehicle systems; software as a service
(SaaS) services featuring software applications for use in
and with mobile devices and vehicles for enabling operation
and control of mobile device and vehicle functions based on
user commands; software as a service (SaaS) services
featuring software applications including artificial
intelligence software for enabling user interaction with
vehicles; software as a service (SaaS) services featuring
software applications for understanding user preferences;
software as a service (SaaS) services featuring software
applications for speech recognition and natural language
understanding; software as a service (SaaS) services
featuring software applications for gaze and gesture
detection in or associated with vehicles; software as a
service (SaaS) services featuring software applications for
authentication and identification of individuals; software
as a service (SaaS) services featuring software applications
for reading and translating handwriting and converting text
into speech; software as a service (SaaS) services featuring
software applications for speech signal enhancement;
software as a service (SaaS) services featuring software
applications for connecting vehicles with one or more
computing devices; software as a service (SaaS) services
featuring software applications for connecting, operating,
and managing networked vehicles; software as a service
(SaaS) services featuring software applications for vehicle
navigation; software as a service (SaaS) services featuring
software applications for vehicle operation, control and
user interaction with vehicles; software as a service (SaaS)
services featuring software applications for use in the
operation and control of autonomous-driving vehicles.
64.
IN-CAR ASSISTIVE AUDIO TECHNOLOGIES FOR USERS WITH HEARING LOSS
A hearing application (162) for a vehicle audio system may include at least one speaker (148) configured to play playback content, and at least one hearing application programmed to receive optimization parameters from a hearing device (124) within the vehicle (104), the optimization parameters including signal processing parameters specific to the hearing device (124), apply the optimization parameters to the playback content, and transmit the playback content for playback by one of the hearing device (124) and/or at least one speaker (148).
A hearing application for a vehicle audio system may include at least one speaker configured to play playback content, and at least one hearing application programmed to receive optimization parameters from a hearing device within the vehicle, the optimization parameters including signal processing parameters specific to the hearing device, apply the optimization parameters to the playback content, and transmit the playback content for playback by one of the hearing device and/or at least one speaker.
A method for managing an interaction between a user and a driver interaction system in a vehicle, the method comprising presenting a first audio output to a user from an output device of the driver interaction system, and, while presenting the first audio output to the user, receiving sensed input at the driver interaction system, processing the sensed input including determining an emotional content of the driver, and controlling the interaction based at least in part on the emotional content of the sensed input.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for estimating an emotional state
G10L 25/78 - Detection of presence or absence of voice signals
G10L 25/18 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters, the extracted parameters being spectral information of each sub-band
G10L 15/26 - Speech to text systems
B60Q 9/00 - Arrangement or adaptation of signal devices not provided for in one of the main groups
A voice assistant system for a vehicle includes a microphone configured to detect an audio signal from a user of the vehicle; a speaker configured to output a dialogue in response to the audio signal; and a processor programmed to, responsive to detecting a conversation in which the user is involved, decrease a lengthiness setting of the voice assistant system to reduce the length of the dialogue, and increase an independency setting of the voice assistant system to prevent a confirmation question from the voice assistant system.
B60Q 9/00 - Arrangement or adaptation of signal devices not provided for in one of the main groups
G10L 17/02 - Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
G10L 17/06 - Decision making techniques; Pattern matching strategies
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for estimating an emotional state
G10L 25/78 - Detection of presence or absence of voice signals
A vehicle includes a cabin, an internal-loudspeaker set, an external-microphone set, and a signal processor that filters a raw audio signal that has been received by the external-microphone set and broadcasts the resulting filtered audio signal into the cabin using the internal-loudspeaker set.
G10K 11/178 - Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
H04N 7/18 - Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
H04N 23/62 - Control of parameters via user interfaces
H04N 23/695 - Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
A voice assistant system for a vehicle includes a microphone configured to detect an audio signal from a user of the vehicle; a speaker configured to output a dialogue in response to the audio signal; and a processor programmed to, responsive to detecting a conversation in which the user is involved, decrease a lengthiness setting of the voice assistant system to reduce the length of the dialogue, and increase an independency setting of the voice assistant system to prevent a confirmation question from the voice assistant system.
A method for managing an interaction between a user and a driver interaction system in a vehicle, the method comprising presenting a first audio output to a user from an output device of the driver interaction system, and, while presenting the first audio output to the user, receiving sensed input at the driver interaction system, processing the sensed input including determining an emotional content of the driver, and controlling the interaction based at least in part on the emotional content of the sensed input.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for estimating an emotional state
A method for synthesizing speech from a textual input includes receiving the textual input, the textual input including native words in a native language and foreign words in a foreign language, and processing the textual input to determine a phonetic representation of the textual input. The processing includes determining a native phonetic representation of the native words, and determining a nativized phonetic representation of the foreign words. Determining the nativized phonetic representation includes forming a foreign phonetic representation of the foreign words using a foreign phoneme set, and mapping the foreign phonetic representation to the nativized phonetic representation according to a model of a native speaker's pronunciation of foreign words.
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
A method for synthesizing speech from a textual input includes receiving the textual input, the textual input including native words in a native language and foreign words in a foreign language, and processing the textual input to determine a phonetic representation of the textual input. The processing includes determining a native phonetic representation of the native words, and determining a nativized phonetic representation of the foreign words. Determining the nativized phonetic representation includes forming a foreign phonetic representation of the foreign words using a foreign phoneme set, and mapping the foreign phonetic representation to the nativized phonetic representation according to a model of a native speaker's pronunciation of foreign words.
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
An interface customized to a detected emotional state of a user is provided. Audio signals are received from at least one microphone, the audio signals being indicative of spoken words, phrases, or commands. A wake-up word (WuW) is detected in the audio signals. An emotion is also detected in the audio signals containing the WuW. An emotion-aware processing system is configured according to the detected emotion. A voice control session is performed using the emotion-aware processing system configured according to the detected emotion.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination for estimating an emotional state
An interface customized to a detected emotional state of a user is provided. Audio signals are received from at least one microphone, the audio signals being indicative of spoken words, phrases, or commands. A wake-up word (WuW) is detected in the audio signals. An emotion is also detected in the audio signals containing the WuW. An emotion-aware processing system is configured according to the detected emotion. A voice control session is performed using the emotion-aware processing system configured according to the detected emotion.
G10L 17/26 - Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the analysis technique using neural networks
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
75.
INTERACTIVE MODIFICATION OF SPEAKING STYLE OF SYNTHESIZED SPEECH
Control over speaking style of a text-to-speech (TTS) system is provided without necessarily requiring that the training of the TTS conversion process (e.g., the ANN used for the conversion) take into account the speaking styles of the training data. For example, the TTS system may allow adjustment of characteristics of speaking styles, such as speed, perceivable degree of "kindness", average pitch, pitch variation, and duration of pauses. In some examples, a voice designer may have a number of independent controls that vary corresponding characteristics without necessarily varying others. Once the designer has configured a desired overall speaking style based on those controllable characteristics, the TTS system can be configured to use that speaking style for deployments of the TTS system. For example, the TTS system may be used for audio output in a voice assistant, for instance, for an in-vehicle voice assistant.
A vehicle control system executing a voice control system for facilitating voice-based dialog with a driver to enable the driver or autonomous vehicle to control certain operational aspects of an autonomous vehicle is provided. Using environmental and sensor input, the vehicle control system can select optimal routes for operating the vehicle in an autonomous mode or choose a preferred operational mode. Occupants of the autonomous vehicle can change a destination, route or driving mode by engaging with the vehicle control system in a dialog enabled by the voice control system.
B60W 50/00 - Details of control systems for road vehicle drive control not related to the control of a particular sub-unit
G05D 1/00 - Control of position, course, altitude, or attitude of land, water, air, or space vehicles, e.g. using automatic pilots
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
77.
VOICE REINFORCEMENT IN MULTIPLE SOUND ZONE ENVIRONMENTS
A microphone signal is received from at least one microphone. Acoustic echo cancellation (AEC) produces an echo cancelled microphone signal using first adaptive filters to estimate and cancel feedback that is a result of the environment. Acoustic feedback cancellation (AFC) produces a processed microphone signal using second adaptive filters to estimate and cancel feedback resulting from application of the reinforced voice signal within the environment. The uttered speech is reinforced in the processed microphone signal to produce the reinforced voice signal. The reinforced voice signal and the audio signal are applied to the loudspeakers. A step size of adjustment of the second adaptive filters may be increased responsive to detection of reverberation in the microphone signal. The reverberation that is used to control the step size of the second adaptive filters may be added artificially. This may provide multiple benefits including improving adjustment of the second adaptive filters and also improving the sound impression of the voice.
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
H04M 9/08 - Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
H04R 3/02 - Circuits for transducers for preventing acoustic reaction
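The step-size control described in the abstract above can be illustrated with a small sketch. The abstract does not name the adaptation algorithm, so the normalized LMS (NLMS) update used here is an assumption, and `NlmsFilter`, `choose_step_size`, `identify`, and the step-size values 0.1 and 0.5 are hypothetical names and numbers chosen only for illustration.

```python
import random


class NlmsFilter:
    """Adaptive FIR filter with a normalized LMS update (assumed algorithm)."""

    def __init__(self, taps, eps=1e-8):
        self.w = [0.0] * taps   # current estimate of the feedback path
        self.x = [0.0] * taps   # most recent input samples, newest first
        self.eps = eps          # guards against division by zero

    def update(self, x_new, desired, mu):
        """Consume one sample, adapt with step size mu, return the error."""
        self.x = [x_new] + self.x[:-1]
        estimate = sum(w * x for w, x in zip(self.w, self.x))
        error = desired - estimate
        norm = sum(x * x for x in self.x) + self.eps
        gain = mu * error / norm
        self.w = [w + gain * x for w, x in zip(self.w, self.x)]
        return error


def choose_step_size(reverb_detected, base=0.1, boosted=0.5):
    """Hypothetical rule: use a larger step size when reverberation is detected."""
    return boosted if reverb_detected else base


def identify(path, mu, n_samples, seed=0):
    """Adapt a filter toward a known, noise-free path and return its weights."""
    rng = random.Random(seed)
    filt = NlmsFilter(len(path))
    history = [0.0] * len(path)
    for _ in range(n_samples):
        x = rng.uniform(-1.0, 1.0)
        history = [x] + history[:-1]
        desired = sum(h * v for h, v in zip(path, history))
        filt.update(x, desired, mu)
    return filt.w
```

For example, `identify([0.5, -0.3, 0.1], choose_step_size(True), 2000)` adapts a three-tap filter toward a made-up feedback path. In a real system the step size would trade adaptation speed against misadjustment in noise, which this noise-free sketch does not model.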
A microphone signal is received from at least one microphone. Acoustic echo cancellation (AEC) produces an echo cancelled microphone signal using first adaptive filters to estimate and cancel feedback that is a result of the environment. Acoustic feedback cancellation (AFC) produces a processed microphone signal using second adaptive filters to estimate and cancel feedback resulting from application of the reinforced voice signal within the environment. The uttered speech is reinforced in the processed microphone signal to produce the reinforced voice signal. The reinforced voice signal and the audio signal are applied to the loudspeakers. A step size of adjustment of the second adaptive filters may be increased responsive to detection of reverberation in the microphone signal. The reverberation that is used to control the step size of the second adaptive filters may be added artificially. This may provide multiple benefits including improving adjustment of the second adaptive filters and also improving the sound impression of the voice.
H04R 3/02 - Circuits for transducers for preventing acoustic reaction
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
A system for interactive and iterative media generation may include loudspeakers configured to play back audio signals into an environment, the audio signals including karaoke content; at least one microphone configured to receive microphone signals indicative of sound in the environment; and a processor programmed to receive a first microphone signal from the at least one microphone, the first microphone signal including a first user sound and karaoke content, instruct the loudspeakers to play back the first microphone signal, receive a second microphone signal from the at least one microphone, the second microphone signal including the first user sound of the first microphone signal and a second user sound, and transmit the second microphone signal, including the first and second microphone signals and the karaoke content, as an instance of iteratively-generated media content.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
H04W 4/46 - Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrians [V2P] communication, for vehicle-to-vehicle communication [V2V]
An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
H04M 9/08 - Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
A vehicle system for classifying a spoken utterance within a vehicle cabin as one of system-directed and non-system directed may include at least one microphone configured to detect at least one audio signal from at least one occupant of a vehicle, and a processor programmed to receive the at least one audio signal including at least one acoustic utterance, determine a number of vehicle occupants based at least in part on the at least one signal, determine a probability that the utterance is system directed based at least in part on the utterance and the number of vehicle occupants, determine a classification threshold based at least in part on the number of vehicle occupants, and compare the classification threshold to the probability to determine whether the at least one acoustic utterance is one of a system directed utterance and a non-system directed utterance.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
G10L 25/48 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use
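The occupancy-dependent threshold in the abstract above can be sketched as follows. The abstract does not say how the threshold depends on the number of occupants, so the linear rule, its constants, and the names `classification_threshold`, `classify`, and `Decision` are assumptions made purely for illustration. The probability is taken as an input here, whereas the abstract also derives it from the utterance and the occupant count.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Decision:
    probability: float      # probability that the utterance is system directed
    threshold: float        # threshold chosen for the current occupancy
    system_directed: bool   # outcome of the comparison


def classification_threshold(num_occupants: int) -> float:
    """Hypothetical rule: demand more confidence as the cabin fills up."""
    if num_occupants < 1:
        raise ValueError("at least one occupant is required")
    # 0.5 for a lone occupant, plus 0.1 per additional occupant, capped at 0.9.
    return min(0.9, 0.5 + 0.1 * (num_occupants - 1))


def classify(probability: float, num_occupants: int) -> Decision:
    """Compare the probability against the occupancy-dependent threshold."""
    threshold = classification_threshold(num_occupants)
    return Decision(probability, threshold, probability >= threshold)
```

Under these assumed constants, the same probability of 0.65 would be accepted as system directed with one occupant and rejected with four, reflecting the idea that speech in a full cabin is more likely to be addressed to another person.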
82.
ADAPTATION AND TRAINING OF NEURAL SPEECH SYNTHESIS
Disclosed are systems, methods and other implementations for speech generation, including a method that includes obtaining a speech sample for a target speaker, processing, using a trained encoder, the speech sample to produce a parametric representation of the speech sample for the target speaker, receiving configuration data for a speech synthesis system that accepts as an input the parametric representation, and adapting the configuration data for the speech synthesis system according to an input comprising the parametric representation, and a time-domain representation for the speech sample, to generate adapted configuration data for the speech synthesis system. The method further includes causing configuration of the speech synthesis system according to the adapted configuration data, with the speech synthesis system being implemented to generate synthesized speech output data with estimated voice and time-domain speech characteristics approximating actual voice and time-domain speech characteristics for the target speaker.
A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
A method for selecting a speech recognition result on a computing device includes receiving a first speech recognition result determined by the computing device, receiving first features, at least some of the features being determined using the first speech recognition result, determining whether to select the first speech recognition result or to wait for a second speech recognition result determined by a cloud computing service based at least in part on the first speech recognition result and the first features.
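The local-versus-cloud selection in the abstract above can be illustrated with a short sketch. The abstract does not specify which features are used or how the decision is made, so the features (word count, network availability), the thresholds, and the names `LocalResult`, `extract_features`, and `select_local` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LocalResult:
    text: str          # transcript produced on the device
    confidence: float  # recognizer confidence in [0, 1]


def extract_features(result: LocalResult, network_available: bool) -> dict:
    """Derive illustrative features, some of them from the local result itself."""
    return {
        "word_count": len(result.text.split()),
        "network_available": network_available,
    }


def select_local(result: LocalResult, features: dict,
                 min_confidence: float = 0.8, max_words: int = 6) -> bool:
    """Return True to use the local result now, False to wait for the cloud."""
    if not features["network_available"]:
        return True  # no cloud result will arrive, so waiting is pointless
    short_enough = features["word_count"] <= max_words
    return result.confidence >= min_confidence and short_enough
```

With these assumed thresholds, a short, confident command such as "call home" would be accepted immediately, while a long free-form request would wait for the second result from the cloud service.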
G08G 1/04 - Detecting movement of traffic to be counted or controlled using optical or ultrasonic detectors
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
G01S 3/801 - Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic, or infrasonic waves; Details
86.
Contextual utterance resolution in multimodal systems
A system and method of responding to a vocal utterance may include capturing and converting the utterance to word(s) using a language processing method, such as natural language processing. The context of the utterance and of the system, which may include multimodal inputs, may be used to determine the meaning and intent of the words.
G10L 15/18 - Speech classification or search using natural language modelling
B60R 16/037 - Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for, electric, for occupant comfort
B60R 25/25 - Means to switch the anti-theft system on or off using biometry
G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
A method for a user device includes receiving a first acoustic input of a user speaking a wake-up word in a target language; providing a first acoustic feature derived from the first acoustic input to an acoustic model stored on the user device to obtain a first sequence of speech units corresponding to the wake-up word spoken by the user in the target language, the acoustic model trained on a corpus of training data in a source language different than the target language; receiving a second acoustic input including the wake-up word in the target language; providing a second acoustic feature derived from the second acoustic input to the acoustic model to obtain a second sequence of speech units corresponding to the wake-up word in the target language; and comparing the first and second sequences of speech units to recognize the wake-up word in the target language.
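The final comparison step in the abstract above, matching two sequences of speech units, can be sketched with an edit distance. The abstract does not state how the sequences are compared, so the Levenshtein distance, the 0.25 tolerance, and the names `edit_distance` and `matches_wake_word` are assumptions. The phoneme-like labels in the usage note are made up.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences of speech-unit labels."""
    previous = list(range(len(b) + 1))
    for i, unit_a in enumerate(a, start=1):
        current = [i]
        for j, unit_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                        # deletion
                current[j - 1] + 1,                     # insertion
                previous[j - 1] + (unit_a != unit_b),   # substitution or match
            ))
        previous = current
    return previous[-1]


def matches_wake_word(enrolled, candidate, max_ratio=0.25):
    """Accept the candidate if it differs from the enrolled sequence only slightly."""
    if not enrolled:
        raise ValueError("enrolled sequence must not be empty")
    return edit_distance(enrolled, candidate) / len(enrolled) <= max_ratio
```

For instance, with an enrolled sequence `["HH", "EH", "L", "OW", "K", "AA", "R"]`, a candidate that differs in one unit is within the assumed tolerance, while an unrelated sequence is not. Because both sequences come from the same source-language acoustic model, they only need to agree with each other, not with a canonical target-language pronunciation.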
A method for managing location-aware reminders in an automobile includes monitoring a geographic location of the automobile using a computer system installed in the vehicle. The computer system detects that the automobile has entered a geographic region associated with a location-aware reminder and issues a reminder message associated with the location-aware reminder to a driver of the automobile based on the detecting.
G08G 1/0968 - Systems involving transmission of navigation instructions to the vehicle
G06Q 10/06 - Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
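The location-aware reminder method in the preceding abstract can be sketched as a geofence check. The abstract does not specify the shape of the geographic region; a circular region tested with the haversine distance, the `Reminder` fields, and the one-shot `delivered` flag are assumptions made for this example.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in meters


@dataclass
class Reminder:
    message: str
    lat: float
    lon: float
    radius_m: float          # hypothetical circular region around (lat, lon)
    delivered: bool = False  # assumption: each reminder is issued once


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def due_reminders(reminders, lat, lon):
    """Return messages for undelivered reminders whose region contains the vehicle."""
    due = []
    for reminder in reminders:
        inside = haversine_m(lat, lon, reminder.lat, reminder.lon) <= reminder.radius_m
        if inside and not reminder.delivered:
            reminder.delivered = True
            due.append(reminder.message)
    return due


# Usage: call on every position update from the in-vehicle computer system.
reminders = [Reminder("Pick up the parcel", 48.0, 11.0, 500.0)]
print(due_reminders(reminders, 48.001, 11.0))
```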
89.
Vehicle avatar devices for interactive virtual assistant
A system and method for providing avatar device status indicators for voice assistants in multi-zone vehicles. The method comprises: receiving at least one signal from a plurality of microphones, wherein each microphone is associated with one of a plurality of spatial zones and with one of a plurality of avatar devices, wherein the at least one signal comprises a speech signal component from a speaker, and wherein the speech signal component is a voice command or question; sending zone information associated with the speaker and with one of the plurality of spatial zones to an avatar; and activating one of the plurality of avatar devices in a respective one of the plurality of spatial zones associated with the speaker.
G10L 25/63 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use, for comparison or discrimination, for estimating an emotional state
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics, for obtaining desired directional characteristic only, by combining a number of identical transducers
H05B 47/12 - Controlling the light source in response to determined parameters by detecting the presence or movement of objects or living beings by detecting audible sound
B60Q 9/00 - Arrangement or adaptation of signal devices not provided for in one of the main groups
G01C 21/36 - Input/output arrangements for on-board computers
90.
Method and apparatus for enhancing a geolocation database
While current voice assistants can respond to voice requests, creating smarter assistants that leverage location, past requests, and user data to enhance responses to future requests and to provide robust data about locations is desirable. A method for enhancing a geolocation database (“database”) associates a user-initiated triggering event with a location in a database by sensing user position and orientation within the vehicle and a position and orientation of the vehicle. The triggering event is detected by sensors arranged within a vehicle with respect to the user. The method determines a point of interest (“POI”) near the location based on the user-initiated triggering event. The method, responsive to the user-initiated triggering event, updates the database based on information related to the user-initiated triggering event at an entry of the database associated with the POI. The database and voice assistants can leverage the enhanced data about the POI for future requests.
A method detects presence of a multi-tone siren type in an acoustic signal. The multi-tone siren type is associated with one or more siren patterns, where each siren pattern includes a number of time patterns at corresponding frequencies. The method includes processing a number of frequency components of a frequency domain representation of the acoustic signal over time to determine a corresponding plurality of values. That processing includes determining, for each frequency component, a value characterizing a presence of a time pattern associated with at least one siren pattern. The method also includes processing the values according to the siren patterns to determine a detection result indicating whether the multi-tone siren type is present in the acoustic signal.
B60W 40/04 - Estimation or calculation of driving parameters for road vehicle drive control systems not related to the control of a particular sub-unit, related to ambient conditions, related to traffic conditions
G06K 9/00 - Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G08G 1/0965 - Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages, responding to signals from another vehicle, e.g. emergency vehicle
B60W 50/14 - Means for informing the driver, warning the driver or prompting a driver intervention
B60W 60/00 - Drive control systems specially adapted for autonomous road vehicles
B60W 30/16 - Control of distance between vehicles, e.g. keeping a distance to preceding vehicle
A hybrid noise-reducer provides an output audio signal by carrying out noise reduction on an input audio signal over a desired range of frequencies. The desired range of frequencies consists of the union of a base range of frequencies and a remainder range of frequencies. The noise reducer includes first and second noise-reduction paths of different types. The first noise-reduction path relies on a dynamic neural network that has been trained using the base range of frequencies. The second noise-reduction path relies on a noise estimation module that uses an estimate of signal-to-noise ratio to identify noise within the remainder range.
G10L 21/0232 - Processing in the frequency domain
G10L 21/0264 - Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
G10L 25/78 - Detection of presence or absence of voice signals
G10L 25/30 - Speech or voice analysis techniques not restricted to a single one of the groups, characterised by the analysis technique, using neural networks
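The two-path structure of the hybrid noise-reducer described above can be sketched as a per-frequency choice of gain. This is only an illustration under stated assumptions: the neural path is replaced by a caller-supplied stand-in function, and the Wiener-style gain `snr / (1 + snr)` for the remainder range is a common textbook choice, not something the abstract specifies.

```python
def wiener_gain(snr):
    """Wiener-style suppression gain from a linear (not dB) SNR estimate."""
    snr = max(snr, 0.0)
    return snr / (1.0 + snr)


def hybrid_gains(freqs_hz, base_range_hz, neural_gain_fn, snr_estimates):
    """Pick a gain per frequency bin from one of two noise-reduction paths.

    freqs_hz       -- center frequency of each bin
    base_range_hz  -- (low, high) range the neural network was trained on
    neural_gain_fn -- hypothetical stand-in for the neural path: frequency -> gain
    snr_estimates  -- per-bin SNR estimates used by the second path
    """
    low, high = base_range_hz
    gains = []
    for freq, snr in zip(freqs_hz, snr_estimates):
        if low <= freq <= high:
            gains.append(neural_gain_fn(freq))  # first path: base range
        else:
            gains.append(wiener_gain(snr))      # second path: remainder range
    return gains


# Usage: a base range up to 8 kHz, with the remainder handled by the SNR-based path.
print(hybrid_gains([1000.0, 4000.0, 12000.0], (0.0, 8000.0),
                   lambda freq: 0.5, [9.0, 9.0, 3.0]))
```

The output signal would then be obtained by multiplying each frequency bin of the input by its gain, so the two paths together cover the whole desired range.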
93.
Methods and apparatus for detecting a voice command
According to some aspects, a method of monitoring an acoustic environment of a mobile device, at least one computer readable medium encoded with instructions that, when executed, perform such a method and/or a mobile device configured to perform such a method is provided. The method comprises receiving acoustic input from the environment of the mobile device while the mobile device is operating in a low power mode, detecting whether the acoustic input includes a voice command based on performing a plurality of processing stages on the acoustic input, wherein at least one of the plurality of processing stages is performed while the mobile device is operating in the low power mode, and using at least one contextual cue to assist in detecting whether the acoustic input includes a voice command.
A voice-based system is configured to process commands in a flexible format, for example, in which a wake word does not necessarily have to occur at the beginning of an utterance. As in natural speech, the system being addressed may be named within or at the end of a spoken utterance rather than at the beginning, or depending on the context, may not be named at all.
G10L 15/197 - Probabilistic grammars, e.g. word n-grams
G10L 15/06 - Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
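The flexible command format described in the abstract above, where the wake word may appear anywhere in the utterance or be absent when context allows, can be sketched on already-transcribed text. The function name, the single-token assistant name, and the `context_implies_addressed` flag are hypothetical simplifications; a real system would work on recognizer output rather than plain strings.

```python
import re


def extract_command(utterance, assistant_name, context_implies_addressed=False):
    """Return the command text if the system is being addressed, else None.

    The assistant's name may occur at the start, middle, or end of the utterance.
    When context already implies that the system is addressed, no name is required.
    """
    tokens = re.findall(r"[\w']+", utterance.lower())
    name = assistant_name.lower()
    named = name in tokens
    if not named and not context_implies_addressed:
        return None
    # Drop the name wherever it occurred and keep the rest as the command.
    return " ".join(token for token in tokens if token != name)


# Usage: "Max" is a made-up assistant name.
print(extract_command("Turn on the radio, Max", "Max"))
```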
95.
Infotainment system having awareness of local dynamic features
A vehicle and many dynamic features move relative to the same reference frame. An infotainment system responds to a request from an occupant of a vehicle to provide information concerning a particular dynamic feature. The occupant provides the infotainment system with information concerning a bearing to the dynamic feature and the infotainment system identifies the dynamic feature in response.
B60K 35/10 - Input arrangements, i.e. from user to vehicle, associated with vehicle functions or specially adapted therefor
B60K 35/00 - Instruments specially adapted for vehicles; Arrangement of instruments in or on vehicles
B60K 35/28 - Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor, characterised by the type of the output information, e.g. video entertainment or vehicle dynamics information; Output arrangements, i.e. from vehicle to user, associated with vehicle functions or specially adapted therefor, characterised by the purpose of the output information, e.g. for attracting the attention of the driver
B60K 35/85 - Arrangements for transferring vehicle- or driver-related data
G06F 3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
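The bearing-based identification in the abstract above can be sketched as follows. The abstract does not describe the matching rule; this example assumes a flat local coordinate frame (x east, y north, in meters), a bearing given relative to the vehicle's heading, and a hypothetical angular tolerance.

```python
import math


def bearing_deg(from_xy, to_xy):
    """Compass bearing (0 = north, 90 = east) from one point to another."""
    dx = to_xy[0] - from_xy[0]  # east offset
    dy = to_xy[1] - from_xy[1]  # north offset
    return math.degrees(math.atan2(dx, dy)) % 360.0


def angular_difference(a, b):
    """Smallest absolute difference between two angles, in degrees."""
    return abs((a - b + 180.0) % 360.0 - 180.0)


def identify_feature(vehicle_xy, vehicle_heading_deg, relative_bearing_deg,
                     features, tolerance_deg=20.0):
    """Return the name of the dynamic feature closest to the indicated bearing.

    features maps a feature name to its current (x, y) position; returns None
    if no feature lies within the tolerance (a hypothetical parameter).
    """
    target = (vehicle_heading_deg + relative_bearing_deg) % 360.0
    best_name, best_error = None, tolerance_deg
    for name, xy in features.items():
        error = angular_difference(bearing_deg(vehicle_xy, xy), target)
        if error <= best_error:
            best_name, best_error = name, error
    return best_name


# Usage: the occupant asks about something "to the right" (relative bearing 90).
features = {"bus 42": (100.0, 0.0), "train": (0.0, 100.0)}
print(identify_feature((0.0, 0.0), 0.0, 90.0, features))
```

Because both the vehicle and the features move, the positions passed in would need to be refreshed at query time; the sketch treats them as a snapshot.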
96.
Determining whether an acoustic event originated inside or outside a vehicle
A vehicle defines an interior space and an exterior space. Within the vehicle are internal microphones that are disposed to capture an acoustic event that originated in an origination space, which is either the interior space or the exterior space. An infotainment system includes circuitry that forms a head unit having an acoustic-signal processor that is configured to receive, from the microphones, a sound vector indicative of the acoustic event and to identify the origination space based at least in part on the sound vector.
An apparatus comprising an infotainment system including a proactive automotive assistant that executes a first action and a second action, wherein the first action is that of permitting spontaneous communication to an occupant in a vehicle and the second action is that of providing information indicating that spontaneous communication with the occupant is impermissible. The automotive assistant is configured to receive information selected from the group consisting of vehicle-status information concerning operation of the vehicle and occupant-status information concerning the occupant and to base the first and second actions at least in part on the information.
G08G 1/0967 - Systems involving transmission of highway information, e.g. weather, speed limits
B60R 16/023 - Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for; electric; for transmission of signals between vehicle parts or subsystems
B60R 16/037 - Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for; electric; for occupant comfort
G08G 1/0962 - Arrangements for giving variable traffic instructions having an indicator mounted inside the vehicle, e.g. giving voice messages
H04M 1/72454 - User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions, according to context-related or environment-related conditions
H04M 1/72463 - User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions, to restrict the functionality of the device
H04M 3/42 - Systems providing special services or facilities to subscribers
H04W 4/16 - Communication-related supplementary services, e.g. call-transfer or call-hold
H04W 4/40 - Services specially adapted for particular environments, situations or purposes, for vehicles, e.g. vehicle-to-pedestrians
98.
INTERACTIVE AUDIO ENTERTAINMENT SYSTEM FOR VEHICLES
A system for interacting with an audio stream to obtain lyric information, control playback of the audio stream, and control aspects of the audio stream. In some instances, end users can request that the audio stream play with a lead vocal track or without a lead vocal track. Obtaining lyric information includes receiving via a text to speech module an audio playback of the lyric information.
G10L 13/00 - Speech synthesis; Text to speech systems
G10L 25/54 - Speech or voice analysis techniques not restricted to a single one of the groups, specially adapted for particular use, for comparison or discrimination, for retrieval
99.
PLATFORM FOR INTEGRATING DISPARATE ECOSYSTEMS WITHIN A VEHICLE
A system for integrating disparate ecosystems including smart home and internet-of-things (IoT) ecosystems. The system includes a vehicle assistant that executes within the context of a cloud-based application and that retrieves sensor data and utterances from a vehicle and forwards the sensor data and passenger-spoken utterances to a cloud-based application. Using the sensor data and utterances, the cloud-based application selects and executes a predetermined routine that includes at least one action to be completed in the vehicle, on a mobile phone, or in a Smart Home/IoT ecosystem. The action is then completed by issuing the command to the vehicle head-unit, a specified mobile phone, or a target ecosystem selected from a group of disparate ecosystems.
H04W 4/44 - Services specially adapted for particular environments, situations or purposes, for vehicles, e.g. vehicle-to-pedestrians, for communication between vehicles and infrastructures, e.g. vehicle-to-cloud or vehicle-to-home
H04L 12/28 - Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
An automotive assistant that is connected to microphones and loudspeakers that are associated with different seats in a passenger vehicle includes a dialog manager that is configured to initiate a dialog based on an utterance received at a first one of the microphones and to advance that dialog based on an utterance received from another of the microphones.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialogue
B60K 35/10 - Input arrangements, i.e. from user to vehicle, associated with vehicle functions or specially adapted therefor
B60K 35/60 - Instruments characterised by their location or relative disposition in or on vehicles
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 17/14 - Use of phonemic categorisation or speech recognition prior to speaker recognition or verification