The present systems and methods for video decoding for machine processing extract features and image statistics from the decoded bitstream and evaluate the image statistics to predict whether augmentation of a frame will enhance task performance and generate at least one parameter for selectively applying an augmentation process. The augmentation process further including a DCT filter process, which may simulate JPT compression. The parameter is applied to selectively alter at least a portion of a frame to enhance task performance by the machine processing the decoded bitstream.
2.
SYSTEMS AND METHOD FOR DECODED FRAME AUGMENTATION FOR VIDEO CODING FOR MACHINES
The present systems and methods for video decoding for machine processing extract features and image statistics from the decoded bitstream and evaluate the image statistics to predict whether augmentation of a frame will enhance task performance and generate at least one parameter for selectively applying an augmentation process. The parameter is applied to selectively alter at least a portion of a frame to enhance task performance by the machine processing the decoded bitstream.
A video encoder and decoder system employing a compression-decision generative adversarial model (“CDGAM”) has an encoder with a compression network and a decoder with a task network. The compression network is a generator portion of a pretrained CDGAM and the task network is a discriminator portion of the pretrained CDGAM. The compression network and task network are trained using common image data wherein the compression network is trained to a generate minimal compressed representation that results in a substantially similar loss as the original image at the discriminator.
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
4.
ENCODER AND BITSTREAM FOR ADAPTIVE RESOLUTION MANAGEMENT SIGNALING
A video encoder includes circuitry to generate an encoded bitstream for decoding by a compliant decoder, the encoded bitstream including a first picture to be used as a reference picture, a current picture comprising a first coded block and a second coded block, a signal indicating that a block-based mode when a reference picture and a current picture have different resolutions is enabled, and information including an index useful for determining a scaling constant. The compliant decoder has circuitry configured to receive the bitstream; determine whether the block-based mode is enabled, and if enabled utilizing the mode to reconstruct the first block and reconstruct the second block without using the mode.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
5.
ENCODER AND BITSTREAM FOR ADAPTIVE RESOLUTION MANAGEMENT PREDICTION RESCALING
A video encoder generates an encoded bitstream which includes a reference picture, a current picture having a resolution different from the reference picture and also including a first block, an intra-coded second block, and information useful for determining a scaling constant. The encoded bitstream is decodable by a video decoder receiving the encoded bitstream and determining whether a block-based mode for decoding a current picture of one resolution using a reference picture of a different resolution is enabled. When enabled, the decoder uses the mode to decode the first block and decodes the intra-coded second block without using the mode.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
6.
SYSTEMS, METHODS, AND BITSTREAM STRUCTURE FOR VIDEO CODING AND DECODING FOR MACHINES WITH ADAPTIVE INFERENCE
Systems and methods are provided for encoding and decoding video data for machine applications, such as Video Coding for Machines, using inference models. An encoder uses an inference selector to determine an appropriate inference model to encode a feature substream. The encoder further employs an inference metadata encoder to encode parameters of the selected inference model into an inference metadata substream, which can be multiplexed with the feature substream to generate an encoded bitstream to send to a decoder site. A decoder receiving the encoded bitstream, extracts the inference metadata, selects the appropriate inference model, and applies that inference model in decoding the feature substream and generating a decoded output signal for machine consumption.
H04N 19/102 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
7.
Methods and systems of video coding using reference regions
A decoder includes circuitry configured to receive a bitstream, identify a first frame, locate a first independent reference region within the first frame, extract the first independent reference region from the first frame, decode a second frame using the first independent reference region as a reference for the second frame.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/139 - Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/80 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
8.
SYSTEMS AND METHODS FOR VIDEO CODING FOR MACHINES USING AN AUTOENCODER
Systems and methods for encoding and decoding video for machine consumption (video coding for machines) are provided in which an autoencoder is employed. The autoencoder has an encoder portion, a bottleneck portion, and a decoder portion. The autoencoder being distributed between a VCM encoder and VCM decoder such that the VCM encoder includes the encoder portion and bottleneck portion and the VCM decoder includes the bottleneck portion and the decoder portion.
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
9.
ENCODER AND BITSTREAM FOR GEOMETRIC PARTITIONING WITH AN ADAPTIVE NUMBER OF REGIONS
An encoder provides a bitstream having a coded current picture including a coded coding tree unit and signaling information. The signaling information indicating that geometric partitioning is enabled for the bitstream and further including partition index information enabling the decoder to determine endpoints for first and second straight line partition boundaries in the coding tree unit. The bitstream enables a compliant decoder to reconstruct the coding tree unit, which is partitioned into three non-rectangular regions by the first and second straight line partition boundaries.
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
10.
VIDEO AND FEATURE CODING FOR MULTI-TASK MACHINE LEARNING
A system and method for video and feature coding of neural-network structures used for multi-task machine learning includes an encoder, decoder and a decoder-compliant bitstream. A task-specific video decoder includes a first decoder receiving a bitstream having at least one feature and a description of a neural network backbone used to generate the bitstream, and a task-specific neural network head. The neural network head recreates the neural network backbone from the description, receives a feature from the bitstream a generates a task-specific output.
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/192 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
11.
SYSTEMS AND METHODS FOR VIDEO CODING OF FEATURES USING SUBPICTURES
Systems and methods for video coding of visual features that uses picture structures such as subpictures to independently encode individual features or groups of features are disclosed. An encoding method includes extracting a plurality of features from an image, representing each of the image features as a two-dimensional feature unit, grouping the feature units into at least one subpicture of the frame, and encoding the video frame into a bitstream. A compatible decoder for reconstructing the sequence of features is also provided.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
12.
INTELLIGENT MULTI-STREAM VIDEO CODING FOR VIDEO SURVEILLANCE
A video surveillance system includes a plurality of cameras including an action recognition engine that classifies the video content to at least one of a plurality of predetermined actions in real time. The action recognition engine of a first camera communicates with at least one other action recognition engine, whereby detected actions and tasks related to the detected action can be exchanged. The cameras include a feature encoder operatively coupled to the action recognition engine that generates an encoded feature substream and a video encoder that receives the video content and provides an encoded video substream. A multiplexor receives the encoded feature substream and encoded video substream and generates an encoded camera bitstream including encoded video content and detected action content, which can be decoded at a receiver site for human and machine consumption.
An image transmission system with privacy protection includes a privacy request processor which receives a request to secure at least one object type in an image prior to transmission of the image to a receiving system. An object detector receives the image data from a camera and processes the image data to identify at least one predetermined object type in the image data. An object scrambler receives a set of protected objects from the privacy request processor and receives the image data with identified objects of the predetermined object type and operates to obscure the image data for any identified objects identified as protected objects prior to encoding and transmission to a remote receiver site.
G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
A method includes receiving a bit stream, decoding a first frame using the bit stream, determining a scaled first frame using the first frame and a scaling constant, storing the first frame in a first picture buffer at a first index location, and storing the scaled first frame in a second picture buffer at the first index location. Related apparatus, systems, techniques and articles are also described.
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
H04N 19/112 - Selection of coding mode or of prediction mode according to a given display mode, e.g. for interlaced or progressive display mode
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/16 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
H04N 19/80 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
15.
GLOBAL MOTION MODELS FOR MOTION VECTOR INTERPREDICTION
A decoder is configured to receive a bit stream including a current frame and a picture header associated with the entire current frame, determine, as a function of the picture header, that one global motion mode is enabled for the entire current frame, the enabled global motion mode being selected from a group including translational motion, 4-parameter affine motion, and 6-parameter affine motion, detect, based on the enabled global motion mode, a plurality of parameters applicable to the entire frame, and decode the current frame using the detected parameters.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/46 - Embedding additional information in the video signal during the compression process
A system for low complexity and/or energy efficient encoding includes an encoding device, the encoding device configured to select a video application for an input video, set at least a boundary parameter, determine at least a goal parameter, generate a plurality of tool combinations, wherein each tool combination of the plurality of tool combinations has fewer tools than the full encoder toolset, tabulate measurements of each tool combination of the plurality of tool combinations, and select a tool combination of the plurality of tool combinations, wherein the selected tool combination generates optimal measurements as a function of the at least a goal parameter. An energy measurement device may be coupled to the encoding device and the selected tool combination may achieve the goal parameter at the highest energy efficiency.
H04N 19/156 - Availability of hardware or computational resources, e.g. encoding based on power-saving criteria
H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
17.
ENCODING VIDEO WITH EXTENDED LONG TERM REFERENCE PICTURE RETENTION
An encoder for video encoding and decoding with long term reference picture retention. The encoder generates an encoded bitream for a compliant decoder. At the decoder, coded pictures are decoded using a list of references pictures, of which at least one is a long term reference picture. A first picture is decoded using the long term reference picture. A second picture is decoded without using the long term reference picture which is inactive but is retained in a buffer. A subsequent picture is then decoded again using the long term reference picture.
Systems and methods for motion information transfer from visual to feature domain are disclosed which provide for mapping motion information from coding units in video content to corresponding convolution unit(s) in feature content. Systems and methods are also provided for improved decoder-side motion vector refinement of video content based on characteristics of corresponding feature units.
H04N 19/139 - Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
H04N 19/157 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
19.
SYSTEMS AND METHODS FOR OBJECT AND EVENT DETECTION AND FEATURE-BASED RATE-DISTORTION OPTIMIZATION FOR VIDEO CODING
Systems and methods for event and object detection and annotation in the video streams may include extracting a plurality of features in a picture in a video frame, grouping at least a portion of the plurality of features into at least one object, determining a region for the at least one object, assigning object identifiers to the at least one object and encoding the object identifiers into the bitstream. Feature-based rate distortion optimization may be employed for video coding including extracting a set of features from a picture in the video, generating a relevance map for the extracted features, determining a relevance score for portions of the picture using the relevance map, and encoding the portion of the picture with a bit rate determined at least in part by the relevance score.
H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
G06V 10/26 - Segmentation of patterns in the image fieldCutting or merging of image elements to establish the pattern region, e.g. clustering-based techniquesDetection of occlusion
G06V 10/77 - Processing image or video features in feature spacesArrangements for image or video recognition or understanding using pattern recognition or machine learning using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]Blind source separation
G06V 20/40 - ScenesScene-specific elements in video content
G06V 20/70 - Labelling scene content, e.g. deriving syntactic or semantic representations
H04N 19/115 - Selection of the code volume for a coding unit prior to coding
H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
20.
SYSTEMS AND METHODS FOR VIDEO ENCODING USING IMAGE SEGMENTATION
A video encoder is provided that includes an image detection and segmentation processor receiving the video frame and generating object recognition data and at least one image segmentation mask. A mask to coding block mapping processor maps the segmentation mask to the CTUs of the video frame and partitions at least one CTU into a plurality of coding units (CUs) based on a detected object boundary in the CTU. A video encoding processor receives the video frame, the object recognition data and the partitioned CUs and encodes the CUs with at least one of a resolution or quantization parameter determined at least in part on whether the CU includes an object.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
G06V 20/40 - ScenesScene-specific elements in video content
H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
Aspects relate to systems and methods for optimizing a loss function for video coding for machines. An exemplary system includes a computing device including circuitry and configured to receive an input video, extract a feature map as a function of the input video and at least a feature extraction parameter, encode a feature layer as a function of the feature map, calculate a loss function as a function of the feature layer, and optimize the at least a feature extraction parameter as a function of the loss function.
H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
A system for predictive coding includes a computing device configured to receive an input video, determine a quality impact as a function of the input video, wherein determining the quality impact further comprises identifying a frame drop indicator as a function of the input video, and determining the quality impact as a function of the frame drop indicator, and produce an encoded video as a function of the quality impact and an encoding process.
H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
H04N 19/65 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
23.
SYSTEMS AND METHODS FOR SCALABLE VIDEO CODING FOR MACHINES
Systems and methods for scalable video coding for machines is provided. In an aspect, a decoder is provided which includes circuitry configured to receive a bitstream, the bitstream including at least a header, at least a base feature layer, and at least a residual visual layer. The decoder is configured to decode the at least a base feature layer, decode the at least a residual visual layer, combine the at least a decoded base feature layer with the at least a residual visual layer, and output a human-viewable video as a function of the combined at least a decoded base feature layer and the at least a residual visual layer.
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
24.
SYSTEMS AND METHODS FOR JOINT OPTIMIZATION TRAINING AND ENCODER SIDE DOWNSAMPLING
A method of joint optimization-training, the method comprising identifying, by a feature extractor, an input feature, generating, by the feature extractor, a feature map for the input feature, wherein generating the feature map further comprises receiving an initial parameter set, instantiating a loss function representing a feature representation precision and an encoding complexity level as a function of the input feature and the initial parameter set, and generating the feature map that minimizes the loss function, and transmitting, by the feature extractor, the feature map to an encoder.
G06V 10/77 - Processing image or video features in feature spacesArrangements for image or video recognition or understanding using pattern recognition or machine learning using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]Blind source separation
G06T 3/4046 - Scaling of whole images or parts thereof, e.g. expanding or contracting using neural networks
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
25.
SYSTEMS AND METHODS FOR REGION PACKING BASED ENCODING AND DECODING
Systems and methods for video coding and decoding using region packing are provided. At an encoder, an encoded bitstream is generated having packed frames with a plurality of regions of interest therein. At least a portion of the region parameters to reconstruct the packed frame are encoded as Supplemental Enhancement Information. At the decoder, the encoded bitstream is decoded and parameters sufficient to place the regions within a reconstructed frame are extracted, including parameters extracted from Supplemental Enhancement Information. A reconstructed frame is generated which substantially maintains the spatial relationship and size of regions of interest in the original video frame.
H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/147 - Data rate or code amount at the encoder output according to rate distortion criteria
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
A video encoder is configured to encode a bitstream to be decoded by a compliant decoder receiving the bistream. The bitstream including a first coded picture having a first resolution and a second coded picture having a second resolution, wherein the first coded picture is an intra-coded picture, parameters defining a cropping window. The decoder decodes the first coded picture and stores the first picture as a reference picture, determines a scaling factor from at least one parameter in the bitstream defining the first cropping window, determines a predictor for the second picture from the reference picture and the scaling factor, decodes the second coded picture using the predictor, and outputs for display the first picture as cropped in accordance with the parameters defining the cropping window.
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/103 - Selection of coding mode or of prediction mode
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
27.
SYSTEMS AND METHODS FOR DECODER-SIDE SYNTHESIS OF VIDEO SEQUENCES
A method of decoder-side synthesis of video sequences includes receiving, by an encoder, an input video, detecting, by the encoder and in the input video, a first region including synthesizable content, and encoding, by the encoder and in a bitstream, the video, wherein encoding further comprises signaling the first region.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
28.
SYSTEMS AND METHODS FOR ENCODING AND DECODING VIDEO WITH MEMORY-EFFICIENT PREDICTION MODE SELECTION
A method of memory-efficient prediction mode selection includes receiving, by an encoder, a coded bitstream including a current frame, determining, by the encoder, costs of a first prediction mode and a second prediction mode, wherein determining further comprises determining, for the first prediction mode, a first bit cost and a first memory cost and determining, for the second prediction mode, a second bit cost and a second memory cost, selecting, by the encoder, a current prediction mode of the first prediction mode and the second prediction mode as a function of the first bit cost, first memory cost, second bit cost, and second memory cost, and encoding, by the encoder, the current frame using the current prediction mode. The prediction mode may be informed by at least one parameter received from the decoder.
H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
H04N 19/127 - Prioritisation of hardware or computational resources
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/156 - Availability of hardware or computational resources, e.g. encoding based on power-saving criteria
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
29.
SYSTEMS AND METHODS FOR ORGANIZING AND SEARCHING A VIDEO DATABASE
A system for organizing and searching a video database includes a logic circuit configured to extract, from a video, at least a video feature, generate at least a first hash value as a function of the at least a video feature, wherein generating the at least a first hash value further comprises performing a robust hash algorithm on the at least a feature, and store the video in a data structure, wherein storing the video further includes storing a representation of the video in a leaf node of the data structure and storing the at least a first hash value in a traversal index linking hash values to leaf nodes.
Systems and methods for encoding and decoding video for machine consumption in which superresolution is used to upsample regions of interest at a decoder are provided. A decoder receives an encoded bitstream that has at least one packed frame with at least one region of interest therein being downsampled prior to encoding. A superresolution processor selectively performs upsampling of the identified regions. Parameters related to downsampling may be signaled in the bitstream and used to identify regions of interest to upsample. The superresolution processor may include a neural network trained on a dataset of images that have similar statistical properties to the pictures that are anticipated by a machine system receiving the decoded bitstream and may be trained based on a loss function in a task network based on that dataset.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 21/4728 - End-user interface for requesting content, additional data or servicesEnd-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for manipulating displayed content for selecting a ROI [Region Of Interest], e.g. for requesting a higher resolution version of a selected region
H04N 21/23 - Processing of content or additional dataElementary server operationsServer middleware
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
31.
SYSTEMS, METHODS AND BITSTREAM STRUCTURE FOR HYBRID FEATURE VIDEO BITSTREAM AND DECODER
Methods and systems for hybrid feature video bitstream encoding and decoding are provided. Encoding can be performed in one of a plurality of modes. In one mode, the video components of the encoded bitstream include residual data generated encoding the difference between feature data and input video data. A compatible decoder can operate in a plurality of decoding modes. In one decoding mode, the video components are decoded using the encoded video and encoded feature data in the bitstream. The hybrid bitstream between the encoder and decoder is formed with a plurality of hybrid segments, each having a plurality of components specifying characteristics the feature and video content of the segment.
H04N 19/157 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
H04N 19/103 - Selection of coding mode or of prediction mode
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
A video encoder encodes a bitstream for decoding by a compliant decoder which is configured to selectively operate in a mode for generating a motion compensation predictor from a reference picture for a subsequent picture having a resolution different from the reference picture. The encoded bitstream includes a coded picture having a first coded region and a second coded region and signaling information from which the decoder determines first and second spatial resolution scale factors for reconstructing the first and second regions.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/174 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
H04N 19/33 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the spatial domain
H04N 19/46 - Embedding additional information in the video signal during the compression process
33.
SYSTEMS AND METHODS FOR REGION DETECTION AND REGION PACKING IN VIDEO CODING AND DECODING FOR MACHINES
A video encoder for encoding data for machine consumption includes a region detector selection module receiving source video and detector selection parameters and selecting an object detector model. A region detection module applies a selected model to the source video to identify regions of interest in the source video. A region extractor module extracts the identified regions from the source video and a region packing module packs the extracted regions into a packed frame which excludes pixels outside the regions of interest. A video encoder receives the packed frames and data related to the region parameters required to recreate the frame and generates an encoded bitstream. The encoder and encoding methods also include region padding and region merge and region split processing. Compatible decoders and bitstreams are also provided.
H04N 19/134 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
G06V 10/96 - Management of image or video recognition tasks
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
34.
SYSTEM AND METHOD FOR ADAPTIVE DECODER SIDE PADDING IN VIDEO REGION PACKING
Systems and methods for video encoding and decoding for machines consumption are disclosed. A decoder is provided for a decoding a bitstream encoded with a packed frame having at least on region of interest defined therein and encoded region parameters associated therewith. The decoder includes a video decoder receiving the bitstream and extracting the packed frame and region parameters. A region unpacking module receives the packed frame and region parameters and reconstructs an unpacked frame with at least one region of interest. A region padding module is provided in the decoder and applies at least one padding parameter to at least one dimension of a region of interest in the unpacked frame. The region padding may be fixed or dynamic.
H04N 19/167 - Position within a video image, e.g. region of interest [ROI]
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
H04N 21/2343 - Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
H04N 19/194 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive involving only two passes
35.
SYSTEMS AND METHODS FOR OBJECT BOUNDARY MERGING, SPLITTING, TRANSFORMATION AND BACKGROUND PROCESSING IN VIDEO PACKING
Systems and methods for encoding and decoding video content for machine consumption with enhanced region packing strategies. An encoder includes a region detector module which receives a source video and identifies regions of interest therein. A top-down region extractor module receives the identified regions of interest and generates modified set of regions of interest that can be packed in a frame more efficiently. A region packing module receives the modified set of regions of interest and arranges the modified set of regions of interest into a packed frame in which pixels outside the modified regions of interest are substantially excluded. A video encoder encodes the packed frame and region parameters into a coded bitstream. A compliant decoder provides complimentary processing to reconstruct a frame with the regions of interest arranged as they were in the source frame.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 21/4728 - End-user interface for requesting content, additional data or servicesEnd-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification or for manipulating displayed content for selecting a ROI [Region Of Interest], e.g. for requesting a higher resolution version of a selected region
H04N 21/23 - Processing of content or additional dataElementary server operationsServer middleware
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
36.
IMAGE AND VIDEO CODING WITH ADAPTIVE QUANTIZATION FOR MACHINE-BASED APPLICATIONS
A video coding system for machines employs adaptive quantization based on the frequency response of the machine model receiving the data. The encoder receives the machine model for the machine-based system and generates a frequency importance map from the machine model. An adjustment matrix is then generated based on the frequency importance map and is used to adjust coefficients of the default quantization matrix. The video data is quantized using the adjusted quantization matrix and encoded in a bitstream for transmission to a decoder site. At the decoder site, the decoder can extract parameters of the adjusted quantization matrix from the bitstream or calculate the adjusted quantization matrix using the machine model to inverse quantize the received data for machine consumption.
A video encoder is configured to encode a bit stream including a coded picture having a first contiguous region and a second contiguous region, the first contiguous region containing common motion, the second contiguous region containing local motion. The encoded bitstream is provided to a decoder which is configured to decode the first contiguous region of the coded picture to reconstruct the common motion by utilizing a motion model common to all of the coding blocks in the first region. A merge candidate list is selectively created and a merge candidate is selected to decode the first region based on the common motion model. The second contiguous region is decoded using individual motion information for each coding block in the second contiguous region.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
38.
METHODS, SYSTEMS AND ENCODED BITSTREAM FOR COMBINED LOSSLESS AND LOSSY CODING
An encoder includes circuitry configured to receive an input video, select a current frame identify a first sub-picture of the current frame to be encoded using a lossless encoding protocol, and encode the current frame, wherein encoding the current frame includes encoding the first sub-picture using the lossless encoding protocol. An encoded bitstream provided by an encoder is also provided
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/90 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups , e.g. fractals
39.
Encoder for Coded Pictures Having Regions With Common Motion Models
An encoder is configured to generate an encoded bitstream that can be decoded by a decoder that is configured to receive a bitstream. The bitstream includes a picture comprising a first contiguous region of a first plurality of coding blocks and a second contiguous region comprising a second plurality of coding blocks. The first plurality of blocks having a common motion model used to decode each of the blocks of the first region and the second plurality of blocks having local motion.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
40.
SIGNALING OF GLOBAL MOTION RELATIVE TO AVAILABLE REFERENCE FRAMES
Computer readable medium stores a bitstream which is decodable by a decoder that is f configured to receive a bitstream, extract a header including a list of reference frames available for motion compensation, such as global motion compensation, determine, using the header, a motion model for a current block, the motion relative to a reference frame contained in the list of reference frames, and decode the current block using the motion model. Related apparatus, systems, techniques and articles are also described.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/46 - Embedding additional information in the video signal during the compression process
A decoder includes circuitry configured to receive a bitstream; determine, for a current block and using the bitstream, a global motion vector candidate utilized by an adjacent block; construct a motion vector candidate list including adding the determined global motion vector candidate to the motion vector candidate list; and reconstruct pixel data of the current block and using the motion vector candidate list. Related apparatus, systems, techniques and articles are also described.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/56 - Motion estimation with initialisation of the vector search, e.g. estimating a good candidate to initiate a search
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
42.
ENCODER AND BITSTREAM FOR EFFICIENT CODING OF GLOBAL MOTION VECTORS
A video encoder is configured to generate a bitstream for decoding by a compatible decoder which includes circuitry configured to receive a bitstream, extract a residual of a control point motion vector for a current frame and from the bitstream, and combine the residual of the control point motion vector with a prediction of the control point motion vector for the current frame.
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
H04N 19/80 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
43.
ONLINE AND OFFLINE SELECTION OF EXTENDED LONG TERM REFERENCE PICTURE RETENTION
An encoder includes circuitry configured to receive a video, determine a plurality of long-term reference frames, determine a long-term reference retention time, and encode the video into a bitstream including the determined plurality of long-term reference fames and the determined retention time. A decoded picture buffer for retaining a plurality of reconstructed pictures, including a long term reference picture, is also provided.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/58 - Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
44.
SIGNALING OF GLOBAL MOTION VECTOR IN PICTURE HEADER
A decoder includes circuitry configured to receive a bitstream, extract a header, determine, using the header, a global motion model, and decode a current block of a current frame using the global motion model. Related apparatus, systems, techniques and articles are also described.
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
A decoder includes circuitry configured to receive a bitstream, extract a header associated with a current frame and including a signal characterizing that global motion is enabled and further characterizing parameters of a global motion model, and decoding the current frame, the decoding including using a motion model for each current block having a complexity that is less than or equal to a complexity of the global motion model. Related apparatus, systems, techniques, and articles are also described.
H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
46.
ENCODER AND BITSTREAM FOR VIDEO WITH FRAMES WITH GLOBAL MOTION
A video encoder s configured to encode a bitstream to be decoded by a compatible decoder, the bitstream including a coded picture with a first region with a first contiguous plurality of coded blocks and signaling information associated with each coded block. For each coded block in the first region, in response to the associated signaling information, the decoder receiving the bitstream constructs a motion vector candidate list including a motion vector candidate enabling the reconstruction of global motion in the first region. If the motion vector candidate is a translational motion vector, the decoder decodes the blocks in the first region using translational motion compensation to reconstruct global motion in the first region. If the one motion vector candidate comprises multiple control point motion vectors, the decoder uses affine motion compensation to reconstruct global motion in the first region.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
A system for coding and decoding of signals, such as pictures and video, intended for machine use is disclosed. An encoder implements a method for kernel-based transformation of the signal samples that reduces and preferably minimizes entropy of the input signal yet is equivalent to the input signal for the intended machine operation. As a result, the transformed signal sample can be compressed using fewer bits than the original. The encoded signal is received by a complimentary decoder and decoded for machine use.
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
Systems and methods for video coding and decoding using region packing are provided. At an encoder, a region detection module receives a video frame for encoding, identifies regions of interest in the video frame, and generates a bounding box for each region of interest. A region extractor module obtains the pixels within the bounding box from the video frame. A region packing module receives the identified regions of interest and arranges the bounding boxes within a packed frame substantially reducing the data to be encoded outside the identified regions of interest. A video encoder receives the packed frame and generates an encoded bitstream therefrom. At the decoder, the encoded bitstream is decoded and parameters sufficient to place the regions within a reconstructed frame are extracted. A reconstructed frame is generated which substantially maintains the spatial relationship and size of regions of interest in the original video frame.
H04N 13/117 - Transformation of image signals corresponding to virtual viewpoints, e.g. spatial image interpolation the virtual viewpoint locations being selected by the viewers or determined by viewer tracking
H04N 21/234 - Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
H04N 21/44 - Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
H04N 13/161 - Encoding, multiplexing or demultiplexing different image signal components
H04N 19/115 - Selection of the code volume for a coding unit prior to coding
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
49.
Encoder for interprediction in geometric partitioning with an adaptive number of regions
An encoder configured to generate an encoded bitstream for decoding by a compatible decoder, wherein the decoder receives the encoded bitstream with a current picture including a current block having a first partition boundary partitioning the block into first and second non-rectangular regions and a second partition boundary, intersecting the first partition boundary, and partitioning the second non-rectangular region into three portions. Using the encoded bitstream, the decoder determines a first predictor for use within the first non-rectangular region using a first motion vector selected from a first list of motion vector candidates, determines a second predictor for use within the second non-rectangular region using a second motion vector selected from a second list of motion vector candidates; and decodes the current block using the first predictor and the second predictor. Decoding further includes smoothing the first predictor and the second predictor across the first partition boundary.
H04N 19/136 - Incoming video signal characteristics or properties
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
50.
SYSTEMS AND METHODS FOR CODING AND DECODING IMAGE DATA USING GENERAL ADVERSARIAL MODELS
A video encoder and decoder system employing a compression-decision generative adversarial model ("CDGAM") has an encoder with a compression network and a decoder with a task network. The compression network is a generator portion of a pretrained CDGAM and the task network is a discriminator portion of the pretrained CDGAM. The compression network and task network are trained using common image data wherein the compression network is trained to a generate minimal compressed representation that results in a substantially similar loss as the original image at the discriminator.
H04N 19/15 - Data rate or code amount at the encoder output by monitoring actual compressed data size at the memory before deciding storage at the transmission buffer
H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/82 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
Systems and methods are provided for encoding and decoding video data for machine applications, such as Video Coding for Machines, using inference models. An encoder uses an inference selector to determine an appropriate inference model to encode a feature substream. The encoder further employs an inference metadata encoder to encode parameters of the selected inference model into an inference metadata substream, which can be multiplexed with the feature substream to generate an encoded bitstream to send to a decoder site. A decoder receiving the encoded bitstream, extracts the inference metadata, selects the appropriate inference model, and applies that inference model in decoding the feature substream and generating a decoded output signal for machine consumption.
H04N 19/103 - Selection of coding mode or of prediction mode
H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
An encoder configured to encode a video bitstream for decoding by a compliant decoder configured to receive the bitstream, the encoded bitstream including a reference picture and a current picture including a coded current block, the bitstream also including an index useful for determining a scaling factor. The decoder determines that a mode when a reference picture and a current picture have different resolutions is enabled. Utilizing the mode, the decoder reconstructs the coded current block encoded by the encoder by determining a scaling factor using information in the bitstream including the index, determining a scaled prediction block by scaling a block of the reference picture using the scaling factor and by applying a pixel interpolation filter determined specifically for the prediction block and combining the scaled prediction block with a residual.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
53.
SYSTEMS AND METHODS FOR VIDEO CODING FOR MACHINES USING AN AUTOENCODER
Systems and methods for encoding and decoding video for machine consumption (video coding for machines) are provided in which an autoencoder is employed. The autoencoder has an encoder portion, a bottleneck portion, and a decoder portion. The autoencoder being distributed between a VCM encoder and VCM decoder such that the VCM encoder includes the encoder portion and bottleneck portion and the VCM decoder includes the bottleneck portion and the decoder portion.
H04N 19/102 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
G06N 3/04 - Architecture, e.g. interconnection topology
H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
H04N 19/635 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using sub-band based transform, e.g. wavelets characterised by filter definition or implementation details
H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
H04N 19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
H04N 19/149 - Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
54.
SYSTEMS AND METHODS FOR PRIVACY PROTECTION IN VIDEO COMMUNICATION SYSTEMS
An image transmission system with privacy protection includes a privacy request processor which receives a request to secure at least one object type in an image prior to transmission of the image to a receiving system. An object detector receives the image data from a camera and processes the image data to identify at least one predetermined object type in the image data. An object scrambler receives a set of protected objects from the privacy request processor and receives the image data with identified objects of the predetermined object type and operates to obscure the image data for any identified objects identified as protected objects prior to encoding and transmission to a remote receiver site.
A video encoder is provided which generates a bitstream for decoding by a compliant decoder having an interprediction mode for generating a predictor from a reference picture for a subsequent picture having a spatial resolution different from the reference picture. The encoder provides a bitstream in which the decoder can scale the spatial resolution of a block of pixels of the reference picture using a scale factor signaled in the encoded bitstream and using a filter chosen for the particular block of the reference picture and not the reference picture as a whole to form a predictor.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/59 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
A system and method for video and feature coding of neural-network structures used for multi-task machine learning includes an encoder, decoder and a decoder-compliant bitstream. A task-specific video decoder includes a first decoder receiving a bitstream having at least one feature and a description of a neural network backbone used to generate the bitstream, and a task-specific neural network head. The neural network head recreates the neural network backbone from the description, receives a feature from the bitstream a generates a task-specific output.
Systems and methods for video coding of visual features that uses picture structures such as subpictures to independently encode individual features or groups of features are disclosed. An encoding method includes extracting a plurality of features from an image, representing each of the image features as a two-dimensional feature unit, grouping the feature units into at least one subpicture of the frame, and encoding the video frame into a bitstream. A compatible decoder for reconstructing the sequence of features is also provided.
A video surveillance system includes a plurality of cameras including an action recognition engine that classifies the video content to at least one of a plurality of predetermined actions in real time. The action recognition engine of a first cameral communicates with at least one other action recognition engine, whereby detected actions and tasks related to the detected action can be exchanged. The cameras include a feature encoder operatively coupled to the action recognition engine which generates an encoded feature substream and a video encoder that receives the video content and provides an encoded video substream. A multiplexor receives the encoded feature substream and encoded video substream and generates an encoded camera bitstream including encoded video content and detected action content, which can be decoded at a receiver site for human and machine consumption.
A decoder is configured to receive a bit stream including a current frame and a picture header associated with the entire current frame, determine, as a function of the picture header, that one global motion mode is enabled for the entire current frame, the enabled global motion mode being selected from a group including translational motion, 4-parameter affine motion, and 6-parameter affine motion, detect, based on the enabled global motion mode, a plurality of parameters applicable to the entire frame, and decode the current frame using the detected parameters.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/46 - Embedding additional information in the video signal during the compression process
60.
SYSTEMS AND METHODS FOR OBJECT AND EVENT DETECTION AND FEATURE-BASED RATE-DISTORTION OPTIMIZATION FOR VIDEO CODING
Systems and methods for event and object detection and annotation in the video streams may include extracting a plurality of features in a picture in a video frame, grouping at least a portion of the plurality of features into at least one object, determining a region for the at least one object, assigning object identifiers to the at least one object, and encoding the object identifiers into the bitstream. Systems and methods for feature-based rate distortion optimization for video coding are disclosed and include extracting a set of features from a picture in the video, generating a relevance map for the extracted features, determining a relevance score for portions of the picture using the relevance map, and encoding the portion of the picture with a bit rate determined at least in part by the relevance score.
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/14 - Coding unit complexity, e.g. amount of activity or edge presence estimation
G06V 20/40 - ScenesScene-specific elements in video content
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
61.
SYSTEMS AND METHODS FOR MOTION INFORMATION TRANSFER FROM VISUAL TO FEATURE DOMAIN
Systems and methods for motion information transfer from visual to feature domain are disclosed which provide for mapping motion information from coding units in video content to corresponding convolution unit(s) in feature content. Systems and methods are also disclosed for providing improved decoder-side motion vector refinement of video content based on characteristics of corresponding feature units.
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
A video encoder is provided that includes an image detection and segmentation processor receiving the video frame and generating object recognition data and at least one image segmentation mask. A mask to coding block mapping processor maps the segmentation mask to the CTUs of the video frame and partitions at least one CTU into a plurality of coding units (CUs) based on a detected object boundary in the CTU. A video encoding processor receives the video frame, the object recognition data and the partitioned CUs and encodes the CUs with at least one of a resolution or quantization parameter determined at least in part on whether the CU includes an object.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/20 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
A system for low complexity and/or energy efficient encoding includes an encoding device, the encoding device configured to select a video application for an input video, set at least a boundary parameter, determine at least a goal parameter, generate a plurality of tool combinations, wherein each tool combination of the plurality of tool combinations has fewer tools than the full encoder toolset, tabulate measurements of each tool combination of the plurality of tool combinations, and select a tool combination of the plurality of tool combinations, wherein the selected tool combination generates optimal measurements as a function of the at least a goal parameter. An energy measurement device may be coupled to the encoding device and the selected tool combination may achieve the goal parameter at the highest energy efficiency.
H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
A method includes receiving a bit stream; determining, using the bit stream, whether an adaptive resolution management mode is enabled; determining, using the bit stream, a scaling factor including a vertical scaling constant component and a horizontal scaling constant component; and reconstructing pixel data of a current block in adaptive resolution management mode and using the determined scaling factor, the reconstructing including scaling the current block horizontally by the horizontal scaling constant component and scaling the current block vertically by the vertical scaling constant component. Related apparatus, systems, techniques and articles are also described.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
65.
SYSTEMS AND METHODS FOR OPTIMIZING A LOSS FUNCTION FOR VIDEO CODING FOR MACHINES
Aspects relate to systems and methods for optimizing a loss function for video coding for machines. An exemplary system includes a computing device including circuitry and configured to recieve an input video, extract a feature map as a function of the input video and at least a feature extraction parameter, encode a feature layer as a function of the feature map, calculate a loss function as a function of the feature layer, and optimize the at least a feature extraction parameter as a function of the loss function.
H04N 19/85 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
H04N 19/436 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation using parallelised computational arrangements
A system for predictive coding includes a computing device configured to receive an input video, determine a quality impact as a function of the input video, wherein determining the quality impact further comprises identifying a frame drop indicator as a function of the input video, and determining the quality impact as a function of the frame drop indicator, and produce an encoded video as a function of the quality impact and an encoding process.
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/31 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
H04N 19/50 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
H04N 19/67 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving unequal error protection [UEP], i.e. providing protection according to the importance of the data
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
67.
SYSTEMS AND METHODS FOR SCALABLE VIDEO CODING FOR MACHINES
Systems and methods for scalable video coding for machines is provided. In an aspect, a decoder is provided which includes circuitry configured to receive a bitstream, the bitstream including at least a header, at least a base feature layer, and at least a residual visual layer. The decoder is configured to decode the at least a base feature layer, decode the at least a residual visual layer, combine the at least a decoded base feature layer with the at least a residual visual layer, and output a human-viewable video as a function of the combined at least a decoded base feature layer and the at least a residual visual layer.
H04N 19/187 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a scalable video layer
H04N 19/39 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability involving multiple description coding [MDC], i.e. with separate layers being structured as independently decodable descriptions of input picture data
H04N 19/503 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
68.
Decoder for coded pictures having regions with common motion models
A decoder includes circuitry configured to receive a bitstream; construct, for a current block, a motion vector candidate list including adding a single global motion vector candidate to the motion vector candidate list, the single global motion vector candidate selected based on a global motion model utilized by the current block; and reconstruct pixel data of the current block and using the motion vector candidate list. Related apparatus, systems, techniques and articles are also described.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
A method of joint optimization-training, the method comprising identifying, by a feature extractor, an input feature, generating, by the feature extractor, a feature map for the input feature, wherein generating the feature map further comprises receiving an initial parameter set, instantiating a loss function representing a feature representation precision and an encoding complexity level as a function of the input feature and the initial parameter set, and generating the feature map that minimizes the loss function, and transmitting, by the feature extractor, the feature map to an encoder.
H04N 19/82 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
A decoder includes circuitry configured to receive a bitstream; store a plurality of long-term reference frames in a reference list; retain a long-term reference frame in the reference list for a length of time based on a retention time; and decode at least a portion of video using the long-term reference frame retained in the reference list. Related apparatus, systems, techniques and articles are also described.
A method of memory-efficient prediction mode selection includes receiving, by an encoder, a coded bitstream including a current frame, determining, by the encoder, costs of a first prediction mode and a second prediction mode, wherein determining further comprises determining, for the first prediction mode, a first bit cost and a first memory cost and determining, for the second prediction mode, a second bit cost and a second memory cost, selecting, by the encoder, a current prediction mode of the first prediction mode and the second prediction mode as a function of the first bit cost, first memory cost, second bit cost, and second memory cost, and encoding, by the encoder, the current frame using the current prediction mode. The prediction mode may be informed by at least one parameter received from the decoder.
A method of decoder-side synthesis of video sequences includes receiving, by an encoder, an input video, detecting, by the encoder and in the input video, a first region including synthesizable content, and encoding, by the encoder and in a bitstream, the video, wherein encoding further comprises signaling the first region.
H04N 19/17 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
H04N 19/189 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
A decoder is configured to receive a bitstream that has a reference picture, a current picture having a resolution different from the reference picture and including a first block and a second block that is an intra-coded block, and information useful for determining a scaling constant. The decoder is further configured to determine a scaling constant using the information in the bitstream, determine a scaled prediction block from the reference picture by determining a location of the scaled prediction block by scaling a motion vector component using the scaling constant, scaling the resolution of a block of the reference picture using the scaling constant and applying a filter determined specifically for the prediction block and not determined for the whole reference picture, reconstruct the first block using the scaled prediction block, and reconstruct the intra-coded second block without using a scaled prediction block.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/137 - Motion inside a coding unit, e.g. average field, frame or block difference
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
74.
SYSTEMS AND METHODS FOR ORGANIZING AND SEARCHING A VIDEO DATABASE
A system for organizing and searching a video database includes a logic circuit configured to extract, from a video, at least a video feature, generate at least a first hash value as a function of the at least a video feature, wherein generating the at least a first hash value further comprises performing a robust hash algorithm on the at least a feature, and store the video in a data structure, wherein storing the video further includes storing a representation of the video in a leaf node of the data structure and storing the at least a first hash value in a traversal index linking hash values to leaf nodes.
G06F 16/583 - Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
75.
ENCODER AND DECODER FOR VIDEO CODING FOR MACHINES (VCM)
A video coding for machines (VCM) encoder includes a first video encoder, the first video encoder configured to encode an input video into a bitstream. The VCM encoder includes a feature extractor, the feature extractor configured to detect at least a feature in the input video. The VCM encoder includes a second encoder, the second encoder configured to encode a feature bitstream as a function of the input video and at least a feature.
G06V 10/40 - Extraction of image or video features
G06V 10/764 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
76.
VIDEO CODING FOR MACHINES (VCM) ENCODER AND DECODER FOR COMBINED LOSSLESS AND LOSSY ENCODING
A video coding form machines (VCM) encoder for combined lossless and lossy encoding includes a feature encoder, the feature encoder configured to encode a sub-picture containing a feature in an input video and provide an indication of the sub-picture, and a video encoder, the video encoder configured to receive an indication of the sub-picture from the feature encoder and encode the picture using a lossy encoding protocol
H04N 19/154 - Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/182 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
77.
VIDEO SIGNAL PROCESSOR FOR BLOCK-BASED PICTURE PROCESSING
A video signal processor is provided, having an inverse quantizer, an inverse transform processor, an in-loop filter; and a decoded picture buffer. The video signal processor is configured to receive a video signal including a picture comprising quantized pixels, wherein the picture includes a first region and a second region. The first region comprising a first plurality of blocks and having a first quantization parameter based on a first average measure of spatial activity information of the first plurality of blocks; and a second region comprising a second plurality of blocks and having a second quantization parameter based on a second average measure of information of the second plurality of blocks. The inverse quantizer being configured to inverse quantize the pixels of the blocks comprising the first region using the first quantization parameter and inverse quantize the pixels of the blocks comprising the second region using the second quantization parameter.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
A decoder includes circuitry configured to receive a bitstream, extract a residual of a control point motion vector for a current frame and from the bitstream, and combine the residual of the control point motion vector with a prediction of the control point motion vector for the current frame. Related apparatus, systems, techniques and articles are also described.
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
H04N 19/80 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
A method includes receiving a bit stream, decoding a first frame using the bit stream, determining a scaled first frame using the first frame and a scaling constant, storing the first frame in a first picture buffer at a first index location, and storing the scaled first frame in a second picture buffer at the first index location. Related apparatus, systems, techniques and articles are also described.
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/423 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/463 - Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
H04N 19/16 - Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
H04N 19/80 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
H04N 19/11 - Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/112 - Selection of coding mode or of prediction mode according to a given display mode, e.g. for interlaced or progressive display mode
80.
SYSTEMS, METHODS AND BITSTREAM STRUCTURE FOR HYBRID FEATURE VIDEO BITSTREAM AND DECODER
Methods and systems for hybrid feature video bitstream encoding and decoding are provided. Encoding can be performed in one of a plurality of modes. In one mode, the video components of the encoded bitstream include residual data generated encoding the difference between feature data and input video data. A compatible decoder can operate in a plurality of decoding modes. In one decoding mode, the video components are decoded using the encoded video and encoded feature data in the bitstream. The hybrid bitstream between the encoder and decoder is formed with a plurality of hybrid segments, each having a plurality of components specifying characteristics the feature and video content of the segment.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
H04N 19/82 - Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
81.
Methods and systems for combined lossless and lossy coding
A decoder includes circuitry configured to receive a bitstream identify, in the bitstream, a current frame, wherein the current frame includes a plurality of regions, detect, in the bitstream, an indication that a first region is encoded according to a lossless encoding protocol and another region is encoded according to a lossy encoding protocol, and decode the current frame, wherein decoding the current frame further comprises decoding the first region using a lossless decoding protocol corresponding to the lossless encoding protocol.
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H03M 7/30 - CompressionExpansionSuppression of unnecessary data, e.g. redundancy reduction
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
82.
Methods and systems for combined lossless and lossy coding
A decoder includes circuitry configured to receive a bitstream identify, in the bitstream, a current frame, wherein the current frame includes a first region and a third region, detect, in the bitstream, an indication that the first region is encoded according to a lossless encoding protocol, and decode the current frame, wherein decoding the current frame further comprises decoding the first region using a lossless decoding protocol corresponding to the lossless encoding protocol.
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/60 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H03M 7/30 - CompressionExpansionSuppression of unnecessary data, e.g. redundancy reduction
A decoder includes circuitry configured to receive a bitstream; construct, for a current block, a motion vector candidate list including adding a global motion vector candidate to the motion vector candidate list; and reconstruct pixel data of the current block and using the motion vector candidate list. Related apparatus, systems, techniques and articles are also described.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
A decoder includes circuitry configured to receive a bitstream, including a current picture having a current block with a first boundary partitioning the block into first and second non-rectangular regions and a second boundary further partitioning the second non-rectangular region. The decoder determines a first predictor for use within the first non-rectangular region using a first motion vector selected from a first list of motion vector candidates, determines a second predictor for use within the second non-rectangular region using a second motion vector selected from a second list of motion vector candidates, smooths the predictors across the partition boundary and decodes the current block using the first and second predictors.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/543 - Motion estimation other than block-based using regions
H04N 19/593 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
A decoder includes circuitry configured to receive a bitstream, extract a header including a list of reference frames available for motion compensation, such as global motion compensation, determine, using the header, a motion model for a current block, the motion relative to a reference frame contained in the list of reference frames, and decode the current block using the motion model. Related apparatus, systems, techniques and articles are also described.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/117 - Filters, e.g. for pre-processing or post-processing
H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/46 - Embedding additional information in the video signal during the compression process
A decoder includes circuitry configured to receive a bitstream, extract a header, determine, using the header, a global motion model, and decode a current block of a current frame using the global motion model. Related apparatus, systems, techniques and articles are also described.
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
A decoder comprising circuitry configured to receive a bitstream, partition the second non-rectangular region of the current block via a geometric partitioning mode to partition the current block into three portions, determine a first predictor for use on a first side of the at least a first partition boundary using a first motion vector, wherein the first motion vector extends from the first partition boundary to the second partition boundary as a function a line segment slope angle, determine a second predictor as a function of a second motion vector, wherein the second motion vector originates at a geometric reference of the current block of pixels and extends to the first motion vector, and decode the current block using the first motion vector and the second motion vector.
An encoder comprising circuitry configured to receive a bitstream, partition the second non-rectangular region of the current block via a geometric partitioning mode to partition the current block into three portions, determine a first predictor for use on a first side of the at least a first partition boundary using a first motion vector, wherein the first motion vector extends from the first partition boundary to the second partition boundary as a function a line segment slope angle, determine a second predictor as a function of a second motion vector, wherein the second motion vector originates at a geometric reference of the current block of pixels and extends to the first motion vector, and decode the current block using the first motion vector and the second motion vector.
H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
89.
METHODS AND SYSTEMS FOR COMBINED LOSSLESS AND LOSSY CODING
A decoder includes circuitry configured to receive a bitstream, identify, in the bitstream, a current frame, wherein the current frame includes a first region, detect, in the bitstream, an indication that the first region is encoded according to a lossless encoding protocol, and decode the current frame, wherein decoding the current frame further comprises decoding the first region using a lossless decoding protocol.
H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
90.
METHODS AND SYSTEMS FOR COMBINED LOSSLESS AND LOSSY CODING
An encoder includes circuitry configured to receive an input video, select a current frame, identify a first region of the current frame to be encoded using a lossless encoding protocol, and encode the current frame, wherein encoding the current frame further comprises encoding the first region using the lossless encoding protocol.
H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
91.
Global motion constrained motion vector in inter prediction
A decoder includes circuitry configured to receive a bitstream, extract a header associated with a current frame and including a signal characterizing that global motion is enabled and further characterizing parameters of a global motion model, and decoding the current frame, the decoding including using a motion model for each current block having a complexity that is less than or equal to a complexity of the global motion model. Related apparatus, systems, techniques, and articles are also described.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/70 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
92.
METHODS AND SYSTEMS FOR CONSTRUCTING MERGE CANDIDATE LIST INCLUDING ADDING A NON- ADJACENT DIAGONAL SPATIAL MERGE CANDIDATE
A decoder includes circuitry configured to receive a bitstream, determine whether a merge mode is enabled for a block, construct a merge candidate list including adding a non-adjacent diagonal spatial candidate to the merge candidate list, and reconstruct pixel data of the block and using the merge candidate list. Related apparatus, systems, techniques and articles are also described.
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
A method includes receiving a reference frame, determining, for a current block, a scaling constant; scaling a block of the reference frame according to the scaling constant, the block of the reference frame corresponding to the current block, and reconstructing pixel data of the current block and using the scaled block of the reference frame. Related apparatus, systems, techniques and articles are also described.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/132 - Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
H04N 19/186 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
H04N 19/30 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
94.
Block-based picture fusion for contextual segmentation and processing
An encoder includes circuitry configured to receive a video frame, partition the video frame into blocks, determine a first area within the video frame including a first grouping of a first subset of the blocks, determine a first average measure of information of the first area, and encode the video frame, the encoding including controlling a quantization parameter based on the first average measure of information of the first area. Related apparatus, systems, techniques and articles are also described.
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
95.
Online and offline selection of extended long term reference picture retention
An encoder includes circuitry configured to receive a video, determine a plurality of long-term reference frames, determine a long-term reference retention time, and encode the video into a bitstream including the determined plurality of long-term reference fames and the determined retention time. Related apparatus, systems, techniques and articles are also described.
H04N 19/105 - Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
H04N 19/172 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
H04N 19/46 - Embedding additional information in the video signal during the compression process
H04N 19/58 - Motion compensation with long-term prediction, i.e. the reference frame for a current frame not being the temporally closest one
96.
Shape adaptive discrete cosine transform for geometric partitioning with an adaptive number of regions
A decoder includes circuitry configured to receive a bitstream; determine a first region, a second region, and a third region of a current block and according to a geometric partitioning mode and decoding the current block. Related apparatus, systems, techniques and articles are also described. Decoder may determine, from a signal contained in the bitstream, a coding transformation type to decode the first region, the second region, and/or the third region, the coding transformation type characterizing at least an inverse block discrete cosine transformation and an inverse shape adaptive discrete cosine transformation, and the decoding of the current block may include using the determined transformation type for inverse transformation for each of the first region, the second region and/or the third region.
H04N 19/119 - Adaptive subdivision aspects e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
H04N 19/12 - Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
H04N 19/159 - Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
H04N 19/61 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
H04N 19/625 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
97.
METHODS AND SYSTEMS FOR COMBINED LOSSLESS AND LOSSY CODING
An encoder includes circuitry configured to receive an input video, select a current frame identify a first sub -picture of the current frame to be encoded using a lossless encoding protocol, and encode the current frame, wherein encoding the current frame includes encoding the first sub-picture using the lossless encoding protocol.
H04N 7/12 - Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal
98.
Global motion models for motion vector inter prediction
A decoder is configured to receive a bit stream including a current frame and a picture header associated with the entire current frame, determine, as a function of the picture header, that one global motion mode is enabled for the entire current frame, the enabled global motion mode being selected from a group including translational motion, 4-parameter affine motion, and 6-parameter affine motion, detect, based on the enabled global motion mode, a plurality of parameters applicable to the entire frame, and decode the current frame using the detected parameters.
H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
H04N 19/44 - Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
H04N 19/176 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
H04N 19/184 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
H04N 19/46 - Embedding additional information in the video signal during the compression process
A decoder includes circuitry configured to receive a bitstream, identify a first frame, locate a first independent reference region within the first frame, extract the first independent reference region from the first frame, decode a second frame using the first independent reference region as a reference for the second frame.
A decoder includes circuitry configured to receive a bitstream identify, in the bitstream, a current frame, wherein the current frame includes a first region and a third region, detect, in the bitstream, an indication that the first region is encoded according to a lossless encoding protocol, and decode the current frame, wherein decoding the current frame further comprises decoding the first region using a lossless decoding protocol corresponding to the lossless encoding protocol.