Video Search Engines Glossary

AAF - Advanced Authoring Format

ADI - CableLabs® Asset Distribution Interface Specification used for VoD metadata

AMDF - Average magnitude difference function used for pitch calculation

AMG - All Media Guide, provider of metadata services

ANNIE - A Nearly-New Information Extraction System

ASF - Microsoft’s Advanced Streaming Format

ASR - Automatic Speech Recognition

ATM - Asynchronous Transfer Mode – a networking technology providing guaranteed quality of service (QoS)

ATSC - Advanced Television Standards Committee

ATIS/IIF - Alliance for Telecommunications Industry Solutions / IPTV Interoperability Forum

BiM - Binary format for MPEG-7 provides high compression of XML representations using the schema definition to remove the syntax redundancy and allows separate source coders to be used for sets of element or attribute values

CGM - Consumer generated media

CID - Content Identifier

CMML - Continuous Media Markup Language

CMS - Content management system

COV - Consumer originated video

CSP - Communications Service Provider

DAM - Digital asset management


DBMS - Database management system

DCT - Discrete cosine transform

DOM - Document Object Model: an interface for accessing HTML and XML in a tree structure; used from languages such as JavaScript (ECMAScript).

DMA - FCC defined metropolitan area for television / radio broadcasting, 210 in the US

DLNA - Digital living network alliance; develops standards for home media device interoperability

DVB - Digital video broadcasting

DVR - Digital video recorder

EMD - earth movers distance

ETSI - European Telecommunications Standards Institute

FAR - Frame aspect ratio

GATE - General Architecture for Text Engineering

Geoblocking - restricting content based on location (blackouts)

GMM - Gaussian mixture model

GoP - Group of pictures

GPS - Global positioning system

GXF - General exchange format

HMM - Hidden Markov model

IAR - Image aspect ratio K-Space – Knowledge Space

Lemmatize - Convert a word to its root form; a more advanced form of stemming.

LSCOM - Large-Scale Concept Ontology for Multimedia workshop sponsored by the Disruptive Technology Office (DTO)

LVCASR, LVASR, VLVASR – Large Vocabulary Continuous Automatic Speech Recognition, sometimes VLVASR for “Very_.” Continuous implies that the input speech waveforms are not segmented and may continue without interruption for many minutes. Also the “C” sometimes references “conversational” connoting a task with multiple speakers and differentiating from “read speech” – a less demanding task since there are fewer disfluencies, and better adherence to rules of grammar, etc.

MFCC - Mel-frequency cepstral coefficients, acoustic features used widely in speech signal processing

MIC - Memory in cassette, an NVRAM chip in a tape cassette for improving access time and storing metadata

MPEG - Moving Picture Experts Group, a working group of ISO/IEC charged with development of video and audio encoding standards.

MPF - Metadata Production Framework; Metadata Editor tools from NHK

NEE - Named entity extraction

NLP / NLU - Natural language processing / understanding refers to the study of computational linguistics typically with the goal of recovering some form of semantics or meaning from textual data

OCR - Optical character recognition

OTT - Over the top: delivery of video on the Internet without guaranteed QoS; may refer to download delivery or services like Joost that deliver a TV like experience a potential threat to Cable and IPTV VoD and service providers.

PAR - Pixel aspect ratio

PDA - Personal digital assistant

POS tagging - Part of speech tagging is the NLP operation of assigning tags to input text to classify words as parts of speech like nouns, verbs, etc.

QbH - Query by humming

QoS - quality of service; as opposed to best effort, QoS is provided by the network and used to guarantee bandwidth for streaming media delivery

SAP - Secondary audio program

SDP - Session description protocol for streaming media initialization, or service delivery platform for providers to deliver media services.

Semantic gap – low level extracted features vs. meaning, understanding

Square pixel - 1:1 PAR

SVG - Scaleable vector graphics

SVM - Support vector machine

TRECVID - TREC video retrieval evaluation, sponsored by the National Institute of Standards and Technology (NIST)

TVML - TV program making language from NHK

UCC - User contributed content

UGC - User generated content

VDF - Virage Data Format

VSF - Video Services Forum

VXML - Vector markup language

WPL - Windows Play List: MS extensions to SMIL that allow for representing queries against media libraries, such as “play all with at least a three star rating”

WSX - SMIL with MS extensions

Zipf’s Law - a model with roots in NLP commonly used for describing the long tail phenomenon with regard to the popularity of VoD titles

Next post:

Previous post: