MPEG-DASH Content Generation Using MP4Box and x264
An interesting article describing how to produce MPEG-DASH content with open source tools.
A curation about new media technologies
An interesting article describing how to produce MPEG-DASH content with open source tools.
RGB Networks has announced that it is developing an open source version of its popular TransAct Transcoder. Called ‘Ripcode Transcoder’, after the company Ripcode, which was acquired by RGB Networks in 2010 and which originally developed TransAct, the new, cloud-enabled software transcoder will provide RGB Networks’ customers with greater control, integration and flexibility in their video delivery workflows.
In a pioneering move, and harnessing the industry momentum toward developing cloud-based solutions, RGB Networks is actively welcoming operators and vendors to be part of a community of contributors to the open source project.
RGB Networks’ CloudXtream solution for nDVR and dynamic Ad Insertion for Multiscreen (AIM) environments, launched in October 2013 and built on the industry standard open source cloud operating system OpenStack, has paved the way for this latest innovation. The company intends to build on this success with Ripcode, which will be an “open core” project, where the core technology from the TransAct Transcoder is being used to create the foundations of the open source project.
Suitable for a variety of applications, the Ripcode Transcoder will include the full feature set expected of an industrial-strength, professional transcoder, leaving customers open to select and integrate the packaging solution of their choice necessary to produce their desired Adaptive Bit Rate output formats.
The intended feature set of the open source Ripcode Transcoder will include:
Friday, September 26, 2014
The Society of Motion Picture and Television Engineers published a standard that codifies the Archive eXchange Format. An IT-centric file container that can encapsulate any number and type of files in a fully self-contained and self-describing package, AXF supports interoperability among disparate content storage systems and ensures content’s long-term availability, no matter how storage or file system technology evolves.
Designed for operational storage, transport, and long-term preservation, AXF was formulated as a wrapper, or container, capable of holding virtually unlimited collections of files and metadata related to one another in any combination. Known as “AXF Objects,” such containers can package, in different ways, all the specific information different kinds of systems would need in order to restore the content data. The format relies on the Extensible Markup Language to define the information in a way that can be read and recovered by any modern computer system to which the data is downloaded.
AXF Objects are essentially immune to changes in technology and formats. Thus, they can be transferred from one archive system into remote storage—geographically remote or in the cloud, for instance—and later retrieved and read by different archive systems without the loss of any essence or metadata.
AXF Objects hold files of any kind and any size. By automatically segmenting, storing on multiple media, and reassembling AXF Objects when necessary, “spanned sets” enable storage of AXF Objects on more than one medium. Consequently, AXF Objects may be considerably larger than the individual media on which they are stored. This exceptional scalability helps to ensure that AXF Objects may be stored on any type or generation of media. The use of “collected sets” permits archive operators to make changes to AXF Objects or files within them, while preserving all earlier versions, even when write-once storage is used.
The nature of AXF makes it possible for equipment manufacturers and content owners to move content from their current archive systems into the AXF domain in a strategic way that does not require content owners to abandon existing hardware unless or until they are ready to do so. In enabling the recovery of archived content in the absence of the systems that created the archives, AXF also offers a valuable means of protecting users’ investment in content. By maintaining preservation information such as fixity and provenance as specified by the OAIS model, AXF further enables effective long-term archiving of content. Resilience of data is ensured through use of redundant AXF structures and cryptographic hash algorithms.
AXF already has been employed around the world to help businesses store, protect, preserve, and transport many petabytes of file-based content, and the format is proving fundamental to many of the cloud-based storage, preservation, and IP-based transport services available today.
Source: TV Technology
Thursday, September 18, 2014
This article shows you how to setup GPAC for your OnDemand and Live contents.
CableLabs has launched a 4K-focused microsite that provides access to Ultra HD/4K video clips to help platform developers, vendors, network operators and other video pros conduct tests with the emerging eye-popping format.
CableLabs said it’s offering the content under the Creative Commons License, meaning it can be used freely for non-commercial testing, demonstrations and the general advancement of technology.
As vendors utilize content from the site to test new technology, CableLabs helps the industry get one step closer to standardizing 4K content and delivering to the home.
As of this writing, the site hosts seven videos, all shot with a Red Epic camera. The longest of the batch is a fireman-focused clip titled “Seconds That Count” that runs 5 minutes and 22 seconds.
On the site, CableLabs has integrated an upload form for anyone who wants to share their 4K videos for the purpose of testing. Interested particiapnts are directed to provide a lower bite-rate HD file for preview purposes along with a 4K version. CableLabs is accepting pre-transcoded versions using MPEG HEVC or AVC, or Apple ProRes version. CableLabs will take on the task of transcoding the content into two high quality versions available for download on the website. CableLabs notes that uploaded content might be used for demos at forums, shows, and conferences.
CableLabs is launching the site as the cable industry just begins to develop plans around 4K. Among major U.S. MSOs, Comcast plans to launch an Internet-based, on-demand Xfinity TV 4K app before the end of the year that will initially be available on new Samsung UHD. The MSO is also working with partners on a new generation of boxes for its X1 platform that uses HEVC and can decode native 4K signals.
On the competitive front, DirecTV president and CEO Mike White said on the company's second quarter earnings call that the satellite TV giant will be ready to deliver 4K video on an on-demand basis this year, and be set up to follow with live 4K streaming next year or by early 2016.
Source: Multichannel News
HTTP Streaming is a recent topic in multimedia communications with on-going standardization activities, especially with the MPEG DASH standard which covers on demand and live services. One of the main issues in live services deployment is the reduction of the overall latency. Low or very low latency streaming is still a challenge.
In this paper, we push the use of DASH to its limits with regards to latency, down to fragments being only one frame, and evaluate the overhead introduced by that approach and the combination of: low latency video coding techniques, in particular Gradual Decoding Refresh; low latency HTTP streaming, in particular using chunked-transfer encoding; and associated ISOBMF packaging.
We experiment DASH streaming using these techniques in local networks to measure the actual end-to-end latency, as low as 240 milliseconds, for an encoding and packaging overhead in the order of 13% for HD sequences and thus validate the feasibility of very low latency DASH live streaming in local networks.
By Nassima Bouzakaria, Cyril Concolato and Jean Le Feuvre
A significant step in the road to Ultra High Definition TV services has been taken with the approval of the DVB-UHDTV Phase 1 specification at the 77th meeting of the DVB Steering Board. The specification includes an HEVC Profile for DVB broadcasting services that draws, from the options available with HEVC, those that will match the requirements for delivery of UHDTV Phase 1 and other formats. The specification updates ETSI TS 101 154 (Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream).
The new DVB-UHDTV Phase 1 will allow images with four times the static resolution of the 1080p HDTV format, at frame rates of up to 60 images per second. Contrast will be drastically improved by increasing the number of bits per pixel to 10 bit. From the wide range of options defined in the HEVC Main 10 profile, Level 5.1 is specified for UHD content for resolutions up to 2160p. For HD content, HEVC Main profile level 4.1 is specified for supporting resolutions up to 1080p.
The DVB-UHDTV Phase 1 specification takes into account the possibility that UHDTV Phase 2 may use higher frame rates in a compatible way, which will add further to the image quality of UHDTV Phase 1.
“HEVC is the most recently-developed compression technology and, among other uses, it is the key that will unlock UHDTV broadcasting,” said DVB Steering Board Chairman, Phil Laven. “This new DVB–UHDTV Phase 1 specification not only opens the door to the age of UHDTV delivery but also potentially sets the stage for Phase 2, the next level of UHDTV quality, which will be considered in upcoming DVB work,” he continued.
Also approved was the specification for Companion Screens and Streams, Part 2: Content Identification and Media Synchronization. Companion Devices (tablets, smart phones) enable new user experiences for broadcast service consumption. Many of these require synchronisation between the Broadcast Service at the TV Device and the Timed Content presented at the Companion Device. This specification focuses on the identification and synchronisation of a Broadcast Service on a TV Device (Connected TV or STB and screen) and Timed Content on a Companion Screen Application running on a Companion Device. Part 2 outlines the enabling factors for the identification of, and synchronisation with, broadcast content, timed content and trigger events on TV devices (for example a Connected TV or STB) and related content presented by an application running on a personal device.
Another specification to gain approval from the Steering Board was the MPEG-DASH Profile for Transport of ISO BMFF Based DVB Services over IP Based Networks. This specification defines the delivery of TV content via HTTP adaptive streaming. MPEG-DASH covers a wide range of use cases and options. Transmission of audiovisual content is based on the ISOBMFF file specification. Video and audio codecs from the DVB toolbox that are technically appropriate with MPEG-DASH have been selected. Conditional Access is based on MPEG Common Encryption and delivery of subtitles will be XML based. The DVB Profile of MPEG-DASH reduces the number of options and also the complexity for implementers. The new specification will facilitate implementation and usage of MPEG-DASH in a DVB environment.
The three new specifications will now be sent to ICT standards body ETSI for formal standardisation and the relevant BlueBooks will be published shortly.
Source: Advanced Television
The Consumer Electronics Association has announced updated core characteristics for ultra high-definition TVs, monitors and projectors for the home. As devised and approved by CEA’s Video Division Board, these characteristics build on the first-generation UHD characteristics released by CEA in October 2012.
These expanded display characteristics (CEA’s Ultra High-Definition Display Characteristics V2) – voluntary guidelines that take effect in September 2014 – are designed to address various attributes of picture quality and help move toward interoperability, while providing clarity for consumers and retailers alike.
Under CEA’s expanded characteristics, a TV, monitor or projector may be referred to as Ultra High-Definition if it meets the following minimum performance attributes:
This week Netflix is pleased to begin streaming all 62 episodes of Breaking Bad in UltraHD 4K. Breaking Bad in 4K comes from Sony Pictures Entertainment’s beautiful remastering of Breaking Bad from the original film negatives. This 4K experience is available on select 4K Smart TVs.
As pleased as I am to announce Breaking Bad in 4K, this blog post is also intended to highlight the collaboration between Sony Pictures Entertainment and Netflix to modernize the digital supply chain that transports digital media from content studios, like Sony Pictures, to streaming retailers, like Netflix.
Netflix and Sony agreed on an early subset of IMF for the transfer of the video and audio files for Breaking Bad. IMF stands for Interoperable Master Format, an emerging SMPTE specification governing file formats and metadata for digital media archiving and B2B exchange.
IMF specifies fundamental building blocks like immutable objects, checksums, globally unique identifiers, and manifests (cpl). These building blocks hold promise for vastly improving the efficiency, accuracy, and scale of the global digital supply chain.
At Netflix, we are excited about IMF and we are committing significant R&D efforts towards adopting IMF for content ingestion. Netflix has an early subset of IMF in production today and we will support most of the current IMF App 2 draft by the end of 2014.
We are also developing a roadmap for IMF App 2 Extended and Extended+. We are pleased that Sony Pictures is an early innovator in this space and we are looking forward to the same collaboration with additional content studio partners.
Breaking Bad is joining House of Cards season 2 and the Moving Art documentaries in our global 4K catalog. We are also adding a few more 4K movies for our USA members. We have added Smurfs 2, Ghostbusters, and Ghostbusters 2 in the United States. All of these movies were packaged in IMF by Sony Pictures.
By Kevin McEntee, VP Digital Supply Chain, Netflix
We're excited to announce that Netflix streaming in HTML5 video is now available in Safari on OS X Yosemite! We've been working closely with Apple to implement the Premium Video Extensions in Safari, which allow playback of premium video content in the browser without the use of plugins.
If you're in Apple's Mac Developer Program, or soon the OS X Beta Program, you can install the beta version of OS X Yosemite. With the OS X Yosemite Beta on a modern Mac, you can visit Netflix.com today in Safari and watch your favorite movies and TV shows using HTML5 video without the need to install any plugins.
We're especially excited that Apple implemented the Media Source Extensions (MSE) using their highly optimized video pipeline on OS X. This lets you watch Netflix in buttery smooth 1080p without hogging your CPU or draining your battery. In fact, this allows you to get up to 2 hours longer battery life on a MacBook Air streaming Netflix in 1080p - that’s enough time for one more movie!
Apple also implemented the Encrypted Media Extensions (EME) which provides the content protection needed for premium video services like Netflix.
Finally, Apple implemented the Web Cryptography API (WebCrypto) in Safari, which allows us to encrypt and decrypt communication between our JavaScript application and the Netflix servers.
The Premium Video Extensions do away with the need for proprietary plugin technologies for streaming video. In addition to Safari on OS X Yosemite, plugin-free playback is also available in IE 11 on Windows 8.1, and we look forward to a time when these APIs are available on all browsers.
Congratulations to the Apple team for advancing premium video on the web with Yosemite! We’re looking forward to the Yosemite launch this Fall.
By Anthony Park and Mark Watson, Netflix Tech Blog
Wednesday, June 04, 2014
Labels: Encrypted Media Extensions, HTML 5, Media Source Extensions, OTT TV
As part of the 4Ever project, we have been releasing an HEVC and DASH ultra high definition dataset, ranging from 8bit 720p 30Hz up to 10bit 2160p 60 Hz. The dataset is released under CC BY-NC-ND.
The data set web page is here, and more information on the dataset can also be found in this article.
Source: GPAC
With most competing browsers and the content industry embracing the W3C EME specification, Mozilla has little choice but to implement EME as well so our users can continue to access all content they want to enjoy. Read on for some background on how we got here, and details of our implementation.
Digital Rights Management (DRM) is a tricky issue. On the one hand content owners argue that they should have the technical ability to control how users share content in order to enforce copyright restrictions. On the other hand, the current generation of DRM is often overly burdensome for users and restricts users from lawful and reasonable use cases such as buying content on one device and trying to consume it on another.
DRM and the Web are no strangers. Most desktop users have plugins such as Adobe Flash and Microsoft Silverlight installed. Both have contained DRM for many years, and websites traditionally use plugins to play restricted content.
In 2013 Google and Microsoft partnered with a number of content providers including Netflix to propose a “built-in” DRM extension for the Web: the W3C Encrypted Media Extensions (EME).
The W3C EME specification defines how to play back such content using the HTML5 video element, utilizing a Content Decryption Module (CDM) that implements DRM functionality directly in the Web stack. The W3C EME specification only describes the JavaScript APIs to access the CDM. The CDM itself is proprietary and is not specified in detail in the EME specification, which has been widely criticized by many, including Mozilla.
Mozilla believes in an open Web that centers around the user and puts them in control of their online experience. Many traditional DRM schemes are challenging because they go against this principle and remove control from the user and yield it to the content industry.
Instead of DRM schemes that limit how users can access content they purchased across devices we have long advocated for more modern approaches to managing content distribution such as watermarking. Watermarking works by tagging the media stream with the user’s identity. This discourages copyright infringement without interfering with lawful sharing of content, for example between different devices of the same user.
Mozilla would have preferred to see the content industry move away from locking content to a specific device (so called node-locking), and worked to provide alternatives.
Instead, this approach has now been enshrined in the W3C EME specification. With Google and Microsoft shipping W3C EME and content providers moving over their content from plugins to W3C EME Firefox users are at risk of not being able to access DRM restricted content (e.g. Netflix, Amazon Video, Hulu), which can make up more than 30% of the downstream traffic in North America.
We have come to the point where Mozilla not implementing the W3C EME specification means that Firefox users have to switch to other browsers to watch content restricted by DRM.
This makes it difficult for Mozilla to ignore the ongoing changes in the DRM landscape. Firefox should help users get access to the content they want to enjoy, even if Mozilla philosophically opposes the restrictions certain content owners attach to their content.
As a result we have decided to implement the W3C EME specification in our products, starting with Firefox for Desktop. This is a difficult and uncomfortable step for us given our vision of a completely open Web, but it also gives us the opportunity to actually shape the DRM space and be an advocate for our users and their rights in this debate. The existing W3C EME systems Google and Microsoft are shipping are not open source and lack transparency for the user, two traits which we believe are essential to creating a trustworthy Web.
The W3C EME specification uses a Content Decryption Module (CDM) to facilitate the playback of restricted content. Since the purpose of the CDM is to defy scrutiny and modification by the user, the CDM cannot be open source by design in the EME architecture. For security, privacy and transparency reasons this is deeply concerning.
From the security perspective, for Mozilla it is essential that all code in the browser is open so that users and security researchers can see and audit the code. DRM systems explicitly rely on the source code not being available. In addition, DRM systems also often have unfavorable privacy properties. To lock content to the device DRM systems commonly use “fingerprinting” (collecting identifiable information about the user’s device) and with the poor transparency of proprietary native code it’s often hard to tell how much of this fingerprinting information is leaked to the server.
We have designed an implementation of the W3C EME specification that satisfies the requirements of the content industry while attempting to give users as much control and transparency as possible. Due to the architecture of the W3C EME specification we are forced to utilize a proprietary closed-source CDM as well. Mozilla selected Adobe to supply this CDM for Firefox because Adobe has contracts with major content providers that will allow Firefox to play restricted content via the Adobe CDM.
Firefox does not load this module directly. Instead, we wrap it into an open-source sandbox. In our implementation, the CDM will have no access to the user’s hard drive or the network. Instead, the sandbox will provide the CDM only with communication mechanism with Firefox for receiving encrypted data and for displaying the results.
Traditionally, to implement node-locking DRM systems collect identifiable information about the user’s device and will refuse to play back the content if the content or the CDM are moved to a different device.
By contrast, in Firefox the sandbox prohibits the CDM from fingerprinting the user’s device. Instead, the CDM asks the sandbox to supply a per-device unique identifier. This sandbox-generated unique identifier allows the CDM to bind content to a single device as the content industry insists on, but it does so without revealing additional information about the user or the user’s device. In addition, we vary this unique identifier per site (each site is presented a different device identifier) to make it more difficult to track users across sites with this identifier.
Adobe and the content industry can audit our sandbox (as it is open source) to assure themselves that we respect the restrictions they are imposing on us and users, which includes the handling of unique identifiers, limiting the output to streaming and preventing users from saving the content. Mozilla will distribute the sandbox alongside Firefox, and we are working on deterministic builds that will allow developers to use a sandbox compiled on their own machine with the CDM as an alternative. As plugins today, the CDM itself will be distributed by Adobe and will not be included in Firefox. The browser will download the CDM from Adobe and activate it based on user consent.
For all of Netflix’s complaints about Internet service providers harming video performance, one of the company’s top technology experts is confident that the streaming company can solve most of its customers’ problems.
David Fullagar, Netflix’s director of content delivery architecture, spoke about the company’s plans Monday at the Content Delivery Summit in New York. He described the hardware Netflix uses in its Open Connect content delivery network (CDN), noting that the company has a technological advantage over traditional CDNs because it’s always delivering content to devices running Netflix’s own software rather than using a hodgepodge of products built by other companies.
The best-known parts of Open Connect are probably the storage boxes that Internet service providers can take into their own networks to bring content closer to consumers. ISPs can also peer with Netflix, exchanging traffic directly without hosting Netflix equipment. But these aren’t the only ways Netflix’s Open Connect technology can deliver good quality.
Netflix used to use third-party CDNs such as Akamai, but it has moved most of its traffic over to Open Connect in the past couple of years. Outside the US, 100 percent of Netflix traffic is distributed using Open Connect equipment. The percentage is in the “high 90s” in the US, with plans to hit 100 percent this summer. Even if the storage boxes aren’t inside an ISP’s network, they’re not too far away. They could even be in the same data centers, the Internet exchange points where Netflix transit providers connect to ISPs.
Fullagar was asked by an audience member how Netflix works with ISPs who offer competing products. “From a quality point of view we don’t need to be that close to the end user for the sort of video we serve,” Fullagar said. “Having extremely low latency is nice” because it allows videos to start playing faster. However, “what we’re most interested in is a good, uncongested link, and that doesn’t necessarily have to be very low latency.”
Netflix’s peering with ISPs has been controversial because some of the Internet providers have demanded payment in exchange for accepting Netflix traffic. Netflix gave in to Verizon and Comcast, agreeing to pay both companies, but it has claimed that the Federal Communications Commission should force the ISPs to provide free peering. Netflix has sent its traffic through congested links when its business disputes have gone unresolved, deteriorating quality despite the other steps Netflix takes to improve it. (Comcast and analyst Dan Rayburn accused Netflix of purposely sending traffic through congested links.)
When asked how much Netflix can affect streaming performance given that it controls the server end of the connection as well as the user’s software, Fullagar said, “I think we’re on the tip of the iceberg of being able to do quite a lot there.” Netflix’s access to information about each customer’s device and Internet connection will fuel some as-yet-unrevealed strategies for improving quality, he said.
“We have extra information beyond just, hey this is someone wanting this file," he said. "At connection time we know the sort of client they are, whether it’s a Wii or a PS4 or a streaming stick. We know the network they’re on, we know a bunch of historical information about latency and quality of service we’ve had to those networks. We know whether they’re connected on a device that’s wired or wireless. There’s a bunch of hints that we have there.”
The company has started some “experiments that are working out really well, and in the future we’ll talk more about that.”
Netflix itself has equipment at about 20 Internet exchange points in North America and Europe and has "tens if not hundreds of embedded caches in ISP networks," Fullagar said.
The Network Team
Netflix’s Open Connect division has about 40 people, Fullagar said. About 20 are software engineers who either build software for Netflix servers or work on the company's management software, which runs on Amazon’s cloud network and performs functions such as load balancing. Another 10 Open Connect employees are network architects, and another 10 are in operations.
Netflix stores video on two types of boxes that it designed, one that’s heavy on HDDs and another that’s all SSDs. Netflix built them in part because it couldn’t find the right mix of compute and storage capabilities in products from hardware vendors.
The HDD unit is a 4U-sized chassis that holds 216TB on 36 drives of 6TB each. It has 64GB RAM, a 10 Gigabit NIC, and some SSD for frequently accessed content.
The smaller, 1U, SSD-only unit contains 14 drives of a terabyte each, 256GB of RAM and a 40 Gigabit NIC. About 75 percent of the cost of both the HDD and SSD boxes is taken up by storage. Each unit uses Intel CPUs.
Netflix refreshes hardware annually to improve performance. At its biggest locations, Netflix keeps multiple copies of its entire video library in case of failure. That’s more than a petabyte of video files for its North American catalog.
The company relies heavily on open source software, including FreeBSD and the Nginx Web server, as well as several management applications the company wrote itself.
Netflix distributes multiple terabits per second and accounts for an astonishing one-third of North American Internet traffic at peak times, i.e. the traditional TV “prime time” each evening. During off-peak hours in the middle of the night, Netflix fills disks with the videos its algorithms say people are most likely to watch the next day. This dramatically reduces network utilization during peak hours.
The management software Netflix runs on Amazon Web Services handles distribution of content, analyzes network performance, and connects users to the proper video sources. Netflix wrote its own adaptive bitrate algorithms to react to changes in throughput, and a CDN selection algorithm to adapt to changing network conditions such as overloaded links, overloaded servers, and errors, the company said.
When Netflix used multiple third-party CDNs, connections would fail over from one to another in case of error. Netflix still uses the same failover technology, but with “multiple hierarchies” within Open Connect instead of multiple CDNs, Fullagar said.
Although Netflix is moving all its data onto Open Connect hardware, that doesn’t automatically reduce the controversial role its transit providers Level 3 and Cogent have played in carrying traffic. Level 3 and Cogent have warred with ISPs over whether they should have to pay in order to send Netflix traffic onto their networks. As a result, interconnections between these transit providers and ISPs have gotten congested, reducing the quality of Netflix and other Web services that travel over the links.
The role of transit providers is only reduced when Netflix signs direct interconnection agreements with ISPs, as it has done Verizon and Comcast, a Netflix spokesperson said. In the absence of such agreements, Netflix data passes through the company’s own CDN and then through a transit provider before hitting an ISP's network.
The payment controversies don’t necessarily affect the working relationship between the technical teams of Netflix and ISPs, though. “Engineering people at companies, whether large or small, operate independently of commercial interests,” Fullagar said. “In the UK, one of our biggest competitors is one of our best networking partners.”
Source: Ars Technica
The PrestoCentre Standards Register gathers information on standards for content and metadata used across all communities involved in audiovisual digital preservation.
Orban has introduced Version 2.7 of the free Loudness Meter for Windows (Vista/7/8) and Mac (OS X 10.6 or greater). The software adds two important new features: support for up to 7.1-channel surround, and the ability to analyse files in several common formats offline to measure their ITU-R BS.1770-3 Integrated Loudness and Loudness Range. This combination of new features allows any organization to qualify files, whether stereo or surround, for compliance with the Calm Act and EBU R128.
Like its predecessor, version 2.7 measures loudness using both the Jones & Torick (CBS Technology Center) and BS.1770-3 algorithms, displaying BS.1770-3 Short-Term, Momentary, and Integrated loudness in addition to the Jones and Torick loudness.
HEVC Version 1 demonstrates it has achieved more than 50% bitrate savings compared to MPEG 4 AVC/H.264, the MPEG standards group announced following its 108th meeting held at the beginning of April in Valencia, Spain.
In a verification test campaign of the HEVC, ITU-T Rec H.265 | ISO/IEC 23008-2 compression standard following the finalisation of the standard last year “a formal subjective quality assessment has been executed using a large variety of video material, ranging from wide-screen VGA resolution up to 4K. The material had not previously been used in optimizing HEVC's compression technology”, MPEG, announced in a prepared statement, adding: “Clear evidence was found that HEVC is able to achieve 50% bitrate savings and more, compared to the AVC High Profile”. The results will be made publicly available in the report N14420, to be published on the MPEG website.
HEVC's scope continues to be extended with a call for proposals on Screen Content Coding, the compression of video containing rendered graphics and animation. This extention is scheduled for completion by the end of 2015.
HEVC's 2nd edition includes support for additional colour formats and higher precision. The ‘Range Extensions amendment’, with technology allowing efficient compression of video content for colour sampling formats beyond 4:2:0 and up to 16 bits of processing precision has been finalised. In particular, the lossless and near lossless range of visual quality is more efficiently compressed than is possible with the current version 1 technology.
Web Video Coding the standard for a worldwide, reasonable and non-discriminatory, and free of charge licensable online compression scheme for use in browser has reached the final stage before approval. MPEG expects to complete the Final Draft International Standard in February 2015.
MPEG is also working on standardising what it refers to as ‘free-viewpoint television’. A public seminar on FTV (Free-viewpoint Television) will be held on July 8th to align MPEG's future standardization of FTV technologies with users and industry needs. Targeted application scenarios are Super Multiview Displays “where hundreds of very densely rendered views provide horizontal motion parallax for realistic 3D visualization, extracted from a dense or sparse set of input views/cameras in a circular or linear arrangement”.
“Integral Photography, where 3D video with both horizontal and vertical motion parallax are captured for realistic display”. And “Free Navigation that allows the user to freely navigate or fly through the scene, not just along predefined pathways”.
MPEG expects that future FTV systems will require new functionalities such as a substantial increase in coding efficiency and rendering capability compared to technology currently available. The FTV initiative will also consider novel means for acquiring 3D content that have recently emerged, e.g. plenoptic and light field cameras. You are invited to join the FTV seminar to learn more about MPEG activities in this area and to help revolutionise the viewing experience.
The increases in spatial resolution and colour resolution, scalable coding and autostereoscopic 3D, or in MPEG speak Multi-view, leads to amendments in the trusty old MPEG-TS, transport stream layer. The amendment specifies transport of layered coding extensions for the scalable and multiview enhancements of HEVC, and the signaling of associated descriptors so that different layers can be encapsulated and transported individually.
To support the new standards to offer 4K/8K UHDTV services by using a newly developed MPEG standard, MPEG Media Transport (MMT) an effort to promote the new transport layer standard has begun in Japan with a growing number of companies implementing MMT for various applications. MPEG is organising MMT Developers' Day in conjunction with its 109th meeting this July in Sapporo, Japan.
MPEG is also working on 3D audio and dynamic range control. The DRC system provides comprehensive control to adapt the audio as appropriate for the particular content, the listening device, environment, and user preferences. The loudness control can be applied to meet regulatory requirements and to improve the user experience, especially for content with large loudness variations.
By Donald Koeleman, Broadband TV News
LG Electronics is making Connect SDK, an open source software development kit, available to Google Android and iOS developers to extend their mobile experience across tens of millions of big TV screens around the world.
By unifying device discovery and connectivity across multiple television platforms, Connect SDK is the first to truly address the complexity associated with implementing second screen capabilities while reaching the largest installed base of smart TVs and connected devices.
For consumers, this means that LG's new Smart TVs, powered by the webOS platform, as well as other popular TV devices, will be able to connect and interact with more mobile apps – further enhancing their second screen television experience.
Using Connect SDK, application developers can connect their mobile applications with 2014 LG webOS Smart TVs, LG Smart TVs from 2012 and 2013, Roku, Google Chromecast, and Amazon Fire TV devices:
The adoption of the Digital Cinema projection has opened new opportunities for the distribution of films. The digitization of commercial film collections and film heritage is developing rapidly. Up to now there has been no specific format for the interoperable exchange and conservation of cinematographic works with the highest required quality.
In February 2011, at the request of the French Centre National du Cinema, the Commission Supérieure Technique started to work on the technical recommendation CST-RT021. The goal of this recommendation is to ensure that the commercial exploitation of digitized cinematographic works on modern digital distribution channels is made possible with the required quality.
The first version of the CST-RT021-MFFS specification has been published. It has been designed to be consistent with current developing standards, specifically the SMPTE Interoperable Master Format (Technical Committee 35 PM). The expert group led by CST is proposing that film laboratories, audio-visual equipment manufacturers and all interested parties participate in a Plugfest to test the specification.
This event will allow testing the wrapping, encoding of image and sound and colour coding detailed in the CST-RT021-MFFS, in order to ensure the interoperability of commercial solutions.
Source: ETSI
During the 2014 NAB show in Las Vegas, the DASH Industry Forum organized a 2hr session with nine presentations giving you the latest technical, business and deployment updates on MPEG-DASH. Speakers represented companies from across the DASH ecosystem. Here is the video replay of this event.
As the UK broadcast industry moves towards full digital delivery, the DPP has launched A Producer’s Guide to File Delivery, a complete handbook that explains – from a production point of view – all that is involved and required in the new process.
At Technology Summit on Cinema here at NAB, Walt Disney’s Howard Lukk said there can now be a total of 35,000 possible versions of a movie that will have to be generated to serve all possible ways the movie can be scene. This is apparently based on a permutation (or multiplication) of all of the variables in creating a particular version.
This presumably means cinematic versions, packaged media versions and versions for cable, satellite, broadcast and internet distribution. While this high may be possible, it is also unlikely, but nonetheless is in the thousands and represents a huge challenge for the industry.
For theatrical distribution for example, Walt Disney’s Leon Silverman said they typically need to create over 100 masters of each film. That includes versions with different audio mixes, different languages and different platform specific needs.
To illustrate what they are doing he described two movies in his talk. The new movie “Planes” for example, had 126 different masters with one of the plane characters having a different name and artwork depending on what country it was screened in.
He also showed a clip from the movie “Frozen” playing the song “Let it Go” where every verse was sung in a different language. There must have been 40 different languages with each performance created by a difference signer, yet they were blended perfectly to sound seamless. Incredible! He also lamented the versioning that is required to market a film requiring dozens of thumbnail images that must be used on various web sites.
Overall, he described what he called the “new post post world order,” which he said has changed the landscape for just about every aspect of the way movies are made today. He started his talk by noting that the workflow of cinema is increasingly being merged with the TV production workflow and that it may be hard to tell the difference in the future. He then gave details on ways the industry is complex (versioning being one aspect), connected, global and secure. While the title of the session was “From Camera to Consumer”, he renamed it “From Camera to Netflix”.
Filmmakers must work in a connected, networked global environment, but he did not seem particularly concerned about technology being able to handle the needs going forward. Security is more of a concern for Disney with isolated networks, multiple security protocol and audits done to help protect their IP. Success or failure here can have huge impacts on the studio and careers as well.
His description of what is needed was so incredible it led others on the panel to hope they never had his job.
How one gets to 35,000 versions is still a little unclear, but presumably includes all the different aspect ratios, video formats, audio formats, broadcast formats, Internet formats and may include encoding formats and all of those variables as well. While a studio would not necessarily have to produce all those versions, someone somewhere would adding enormous overheads to the process.
One solution to the format issue is a project that was also described at the event called IMF (Interoperable Master Format). This is an industry-wide effort started by the major studios that began as a Business to Consumer version of the Business to Business cinema formatting standards effort that is now called DCI (Digital Cinema Initiatives). Speakers from Disney, Sony and Fox described their efforts to create a SMPTE standard (now issued) and to implement the format at their studios.
The basic idea is to be able to have a “core framework” that consists of the main visual aspects plus a series of “modular application” that are plug ins to the format that add specific functionality like codecs, specific resolutions and frame rates and other aspects. This is all managed by a Composition Play List (CPL). This allows the generation of localized versions from a single file format.
Vantrix announced the creation of an open source version of the H.265 encoder, calling it the F265 project. The project aims to accelerate the industry-wide development and adoption of H.265, also known as High Efficiency Video Coding (HEVC).
H.265 is the successor to the industry standard H.264 codec used for video compression. The new specification, ratified in 2013, provides for double the data compression ratio of H.264 while ensuring the same level of visual quality. It is expected to be a major driver in the adoption of 4k Ultra High Definition and beyond by reducing the amount of transmission bandwidth required versus current standards.
Vantrix's F265 encoder will be licensed under the OSI BSD terms, enabling access to source code, free redistribution, and derived works. The intent is to encourage both researchers and commercial entities to contribute to the refinement and evolution of the code to accelerate the implementation of both software and hardware systems.
The project will initially target high quality offline encoding, but will not be limited to this scope. It is designed from the ground up to maximize quality and performance in offline and real time scenarios using recent hardware technology and interfaces such as the Intel AVX2 instruction set.
The F265 project site is in the process of being finalized and those interested in being notified of its availability can sign up at www.vantrix.com/f265.
Source: Vantrix
The new version of EBUCore can be downloaded as EBU Tech 3293 v1.5. Thanks to the efforts of the metadata experts in the EBUCore developer and user community, the new version of the EBU's metadata flagship includes several enrichments.
The most prominent update probably is the integration of the recently published EBU Audio Definition Model (ADM) (EBU Tech 3364). The ADM provides a complete set of technical and informative metadata to describe a file's audio content.
The PlayReady team, working in conjunction with the Windows Azure Media Services team is pleased to announce the availability of the Microsoft Smooth Streaming Client 2.5 with MPEG DASH support.
This release adds the ability to parse and play MPEG DASH manifests in the Smooth Streaming Media Engine (SSME) to provide a Windows7/Windows8 and MacOS solution using MPEG DASH for On-Demand scenarios.
Developers that wish to move content libraries to DASH have the option of using DASH in places where Silverlight is supported. The existing SSME object model forms the basis of DASH support in the SSME. For example, DASH concepts like Adaptation Sets and Representations have been mapped to their logical counterpart in the SSME.
Also, Adaptation Sets are exposed as Smooth Streams and Representations are exposed as Smooth Tracks. Existing Track selection and restriction APIs can be expected to function identically for Smooth and DASH content.
In most other respects, DASH support is transparent to the user and a programmer who has worked with the SSME APIs can expect the same developer experience when working with DASH content.
Some details on the DASH support compared to Client 2.0:
The DPP has released a set of standardised UK Quality Control Requirements to help producers carry out the necessary QC checks to ensure they deliver broadcast quality files, which meet the necessary standards.
The guidelines, which are published six months out from the 1st October, when UK broadcasters will make the move to full digital delivery, have been produced in collaboration with the EBU.
The EBU’s Strategic Programme for Quality Control is currently defining all possible QC criteria as well as guidance on their implementation. The DPP has taken the EBU definitions and created a minimum set of tests and tolerance levels required for UK broadcasters.
Included in the new guidelines are AS-11 file compliance checks, and Automated Quality Control tests for Video and Audio. Examples are loudness levels, and freeze frames.
The guidelines also includes a list of ‘Eyeball’ tests that a producer needs to undertake before delivering the programme. Included on the checklist are such things as Audio Sync, Buzzing, Unclear Sound, Clock, and Visual Focus etc.
Commenting on the new work, Kevin Burrows, DPP Technical Standards Lead and CTO Broadcast & Distribution, Channel 4, said, “The DPP’s QC guidelines offer a standardised set of checks expected prior to the digital delivery of broadcast programmes to the UK Broadcasters. They are designed to streamline the QC process and help minimise the issues that can arise in programme delivery.”
Andy Quested, Principal Technologist BBC, who has been instrumental in devising the guidelines as well as leading the EBU’s work, added, “Post houses and broadcasters are seeing a significant increase in the volume of file-based programmes they need to handle. It is really important that the QC tests give accurate and consistent results. The new guidelines don’t just explain the process and the test to be carried out, they also make it clear who is responsible for signing off the QC process.”
The guidelines will be implemented by broadcasters as they move towards digital file delivery. Production companies will be required to deliver their compliant files along with a valid QC report, as has previously been the case with the PSE report.
The QC report can be delivered using a separate PDF or XML file output from the QC tool. DPP broadcasters will accept PDF QC reports initially, but will encourage XML reports over time once they are standardised by the EBU group and the DPP.
Source: Digital Production Partnership
Usually when creating a video, all that is needed is to encode it using a codec (for example H.264 or HEVC). However, to transmit a video using MPEG-DASH, an extra segmentation step is required. Typical encoders do not provide this step and produce content which is not compatible with DASH.
Hosted as a GitHub project, dash.encrypt is available as an open-source application written in Java. It takes encoded video and audio from an array of different formats and repackages them as valid DASH streams. It also generates the required manifest which is the table of contents for the stream.
Firefly Cinema launches FirePlay, the first free universal player able to read all digital file formats in real time, up to a resolution of 4K.
FirePlay is extremely easy to use and requires no specific knowledge. It is intended for all professionals working in digital cinema, and simplifies the work of DPs, camera assistants, or DITs.
The UK's Digital Production Partnership (DPP) has agreed an industry document format standard for the exchange of subtitles for the hard of hearing and audio description.
ligned with the EBU’s new subtitle document format (EBU-TT), produced in July 2012, the new standard has been created to support UK broadcasters’ requirements for subtitle and audio description script transfer.
The DPP’s subtitle document format extends EBU-TT with metadata required to support the workflow for delivering prepared subtitles and captured live subtitles in addition to the scripts used during the production of audio description.
All DPP subtitle documents are valid EBU-TT, itself based on the W3C TTML recommendation. The format separates the text and it’s associated display timing from information about that text, such as where it should be placed on the screen, the font style, size and colour, and separate metadata such as whether a given subtitle is describing dialogue, music, sounds effects etc., and what language the subtitles are in.
The format is a flavour of XML whose format can be validated using off the shelf tools and extended to meet specific requirements while still being interchangeable. In addition it can support all Unicode characters and arbitrary fonts, should downstream platforms support them. EBU is in the process of finalising the draft guidance for converting from the legacy STL format to EBU-TT.
DPP’s subtitle document format will allow UK broadcasters and access service providers to move away from legacy formats, proprietary or otherwise, and towards an open future-facing format that can be used to provide subtitles on broadcast television and online. By agreeing this format before making the transition, DPP is able to lay the groundwork for a common UK interchange format that will benefit all businesses that need to exchange these documents.
Companies that manufacture access services authoring and processing tools have a clear target format, which is vital in a historically fragmented marketplace. Broadcasters and distributors similarly will have a lower cost of adoption of this richer format that is not encumbered by the constraints of legacy formats.
Kevin Burrows, DPP Technical Standards Lead and CTO Broadcast & Distribution, Channel 4, said, “This new subtitle standard, encompassing the existing EBU TT specification, will allow for the display of subtitles on current and future consumer platforms by the UK broadcasters. This will benefit viewers by enabling a consistent viewer experience across their services.”
Source: Digital Production Partnership
For the next two weeks, we’re running a trial in conjunction with Radio 3 to deliver surround sound to your browser for a series of classical music concerts. On the Radio 3 blog, Rupert Brun explains the background to the trial and how to get involved.
Here in Broadcast & Connected Systems at BBC R&D, we are always looking for new ways to apply our technology research to extend the reach of BBC content to the maximum number of users. Surround sound isn’t new, but delivering it to the home via the Internet has traditionally meant installing plugins or other applications, limiting the platforms and consumers we can target.
In this experiment, we believe we are the first broadcaster in the world to stream a live outside broadcast in discrete multichannel audio to the home using MPEG-DASH, and we're doing it using just a compliant web browser - no plugins, no separate software installation required.
Why Stream to the Browser, and Why Now?
The beauty of the browser is that it is (almost) ubiquitous. Every PC, tablet and smart phone has a browser installed when it ships. Increasingly, smart TVs, set top boxes and games consoles have some form of browser environment available, bringing HTML5, CSS and Javascript functionality to the majority of consumer electronic devices.
People expect to be able to consume BBC content on any platform. To enable this, the BBC currently has to support a number of streaming protocols and has to maintain many different applications with differing code bases and levels of functionality in order to support hundreds of different set top boxes, smart TVs, mobile devices and desktop environments.
What if we could have a single encoding and distribution workflow and a single cross-platform client application, reducing the complexity of distribution and allowing our developers to concentrate on delivering great user experiences?
From a listener’s perspective, the browser “just works” which makes accessing our services much easier. Removing the requirement to install plugins or other software removes a significant barrier for some users. Indeed, for cross-platform compatibility, security and stability, many browser vendors have decided not to support plugins in the future so we need to move away from these anyway.
Three particular technical standards should enable us to do this in the future: HTML5, MPEG-DASH and W3C Media Source Extensions.
HTML5 and Media Source Extensions (MSE)
In HTML5 the HTMLMediaElement, typically a video or audio tag, exposes a source element which accepts a URL of the content to be played. The browser retrieves, decodes and plays the media data automatically, providing it knows how to handle your media type. This offers simplicity for the developer and, in theory, has removed the need for plugins, but the trade-off is that there is no control over many important variables: how data is downloaded and from where, how much data is buffered, which adaptive streaming algorithms to use or what to do in case of failure.
The ability to control these variables is key to providing a world-class user experience, but by default they are hard-coded into the browser. Ideally we want to hand as much control as possible to the Javascript application, while still deferring to the browser for parsing, decoding and rendering the media data. Typically, Adobe Flash or Microsoft Silverlight applications have been used to provide all of this functionality on those platforms that support those plugins.
Most of these features can be replicated in Javascript but, until now, it has not been possible to feed media data to the HTMLMediaElement. The Media Source Extensions define a Javascript API which allows media streams to be constructed dynamically within a Javascript application.
At the heart of MSE is the MediaSource object. This object is created by the application and attached to the media element. Its purpose is to provide the media data for playback as requested by the media element.
The MediaSource object maintains a collection of SourceBuffers. These are the interface through which the application appends media data to the source and methods are provided to insert, remove and manage media data. They are essentially an abstraction of a timeline – media data can appended to the buffer based on media playback timestamp, or it can be appended sequentially, ignoring timestamps. The latter mode enables unrelated media to be spliced together, which allows uses such as advert insertion or even video editing in the browser.
If you need to deliver to mobile devices and via OTT platforms, you need to deliver HTTP Live Streaming (HLS). Apple provides plenty of advice for compressionists, but here are some tips and tricks for encoding and testing your HLS files.
By Jan Ozer, StreamingMedia.com
The launch of the Joint Task Force on File Formats and Media Interoperability was announced today by its sponsors, the North American Broadcasters Association (NABA), Advanced Media Workflow Association (AMWA), Society of Motion Picture and Television Engineers (SMPTE), International Association of Broadcast Manufacturers (IABM), American Association of Advertising Agencies (4A’s), and Association of National Advertisers (ANA). The European Broadcasting Union (EBU) is participating as an observer.
Bringing together manufacturers, broadcasters, advertisers, ad agencies, and industry organizations (standards bodies and trade associations) serving the professional media market, the Task Force has an ultimate goal to create greater efficiencies and cost savings for exchange of file-based content.
The group’s initial focus will be to gather and analyze requirements for a machine-generated and readable file interchange and delivery specification — including standardized and common structured metadata — for the professional media industry. Use case examples include promo, spot, and program delivery from a provider to a broadcaster.
In one of its initial actions, the task force has published a survey designed to collect data on user requirements. Open to any member of the media industry, the survey asks participants to create a one-sentence “user story” by identifying the nature of their work, the specific function they seek, and the business value that would be provided by that function.
Other task force activities will include the collection of data on existing products for transcode, transform, and file QC, and their ability to be driven by data from UML, XML, API, script, and other machine-to-machine communication mechanisms.
In addition to analyzing and publishing this data within a formal report, the task force will analyze the data in terms of current, planned, and unplanned standards activities and publish recommendations for future activities.
Source: SMPTE