ISO/IEC 18477-9:2016 情報技術—連続トーン静止画像のスケーラブルな圧縮とコーディング—パート9：アルファチャネルコーディング

この規格プレビューページの目次

※一部、英文及び仏文を自動翻訳した日本語訳を使用しています。

3 用語、定義、記号および略語

3.1 用語と定義

このドキュメントでは、次の定義が適用されます。

ISO と IEC は、次のアドレスで標準化に使用する用語データベースを維持しています。

3.1.1

アスキーエンコーディング

ISO/IEC 10646 に準拠したテキスト文字とテキスト文字列のエンコード

3.1.2

ベースデコードパス

残差データが残差コードストリームから得られた値に追加されるまで、すべての以降のステップと一緒に、レガシーコードストリームとリファインメントデータをベース画像にデコードするプロセス

3.1.3

ベース画像

レガシーコードストリームとリファインメントコードストリームのDCT係数をエントロピー復号し、逆DCT変換して得られるサンプル値の集合

3.1.4

アルファチャンネル

メイン画像の各サンプルの不透明度をエンコードする追加のスカラー画像チャネル

3.1.5

アルファ成分

アルファチャンネルの同義語

3.1.6

バイナリ決定

2 つの選択肢からの選択

3.1.7

ブロック

サンプルの 8×8 配列または 1 つの成分の DCT 係数値の 8×8 配列

3.1.8

箱

1 つまたは複数の APP ₁₁マーカーセグメントに埋め込まれた画像または画像デコードプロセスを記述する構造化されたデータの集合。

注記 1:ボックスの定義については、ISO/IEC 18477-3:2015, 附属書 B を参照してください。

3.1.9

バイト

8 ビットのグループ

3.1.10

コーダー

コーディングプロセスの具体化

3.1.11

コード

エンコードまたはデコード

3.1.12

コーディングプロセス

符号化プロセス、復号化プロセス、またはその両方への一般的な参照

3.1.13

圧縮

ソース画像データを表すために使用されるビット数の削減

3.1.14

成分

出力または表示装置で同じ名称を持つサンプルの二次元配列。

注記 1画像は通常、赤、緑、青などの複数のコンポーネントで構成されます。

3.1.15

構成

不透明度情報を使用してデコードされた画像データを背景画像データとマージし、1 つの最終出力画像を生成するプロセス

3.1.16

連続階調画像

コンポーネントがサンプルごとに複数のビットを持つイメージ

3.1.17

デコーダ

復号化プロセスの実施形態

3.1.18

デコード処理

圧縮された画像データを入力として受け取り、連続階調画像を出力するプロセス。

3.1.19

エンコーダー

符号化プロセスの実施形態

3.1.20

エンコード処理

連続階調画像を入力として受け取り、圧縮された画像データを出力するプロセス。

3.1.21

エントロピーデコーダー

エントロピー復号手順の実施形態

3.1.22

エントロピー復号化

エントロピーエンコーダーによって生成されたビットのシーケンスからシンボルのシーケンスを復元する無損失手順。

3.1.23

エントロピーエンコーダ

エントロピー符号化手順の実施形態

3.1.24

エントロピー符号化

シンボルあたりの平均ビット数が入力シンボルのエントロピーに近づくように、入力シンボルのシーケンスをビットのシーケンスに変換する無損失手順。

3.1.25

固定小数点離散コサイン変換

固定小数点 DCT

ISO/IEC 18477-8:2016, 付属書 E の仕様に準拠した固定小数点演算に基づく離散コサイン変換の実装

3.1.26

高ダイナミックレンジ

HDR

サンプルあたり 8 ビットを超える画像または画像データ

3.1.27

整数ベースの離散コサイン変換

整数点 DCT

ISO/IEC 18477-8:2016, 付属書 E で指定されている離散コサイン変換の整数近似を使用した、空間ドメインから周波数ドメインへの 8×8 サンプルブロックの変換

3.1.28

共同写真専門家グループ

JPEG

この文書を作成した委員会の非公式の名前

注記 1: 「ジョイント」は、ITU-T と ISO/IEC のコラボレーションに由来します。

3.1.29

従来のコードストリーム

Rec. ITU-T T.81 によって定義されたマーカーと構文要素のコレクション | ISO/IEC 10918-1 は、ISO/IEC 18477 ファミリの標準によって定義された構文要素を一切含んでいません。つまり、レガシーコードストリームは、ISO/IEC 18477-3:2015, 付属書 A で定義された構文によって JPEG XT ボックスを記述する APP ₁₁マーカーを除くすべてのマーカーのコレクションで構成されます。

3.1.30

従来のデコードパス

Rec. ITU-T T.81 | ISO/IEC 10918-1 と Legacy Refinement を組み合わせてスキャンしてから、このデータを残りのデータとマージして最終的な出力画像を形成します。

3.1.31

レガシーデコーダ

T.81 | Rec. ITU-T T.81に準拠した復号プロセスの実施形態ISO/IEC 10918-1, 可逆 DCT プロセスおよびベースライン、シーケンシャルまたはプログレッシブモードに限定され、最大 4 つのコンポーネントをコンポーネントあたり 8 ビットにデコード

3.1.32

無損失の

復号化手順の出力が符号化手順への入力と同一である符号化および復号化プロセスおよび手順。

3.1.33

ロスレスコーディング

すべての手順がロスレスである、ISO/IEC 18477-8:2016 で定義されたコーディングプロセスのいずれかを参照する動作モード

3.1.34

ロッシー

ロスレスではないエンコードとデコードのプロセス

3.1.35

低ダイナミックレンジ

LDR

画像またはサンプルあたり 8 ビット以下のデータで構成される画像データ

3.1.36

ピクセル

すべて同じサンプル座標を持つ空間画像ドメインのサンプル値のコレクション。たとえば、ピクセルは、赤、緑、青の値を表す 3 つのサンプルで構成されます。

3.1.37

ポイント変換

空間ドメインで再構築されたサンプル値への位置に依存しないグローバル関数の適用

3.1.38

精度

特定のサンプルまたは DCT 係数に割り当てられたビット数

3.1.39

事前に乗算されたコンテンツ

背景との画像の構成を容易にするために、ピクセル単位でアルファチャネルのスケーリングされた値が既に乗算されている画像コンポーネント

3.1.40

手順

符号化または復号化プロセスを構成するタスクの 1 つを達成する一連のステップ

3.1.41

クオンタイズ

DCT係数の量子化手順を実行する行為

3.1.42

残差復号化パス

残差データボックスと残差精査スキャンボックスに含まれるエントロピー符号化データに、このデータがベースイメージとマージされて最終的な出力イメージを形成するポイントまで適用される操作の集合。

3.1.43

残像

残差スキャンおよび残差精緻化スキャンによって記述されたエントロピー復号化係数に適用される逆量子化および逆 DCT 変換によって再構築されたサンプル値

3.1.44

残留スキャン

高ダイナミックレンジまたは広色域データの再生を可能にするために、従来のスキャンの加法的および/または乗法的補正データを提供する、従来のデコーダには見えない画像データに対する追加のパス

3.1.45

精緻化スキャン

DCT変換された係数の精度を拡張するために追加の最下位ビットを提供するレガシーデコーダには見えない画像データの追加パス

注記 1:精緻化スキャンは、レガシーまたは残差復号パスのいずれかに適用できます。

3.1.46

サンプル

コンポーネントを構成する 2 次元画像配列の 1 つの要素

3.1.47

サンプルグリッド

画像のすべてのサンプルに共通の座標系

注記 1:画像の左上端のサンプルの座標は (0,0) で、最初の座標は右に向かって増加し、2 番目の座標は下に向かって増加します。

3.1.48

スーパーボックス

他のボックスをペイロードデータとして運ぶボックス

3.1.49

サブボックス

スーパーボックス内にペイロードデータとして含まれるボックス

3.2 アイコン

X	位置のサンプルグリッドの幅
Y	位置のサンプルグリッドの高さ
nf	画像内のコンポーネント数
_六	コンポーネント i の水平方向のサブサンプリング係数
_シ、イ	コンポーネント i の垂直方向のサブサンプリング係数
_i	フレームヘッダーのコンポーネント i のサブサンプリングインジケータ
_i	フレームヘッダーのコンポーネント i のサブサンプリングインジケータ
v _x,y	サンプルグリッド位置 x, y でのサンプル値
R _h	精緻化スキャンによって表される DCT 係数ビットの追加数である 8+h は、逆 DCT プロセスの出力の非分数ビット (つまり、「バイナリドット」の前のビット) の数です。
_r	残差の精緻化スキャンによって表される DCT 係数ビットの追加数。P+R _hは、残差画像の逆 DCT プロセスの出力の非分数ビット (つまり、「バイナリドット」の前のビット) の数です。ここで、P は残差コードストリームのフレームヘッダーで示されるビット深度です。
R _b	HDR 画像の追加ビット。 8+Rb は、再構成された HDR 画像のサンプル精度です。

3.3 略語

アスキー	情報交換用米国標準コード
DCT	離散コサイン変換
LSB	最下位ビット
MSB	上位ビット
TMO	トーンマッピングオペレータ

参考文献

	Rec. ITU-T T.83 \| ISO/IEC 10918‑2:情報技術—連続階調静止画像のデジタル圧縮と符号化: コンプライアンステスト
	ISO/IEC 10646, 情報技術 - Universal Coded Character Set (UCS)

3 Terms, definitions, symbols and abbreviated terms

3.1 Terms and definitions

For the purposes of this document, the following definitions apply.

ISO and IEC maintain terminological databases for use in standardization at the following addresses:

3.1.1

ASCII encoding

encoding of text characters and text strings according to ISO/IEC 10646

3.1.2

base decoding path

process of decoding legacy codestream and refinement data to the base image, jointly with all further steps until residual data is added to the values obtained from the residual codestream

3.1.3

base image

collection of sample values obtained by entropy decoding the DCT coefficients of the legacy codestream and the refinement codestream, and inversely DCT transforming them jointly

3.1.4

alpha channel

additional scalar image channel that encodes the opacity of each sample in the main image

3.1.5

alpha component

synonym for alpha channel

3.1.6

binary decision

choice between two alternatives

3.1.7

block

8×8 array of samples or an 8×8 array of DCT coefficient values of one component

3.1.8

box

structured collection of data describing the image or the image decoding process embedded into one or multiple APP₁₁ marker segments

Note 1 to entry: See ISO/IEC 18477-3:2015, Annex B for the definition of boxes.

3.1.9

byte

group of 8 bits

3.1.10

coder

embodiment of a coding process

3.1.11

coding

encoding or decoding

3.1.12

coding process

general reference to an encoding process, a decoding process, or both

3.1.13

compression

reduction in the number of bits used to represent source image data

3.1.14

component

two-dimensional array of samples having the same designation in the output or display device

Note 1 to entry: An image typically consists of several components, e.g. red, green and blue.

3.1.15

composition

process of merging the decoded image data with background image data using opacity information and generating one single final output image

3.1.16

continuous-tone image

image whose components have more than one bit per sample

3.1.17

decoder

embodiment of a decoding process

3.1.18

decoding process

process which takes as its input compressed image data and outputs a continuous-tone image

3.1.19

encoder

embodiment of an encoding process

3.1.20

encoding process

process which takes as its input a continuous-tone image and outputs compressed image data

3.1.21

entropy decoder

embodiment of an entropy decoding procedure

3.1.22

entropy decoding

lossless procedure which recovers the sequence of symbols from the sequence of bits produced by the entropy encoder

3.1.23

entropy encoder

embodiment of an entropy encoding procedure

3.1.24

entropy encoding

lossless procedure which converts a sequence of input symbols into a sequence of bits such that the average number of bits per symbol approaches the entropy of the input symbols

3.1.25

fixed point discrete cosine transformation

fixed point DCT

implementation of the discrete cosine transformation based on fixed point arithmetic following the specifications in ISO/IEC 18477-8:2016, Annex E

3.1.26

high dynamic range

HDR

image or image data comprised of more than eight bits per sample

3.1.27

integer based discrete cosine transformation

integer point DCT

transformation of an 8×8 sample block from the spatial domain to the frequency domain using the integer approximation of the discrete cosine transformation as specified in ISO/IEC 18477-8:2016, Annex E

3.1.28

joint photographic experts group

JPEG

informal name of the committee which created this document

Note 1 to entry: The"joint" comes from the ITU-T and ISO/IEC collaboration.

3.1.29

legacy codestream

collection of markers and syntax elements defined by Rec. ITU-T T.81 | ISO/IEC 10918-1 bare any syntax elements defined by the ISO/IEC 18477 family of standards. That is, the legacy codestream consists of the collection of all markers except those APP₁₁ markers that describe JPEG XT boxes by the syntax defined in ISO/IEC 18477-3:2015, Annex A

3.1.30

legacy decoding path

collection of operations to be performed on the entropy coded data as described by Rec. ITU-T T.81 | ISO/IEC 10918-1 jointly with the Legacy Refinement scans before this data is merged with the residual data to form the final output image

3.1.31

legacy decoder

embodiment of a decoding process conforming to Rec. ITU-T T.81 | ISO/IEC 10918-1, confined to the lossy DCT process and the baseline, sequential or progressive modes, decoding at most four components to eight bits per component

3.1.32

lossless

encoding and decoding processes and procedures in which the output of the decoding procedure(s) is identical to the input to the encoding procedure(s)

3.1.33

lossless coding

mode of operation which refers to any one of the coding processes defined in ISO/IEC 18477-8:2016 in which all of the procedures are lossless

3.1.34

lossy

encoding and decoding processes which are not lossless

3.1.35

low-dynamic range

LDR

image or image data comprised of data with no more than eight bits per sample

3.1.36

pixel

collection of sample values in the spatial image domain having all the same sample coordinates, e.g. a pixel may consist of three samples describing its red, green and blue value

3.1.37

point transformation

application of a location independent global function to reconstructed sample values in the spatial domain

3.1.38

precision

number of bits allocated to a particular sample or DCT coefficient

3.1.39

premultiplied content

image component that has already multiplied by the scaled value of the alpha channel on a pixel-by-pixel basis to ease the composition of the image with the background

3.1.40

procedure

set of steps which accomplishes one of the tasks which comprise an encoding or decoding process

3.1.41

quantize

act of performing the quantization procedure for a DCT coefficient

3.1.42

residual decoding path

collection of operations applied to the entropy coded data contained in the residual data box and residual refinement scan boxes up to the point where this data is merged with the base image to form the final output image

3.1.43

residual image

sample values as reconstructed by inverse quantization and inverse DCT transformation applied to the entropy-decoded coefficients described by the residual scan and residual refinement scans

3.1.44

residual scan

additional pass over the image data invisible to legacy decoders which provides additive and/or multiplicative correction data of the legacy scans to allow reproduction of high-dynamic range or wide colour gamut data

3.1.45

refinement scan

additional pass over the image data invisible to legacy decoders which provides additional least significant bits to extend the precision of the DCT transformed coefficients

Note 1 to entry: Refinement scans can be either applied in the legacy or residual decoding path.

3.1.46

sample

one element in the two-dimensional image array which comprises a component

3.1.47

sample grid

common coordinate system for all samples of an image

Note 1 to entry: The samples at the top left edge of the image have the coordinates (0,0), the first coordinate increases towards the right, the second towards the bottom.

3.1.48

superbox

box that carries other boxes as payload data

3.1.49

sub box

box that is contained as payload data within a superbox

3.2 Symbols

X	Width of the sample grid in positions
Y	Height of the sample grid in positions
Nf	Number of components in an image
s_i,x	Subsampling factor of component i in horizontal direction
s_i,y	Subsampling factor of component i in vertical direction
H_i	Subsampling indicator of component i in the frame header
V_i	Subsampling indicator of component i in the frame header
v_x,y	Sample value at the sample grid position x,y
R_h	Additional number of DCT coefficients bits represented by refinement scans, 8+h is the number of non-fractional bits (i.e. bits in front of the “binary dot”) of the output of the inverse DCT process.
R_r	Additional number of DCT coefficients bits represented by refinement scans in the residual, P+R_h is the number of non-fractional bits (i.e. bits in front of the “binary dot”) of the output of the inverse DCT process in the residual image where P is the bit depth indicated in the frame header of the residual codestream.
R_b	Additional bits in the HDR image. 8+Rb is the sample precision of the reconstructed HDR image.

3.3 Abbreviated terms

ASCII	American Standard Code for Information Interchange
DCT	discrete cosine transformation
LSB	least significant bit
MSB	most significant bit
TMO	tone mapping operator

Bibliography

	Rec. ITU-T T.83 \| ISO/IEC 10918‑2: Information Technology — Digital Compression and Coding of Continuous-Tone Still Images: Compliance testing
	ISO/IEC 10646, Information technology – Universal Coded Character Set (UCS)

ISO/IEC 18477-9:2016 情報技術—連続トーン静止画像のスケーラブルな圧縮とコーディング—パート9：アルファチャネルコーディング | ページ 6