ISO/IEC 23090-7:2022 Information technology — Coded representation of immersive media — Part 7: Immersive media metadata

3 Terms, definitions and symbols

3.1 Terms and definitions

For the purposes of this document, the terms and definitions given in ISO/IEC 14496-12 and ISO/IEC 23008-12 and the following apply.

ISO and IEC maintain terminology databases for use in standardization at the following addresses:

3.1.1

azimuth

first of the two sphere coordinates (3.1.22) describing the location of a point on the sphere

3.1.2

azimuth circle

circle on the sphere connecting all points with the same azimuth (3.1.1) value

Note 1 to entry: An azimuth circle is always a great circle (3.1.12).

3.1.3

circular image

image captured with a fisheye lens (3.1.9)

3.1.4

common reference coordinate system

3D Cartesian coordinate system with the centre being (X, Y, Z) equal to (0, 0, 0), used as the reference coordinate system for all viewpoints within a viewpoint group (3.1.27)

3.1.5

content coverage

one or more sphere regions (3.1.23) that are covered by the content represented by the track or by an image item

3.1.6

elevation

second of the two sphere coordinates (3.1.22) describing the location of a point on the sphere

3.1.7

elevation circle

circle on the sphere connecting all points with the same elevation (3.1.6) value

Note 1 to entry: When the elevation is zero, an elevation circle is also a great circle (3.1.12). This coincides with the equator on Earth.
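
The note above follows from simple geometry: on a unit sphere, the circle of constant elevation θ has radius cos(θ), so only θ = 0 yields a circle of radius 1, that is, a great circle. The following is a minimal Python sketch of that relationship, assuming elevation is given in degrees. The function names are illustrative and are not defined by this document.

```python
import math


def elevation_circle_radius(elevation_deg: float) -> float:
    """Radius of the circle of constant elevation on a unit sphere."""
    return math.cos(math.radians(elevation_deg))


def is_great_circle(elevation_deg: float, tol: float = 1e-12) -> bool:
    """An elevation circle is a great circle only when its radius equals
    the sphere radius (1 for the unit sphere)."""
    return abs(elevation_circle_radius(elevation_deg) - 1.0) <= tol


if __name__ == "__main__":
    for elevation in (0.0, 30.0, 60.0):
        print(elevation, elevation_circle_radius(elevation), is_great_circle(elevation))
```

At elevations of ±90 degrees the circle degenerates to a single point (a pole), which is why the radius tends to zero there.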

3.1.8

field of view

extent of the observable world in captured/recorded content or in a physical display device

3.1.9

fisheye lens

wide-angle camera lens that usually captures an approximately hemispherical field of view (3.1.8) and projects it as a circular image (3.1.3)

3.1.10

fisheye video

video captured by fisheye lenses (3.1.9)

3.1.11

global coordinate axes

coordinate axes that are associated with audio, video, and images representing the same acquisition position and intended to be rendered together

3.1.12

great circle

intersection of the sphere and a plane that passes through the centre point of the sphere

Note 1 to entry: A great circle is also known as an orthodrome or Riemannian circle.

Note 2 to entry: The centre of the sphere and the centre of a great circle are co-located.

3.1.13

guard band

area in a packed picture (3.1.16) that is not rendered but may be used to improve the rendered part of the packed picture to avoid or mitigate visual artifacts such as seams

Note 1 to entry: Guard bands are associated with packed regions (3.1.17) as described in 6.5.2.

3.1.14

local coordinate axes

coordinate axes obtained after applying rotation to the global coordinate axes (3.1.11)

3.1.15

omnidirectional video

video and its associated audio that enable rendering according to the user's viewing orientation (3.1.26), if consumed with a head-mounted device, or according to the user's desired viewport (3.1.28) otherwise, as if the user were at the place and time at which the media was captured

3.1.16

packed picture

picture that is represented as a coded picture in the coded video bitstream

3.1.17

packed region

region in a packed picture (3.1.16) that is mapped to a projected region (3.1.19) as specified by the region-wise packing (3.1.21) signalling

3.1.18

projected picture

picture that has a representation format specified by an omnidirectional video (3.1.15) projection (3.1.20) format

3.1.19

projected region

region in a projected picture (3.1.18) that is mapped to a packed region (3.1.17) as specified by the region-wise packing (3.1.21) signalling

3.1.20

projection

inverse of the process by which the samples of a projected picture (3.1.18) are mapped to a set of positions identified by a set of azimuth (3.1.1) and elevation (3.1.6) coordinates on a unit sphere

3.1.21

region-wise packing

inverse of the process of transformation, resizing, and relocating of packed regions (3.1.17) of a packed picture (3.1.16) to remap to projected regions (3.1.19) of a projected picture (3.1.18)
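
As an informal illustration of the resizing and relocating steps named in this definition, the sketch below maps a position inside a packed region to the corresponding position inside a projected region. It is a simplified sketch under stated assumptions: regions are axis-aligned rectangles, no rotation or mirroring is applied, and the `Region` type and `packed_to_projected` function are hypothetical names, not syntax or processes defined by this document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle in picture sample units (hypothetical helper)."""
    left: float
    top: float
    width: float
    height: float


def packed_to_projected(x: float, y: float, packed: Region, projected: Region):
    """Map a position in the packed region to the projected region.

    Only resizing (scaling) and relocating (offset) are modelled here;
    any rotation or mirroring transform is deliberately left out.
    """
    # Normalized position inside the packed region, in the range 0..1.
    u = (x - packed.left) / packed.width
    v = (y - packed.top) / packed.height
    # Same relative position inside the projected region.
    return (projected.left + u * projected.width,
            projected.top + v * projected.height)


if __name__ == "__main__":
    packed = Region(left=0, top=0, width=100, height=50)
    projected = Region(left=200, top=100, width=200, height=100)
    print(packed_to_projected(50, 25, packed, projected))
```

In this example the packed region is half the size of the projected region in each dimension, so the mapping doubles the offsets from the region origin before relocating them.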

3.1.22

sphere coordinates

azimuth (ϕ) (3.1.1) and elevation (θ) (3.1.6) that identify a location of a point on the unit sphere
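
To make the role of the two sphere coordinates concrete, the sketch below converts an (azimuth, elevation) pair to a point on the unit sphere and back. The axis convention used here (azimuth measured in the X-Y plane from the X axis, elevation measured from that plane towards the Z axis, angles in degrees) is an assumption made for illustration only; this clause does not define the axes, and the function names are hypothetical.

```python
import math


def sphere_to_cartesian(azimuth_deg: float, elevation_deg: float):
    """Unit-sphere point for the given sphere coordinates (assumed axis convention)."""
    phi = math.radians(azimuth_deg)
    theta = math.radians(elevation_deg)
    x = math.cos(theta) * math.cos(phi)
    y = math.cos(theta) * math.sin(phi)
    z = math.sin(theta)
    return (x, y, z)


def cartesian_to_sphere(x: float, y: float, z: float):
    """Inverse mapping; expects a point on (or very near) the unit sphere."""
    azimuth_deg = math.degrees(math.atan2(y, x))
    # Clamp z to guard against rounding slightly outside [-1, 1].
    elevation_deg = math.degrees(math.asin(max(-1.0, min(1.0, z))))
    return (azimuth_deg, elevation_deg)


if __name__ == "__main__":
    point = sphere_to_cartesian(30.0, 45.0)
    print(point, cartesian_to_sphere(*point))
```

Under this assumed convention, azimuth falls in the range −180 to 180 degrees and elevation in the range −90 to 90 degrees. At the poles the azimuth is not unique.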

3.1.23

sphere region

region on a sphere, specified either by four great circles (3.1.12) or by two azimuth circles (3.1.2) and two elevation circles (3.1.7), or such a region on the rotated sphere after applying a certain amount of yaw, pitch, and roll rotations

3.1.24

SDL

syntactic description language

language that allows the description of a bitstream’s syntax

Note 1 to entry: Syntactic description language is defined in ISO/IEC 14496-1:2010, Clause 8.

3.1.25

tilt angle

angle indicating the amount of tilt of a sphere region (3.1.23), measured as the amount of rotation of the sphere region along the axis originating from the sphere origin passing through the centre point of the sphere region, where the angle value increases clockwise when looking from the origin towards the positive end of the axis

3.1.26

viewing orientation

triple of azimuth (3.1.1), elevation (3.1.6), and tilt angle (3.1.25) characterizing the orientation in which a user is consuming the audio-visual content

Note 1 to entry: In the case of an image or video, viewing orientation characterizes the orientation of the viewport (3.1.28).

3.1.27

viewpoint group

group of viewpoints that share the same common reference coordinate system (3.1.4)

3.1.28

viewport

region of omnidirectional image or video suitable for display and viewing by the user

3.2 Symbols

+	Addition.
−	Subtraction (as a two-argument operator) or negation (as a unary prefix operator).
*	Multiplication, including matrix multiplication.
x^y	Exponentiation. Specifies x to the power of y. In other contexts, such notation is used for superscripting not intended for interpretation as exponentiation.
/	Integer division with truncation of the result toward zero. For example, 7 / 4 and −7 / −4 are truncated to 1, and −7 / 4 and 7 / −4 are truncated to −1.
÷	Used to denote division in mathematical equations where no truncation or rounding is intended.
$\frac{x}{y}$	Used to denote division in mathematical equations where no truncation or rounding is intended.
$\sum_{i=x}^{y} f( i )$	The summation of f( i ) with i taking all integer values from x up to and including y.
x % y	Modulus. Remainder of x divided by y, defined only for integers x and y with x >= 0 and y > 0.
Asin( x )	The trigonometric inverse sine function, operating on an argument x that is in the range of −1.0 to 1.0, inclusive, with an output value in the range of −π÷2 to π÷2, inclusive, in units of radians.
Atan( x )	The trigonometric inverse tangent function, operating on an argument x that is any real number, with an output value in the range of −π÷2 to π÷2, inclusive, in units of radians.
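Atan2( y, x )	$\mathrm{Atan2}( y, x ) = \begin{cases} \mathrm{Atan}( y \div x ) & x > 0 \\ \mathrm{Atan}( y \div x ) + \pi & x < 0 \text{ and } y \geq 0 \\ \mathrm{Atan}( y \div x ) - \pi & x < 0 \text{ and } y < 0 \\ +\pi \div 2 & x = 0 \text{ and } y \geq 0 \\ -\pi \div 2 & \text{otherwise} \end{cases}$	(3‑1)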
Cos( x )	The trigonometric cosine function operating on an argument x in units of radians.
Floor( x )	The largest integer less than or equal to x.
Sin( x )	The trigonometric sine function operating on an argument x in units of radians.
Tan( x )	The trigonometric tangent function operating on an argument x in units of radians.
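
Several of the operators above differ from the built-in behaviour of common programming languages, so a short sketch may help. The Python functions below mirror the definitions of `/`, `%`, the summation, and Floor given in this subclause. The function names are illustrative only. Note that Python's own `//` operator rounds toward negative infinity, which is why `/` as defined here needs a helper.

```python
import math


def int_div(x: int, y: int) -> int:
    """'/' as defined above: integer division truncating toward zero."""
    q = abs(x) // abs(y)
    # The result is non-negative when x is zero or x and y share a sign.
    return q if (x >= 0) == (y > 0) else -q


def mod(x: int, y: int) -> int:
    """'x % y' as defined above: only for integers with x >= 0 and y > 0."""
    if x < 0 or y <= 0:
        raise ValueError("x % y is defined only for x >= 0 and y > 0")
    return x % y


def summation(f, x: int, y: int):
    """Sum of f(i) for i taking all integer values from x up to and including y."""
    return sum(f(i) for i in range(x, y + 1))


if __name__ == "__main__":
    print(int_div(7, 4), int_div(-7, -4), int_div(-7, 4), int_div(7, -4))
    print(-7 // 4)            # Python floor division, for comparison
    print(mod(7, 4))
    print(summation(lambda i: i * i, 1, 3))
    print(math.floor(-1.5))   # Floor( x ): largest integer <= x
```

The comparison line shows the difference that matters in practice: `int_div(-7, 4)` gives −1, whereas Python's `-7 // 4` gives −2, which matches Floor( −7 ÷ 4 ) rather than the `/` operator of this subclause.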

Bibliography

[1]	ISO/IEC 14496-1:2010, Information technology — Coding of audio-visual objects — Part 1: Systems
