ISO 3534-3:2013 統計—用語, 語彙と記号—パート3：実験計画

この規格プレビューページの目次

※一部、英文及び仏文を自動翻訳した日本語訳を使用しています。

3 用語と定義

この文書の目的上、次の用語と定義が適用されます。

3.1 一般用語

3.1.1

実験

制御可能な条件とリソースの割り当てを選択的に調整することによるシステムの目的的な調査

注記 1: システムとは、機能との関係から見た、要素の相互作用の組み合わせです。システムを改善したり理解したりするために、システムに対して意図的な変更や調整が行われます。言い換えれば、実験は、あらかじめ決められた方法で制御可能な要素を変化させることによって、実験者が念頭に置いている質問に対する明確で有効な答えを得る体系的かつ客観的な手段です。

注記 2:実験の重要な側面は制御です。研究者は、システムを効率的に理解することを目的として、設定、入力材料、個人への手順の割り当てなどを変更する能力を持っています。実験を適切に設計し実施すれば、因果関係を設定の影響に帰することができます。

注記 3: 実験は、研究者がどの単位を研究するか、どの観察プロセスを観察するかを決定できる観察研究とは異なりますwhere 、実験処理の割り当て (3.1.13) は研究者の管理外です。

3.1.2

モデル

実験結果の形式化された表現 (3.1.1)

注記 1:モデルは 3 つの部分から構成されます。最初の部分は、モデル化されている応答変数 (3.1.3) です。 2 番目の部分は、予測子変数 (3.1.4) を含むモデルの決定論的または体系的な部分です。最後に、3 番目の部分は残差誤差 (3.1.6) で、純粋なランダム誤差 (3.1.9) と指定ミス誤差 (3.1.10) が含まれる可能性があります。モデルは実験全体に適用され、下付き文字で示される個別の結果にも適用されます。モデルは、応答変数を予測変数に関連付け、関連する仮定を含む数学的記述です。結果とは、応答変数の記録または測定された観察を指します。

注記 2:モデルは実際のシステムを簡略化して表現したものでありwhere 重要な機能または基本的な機能のみが考慮されています。

例 1:

コンポーネントの寿命は、コンポーネントが経験する環境条件に関係します。

例 2:

2 つの要素を含む正式なモデル (3.1.5) は次のとおりです。

mml_m1

どこ

y _ij	因子A のレベルi と因子B のレベルj の応答変数です。
μ	全体の平均応答です。
αi_i	はレベルi における因子A の増分効果です。
β_j	はレベルj における因子B の増分効果です。
ε_ij	は残留誤差です。

モデルの応答部分は、単にy _ijで構成されます。このモデルの予測部分は、全体の平均応答と因子の効果に関連する 2 つの項で構成されるµ + α_i + β_j です。このモデルのランダムまたはエラー部分は、応答を生成するプロセスに固有の変動性を含むε_ijで構成されます。

例 3:

一般的に使用されるモデルは次のとおりです。

mml_m2

どこ

y _ijk	k 番目の複製の応答です。
αi_i	係数 1 による調整です。
β_j	は係数 2 による調整です。
τ_ij	要因の相互作用による調整です。
ε_ijk	は残留誤差です。

正式な数学モデルには全体の平均項が含まれていないため、例 2 のような「増分効果」の代わりに「調整」という用語が使用されます。さらに、この例では、複製の潜在的な存在を確認するために、 y _ij ( ε_ij ) ではなくy _ijk ( ε_ijk ) が使用されます。

例 4:

別の正式なモデルは次のとおりです。

mml_m3

どこ

y _i	はx _i に対応する応答です。
x _i	単一因子のコード化または数値レベルです。
_e β₀ + β₁x _i + β₂x _i ²	はx _i に対応する平均応答を表します。
εi_i	は残留誤差です。

例 5:

次のモデルは 2 ⁴要因計画 (3.2.5) に適用されます。

mml_m4

ここで、因子x _{ji は}、因子j ( j = 1, 2, 3, 4) および観測値i ( i = 1, 2, …, n ) のコード化または数値レベルです。
このモデルには、切片項 ( µ )、4 つの主効果項 ( x _{1 i} 、 x _{2 i} 、 x _{3 i} 、 x _{4 i} )、6 つの双方向交互作用項 ( x _{1 i} x _{2 i} 、 x _{1 i} x が含まれています。 _{3 i} 、 x _{1 i} x _{4 i} 、 x _{2 i} x _{3 i} 、 x _{2 i} x _{4 i} 、およびx _{3 i} x _{4 i} )、4 つの三元交互作用項 ( x _{1 i} x _{2 i} x _{3 i} 、 x _{1 i} x _{2 i} x _{4 i} 、 x _{1 i} x _{3 i} x _{4 i} 、 x _{2 i} x _{3 i} x _{4 i} )、1 つの四方向交互作用項 ( x _{1 i} x _{2 i} x _{3 i} x _{4 i} ) と残差 ( ε_i )モデル内の因子は相互作用を表すために乗算することができますが、モデル自体のパラメーターは線形です。

注記 3:モデルに関する上記の説明は、加算誤差を伴う古典的な線形モデルだけでなく、一般化線形モデルにも当てはまりますここで, 誤差は、二項分布、ポアソン分布、指数分布、ガンマ分布を含むさまざまな分布によって説明できます。そして正規分布。例 4 では、関数の決定論的な部分に対数変換が適用されているため、直線性が生じています。この用語エントリで挙げた例はパラメーターが線形ですが、これはそのようなケースがすべての実験計画状況に適用されることを示唆するものではありません。

3.1.3

応答変数

出力変数

実験の結果を表す変数 (3.1.1)

注記 1: 「従属変数」という用語は、「独立性」と混同される可能性があるため、同義語として推奨されません (ISO 3534-1:2006, 2.4 を参照)

注記 2:各実験単位から複数の応答が記録されているため、応答変数はベクトル値である可能性があります (3.1.24) 。

注記 3:応答変数は 1 つ以上の予測変数 (3.1.4) の影響を受ける可能性が高く、その性質は応答変数の制御または最適化に役立ちます。

3.1.4

予測変数

実験の結果の説明に貢献できる変数 (3.1.1)

注記 1:予測子変数は、たとえば 2 つのレベルでカテゴリカル因子の影響をモデル化するために使用できます。因子の複数のレベルの場合、2 つ以上の予測子変数を考案して、個別のカテゴリレベルを表すことができます。

注記 2: 予測子変数には、ランダムな要素を含めることができます。あるいは、たとえば、ランダムな誤差なしで観察または割り当てることができる一連の定性的クラスからの変数を使用することもできます。

注記 3: 「予測変数」という用語は、通常、応答変数 (3.1.3) と利用可能な予測変数または予測変数の関数の間の数学的関係を構築する際に使用されます。「因子」という用語は、特定の因子が変化するときに応答変数を評価する手段として運用上使用される傾向があります。

注記 4: 「独立変数」は、「独立性」と混同される可能性があるため、同義語として推奨されません (ISO 3534-1:2006, 2.4 を参照)予測子変数の代わりに使用されることがある他の用語には、「入力変数」、「記述子変数」、「説明変数」などがあります。

3.1.5

要素

変動の潜在的な原因として調査中の機能

注記 1:特定の因子を制御できる範囲によって、計画された実験におけるその因子の潜在的な役割が決まります。要因には、制御可能（固定）、変更可能（短期間のみ制御可能またはかなりの費用がかかる）、または制御不能（ランダム）があります。

注記 2:因子はブロックの作成に関連付けられる可能性があります (3.1.25) 。

3.1.6

残留誤差

誤差項

応答変数 (3.1.3) と仮定されたモデル (3.1.2) に基づくその予測との差を表す確率変数

注記 1:応答変数の予測値は、パラメータがデータから推定された仮定のモデルに基づいています。残差誤差は、モデルに含まれている予測変数では説明できない応答変数の部分であり、体系的な原因と偶然の原因の両方が原因である可能性があります。

注記 2:この定義の目的上、「予測応答値」という用語は、実験データ (3.1. 1) 想定モデルを使用します。

注記 3: 残留誤差には、 純粋なランダム誤差 (3.1.9) および指定ミス誤差 (3.1.10) が含まれます。残差誤差の期待値はゼロであると仮定されます。

注記 4:残差誤差の分散は、通常、実験において、仮定されたモデルに含まれる項のプール平方和を平方和の総和から引き、対応する自由度の差で割ることによって推定されます (3.1. 32) 。

注記 5: 「残留誤差」という用語は、実際には 2 つの異なる方法で使用されます。 ISO 3534 のこの部分では、この用語は、確率変数である応答変数と、想定されたモデルに基づく応答変数の予測との差に関連付けられた確率変数として使用されます。

注記 6:残差がデータから推定される場合、サンプル残差または経験的残差という用語が使用されます。

例：

単純なモデルを考えてみましょう mml_m8 は、予測子変数x が与えられた残差です。

3.1.7

残留物

残差の観測値（3.1.6）

例 1:

mml_m9 は、3.1.2 の例 2 のモデルに対応する残差です。

例 2:

mml_m10 は、3.1.2 の例 3 のモデルに対応する残差です。

例 3:

mml_m11 は、3.1.2 の例 4 のモデルに対応する残差です。

例 4:

3.1.8

分散成分

応答変数の合計分散の一部 (3.1.3)

注記 1: 分散成分は、応答変数の全体的な変動の一部である個々の分散成分であるか、独立誤差項の合計として応答変数をモデル化する場合の確率変数に起因する可能性があります。

注記 2:ネスト因子 (3.2.21 を参照) または交差因子 (3.2.1 を参照) を含む他のモデルも想定できます。

注記 3:最も単純なケースでは、残差誤差の分散が唯一の分散成分であるモデルを想像することができます (たとえばwhere 因子を変更せず、単一のユニットでの繰り返しの測定からなる実験) ）。

例：

モデル (3.1.2) では、 mml_m16 はy _ijの分散成分です。

3.1.9

純粋なランダムエラー

純粋なエラー

複製された観測に関連する残差誤差 (3.1.6) の一部

注記 1:実験材料、環境条件、および実験操作は慎重に制御されているにもかかわらず、繰り返されると結果が試行ごとに異なることは、実験 (3.1.1) の共通の特徴です。したがって、実験者の最善の努力にもかかわらず、純粋なランダム誤差はよく発生します。この変動により、結果から導き出される結論にある程度の不確実性が生じるため、決定を下す際には考慮する必要があります。

注記 2: 実験計画 (3.1.28) の中心点 (3.1.39) のみが複製された場合、中心点での応答の標本分散は純粋な誤差の分散の推定値を提供します。複数の治療の組み合わせで反復 (3.1.36) が行われた場合、純粋な誤差の分散の全体的な推定は、これらの実験的治療 (3.1.13) での推定値をプールすることによって達成できます。

例：

3.1.2 の例 3 に戻ると、固定 ( i , j ) の純粋誤差の分散の推定値は次のようになります。 mml_m19

ここで、 i = 1,..., I ; j = 1,..., J ; k = 1, _... , n ; mml_m20

。

注記 3: 「純粋な誤り」という用語は、実際には 2 つの異なる方法で使用されます。 ISO 3534 のこの部分では、この用語は確率変数として使用され、数学的モデルに関連した母集団分散 ( σ² ) を指します。数学的な観点から、純粋な誤差は、例 2 ではε_ijとして、例 3 ではε_ijkとして、例 4 ではε_i として、すべて 3.1.2 から構築できます。

注記 4:純粋誤差がデータから推定できる場合 (つまり、反復がある場合)、純粋誤差は実際には、推定残差誤差と組み合わせた「サンプル」または「経験的」純粋誤差を指します (3.1. 6) モデルの不適合テストの基礎を提供します。モデルに基づいて推定された残差誤差が推定された純粋な誤差に合理的に近い場合、モデルは実質的な適合不足を示していません。残差誤差には応答変数と予測モデル間の差異のすべての変動要因が含まれるため、残差誤差には純粋な誤差の寄与が含まれます。 3.1.2 のモデルを示す例のうち、純粋な誤差の直接推定が容易になるのは、複製を含む例 3 のみです。

3.1.10

指定ミス

純粋なランダム誤差 (3.1.9) によって考慮されない残差誤差 (3.1.6) の一部

注記 1:誤った仕様誤差は、予測変数 (3.1.4) または応答変数のモデルから誤って省略された予測変数の関数 (3.1.3) に起因する可能性があります。

注記 2: モデルに組み込まれていない可能性のある固定またはランダムな要因を含む、帰属不可能な要因が存在する可能性があります。これは、たとえば、真のモデルが 2 次であるが、近似されたモデルが線形である場合に発生します。これらの要因は実験者には未知であるため、試行ごとの変動に効果的に含まれます。実際には応答変数に影響を与えるが、モデルでは省略されていた固有の要因は、実験結果の系統的誤差につながる可能性があります。モデルを慎重に選択し、ランダム化することによって、この問題を軽減できる場合があります (3.1.30) 。

注記 3: 残差誤差 (3.1.6) 、純粋ランダム誤差 (3.1.9) 、および仕様ミス誤差に関連して重要なのは、再現性標準偏差 (ISO 3534-2:2006, 3.3.7) および再現性標準偏差という用語です。 (ISO 3534-2:2006, 3.3.12)実際の実験計画が再現性条件 (ISO 3534-2:2006, 3.3.6) または再現性条件 (ISO 3534-2:2006, 3.3.6) に従っている場合、実験計画のコンテキストに直接適用されます。 3534-2:2006, 3.3.11)、それぞれ。

例：

因子A とB を含む 2 因子実験で構成される 3.1.2 の例 2 に戻ると、実験誤差は、反復された治療組み合わせでの実行ごとの変動と、次の影響を受ける可能性のある全体的な変動を通じて明らかになります。実験の実施に伴う時間の経過に伴う体系的な傾向。

3.1.11

デザイン領域

デザインスペース

予測子変数の許容値のセット (3.1.4)

注記 1: 状況によっては、設計領域は個々の予測変数の範囲によって決定され、長方形 (または高次元では超長方形) になります。ただし、1 つの予測子変数の範囲が別の予測子変数の妥当な値に影響を与える可能性がある場合、領域は長方形である必要はありません。たとえば、ケーキを焼くという単純な実験 (3.1.1) の場合、オーブンに入れる時間は論理的に温度設定に依存します。

3.1.12

因子レベル

係数の設定、値、または割り当て (3.1.5)

注記 1:因子水準は、モデル内の予測子変数 (3.1.4) の値を通じて表すことができます。

注記 2:因子のさまざまなレベルで観察された応答は、実験のレベルの範囲内で因子の効果を決定するための情報を提供します (3.1.1) 。これらのレベルの範囲を超える外挿は、モデル (3.1.2) の関係を仮定するための確固たる根拠がなければ通常は不適切です。範囲内の補間は、レベルの数とこれらのレベルの間隔に依存します。通常は内挿することが合理的ですが、実験範囲内で突然の変化を引き起こす不連続または多峰性の関係が存在する可能性があります。レベルは、特定の選択された固定値 (これらの値が既知であるか不明であるかに関係なく) に限定される場合もあれば、調査対象の範囲にわたる純粋にランダムな選択を表す場合もあります。

例：

触媒の公称レベルは、存在する場合と存在しない場合があります。実験室の名目スケール変数には、3 つの施設に対応するレベル A, B, および C を設定できます。熱処理の 4 つのレベルは、100 °C, 120 °C, 140 °C, 160 °C です。

3.1.13

走る

実験的治療

特定の実験ユニット (3.1.24) で使用されるすべての因子 (3.1.5) の特定の設定

注記 1: 最終的に、要因の影響は、予測変数 (3.1.4) での表現と、モデルが実験の結果とどの程度一致するか (3.1.1) を通じて把握されます。

例：

高収率が目的であり、予測変数が温度、期間、触媒の濃度である化学プロセス実験 (3.1.1) を考えてみましょう。これらの設定がすべて許容されると仮定すると、温度 350 °C, 継続時間 30 分、触媒濃度 10% の設定で実行できます。

3.1.14

因子効果

応答変数 (3.1.3) に影響を与える因子 (3.1.5 )

注記 1: 因子効果には、 主効果 (3.1.15) 、分散効果 (3.1.16) および交絡効果 (3.1.18) が含まれます。

3.1.15

主効果

因子効果 (3.1.14) は、期待に関して線形構造モデルのコンテキストで適用可能

注記 1: 線形構造モデルには、加法的線形モデルが含まれ、これには要因実験 (3.2.1) および部分要因実験 (3.2.3) に関連するモデルのクラスが含まれます。主効果は交互作用がゼロのモデルではほとんどの場合容易に理解できます。

注記 2:主効果は、実験が完全にバランスされている場合、他のすべての実行における応答変数を平均することによって推定できます。

注記 3: 2 水準を持つ因子の場合、主効果は水準間の応答の変化に関係します。レベルが -1 (低) および +1 (高) に指定されている場合、因子の主効果は、因子レベルが +1 の場合の平均応答から、因子レベルが - の場合の平均応答を差し引いたものとして推定されます。 1.次のモデルを考えてみましょう。

mml_m21

ここでy 、 µ 、 εは 3.1.6 の例と同じで、 X は先ほど説明した +1 または –1 で、 βは係数X の調整を表します。 βの推定値は、因子X の主効果の半分に等しいことに注意してください。 βが 0 に等しい場合、 X 応答変数の平均に影響を与えないため ( X の水準が +1 または -1 であっても同じです)、 X の主効果は 0 になります。

3.1.16

分散効果

変動に関する線形構造モデルのコンテキストにおける因子効果 (3.1.14)

注記 1: 分散効果 (3.1.16) は重大である可能性がある一方、同じ要因に対応する主効果はほとんど影響を及ぼさない可能性があることを認識することが重要です。この状況では、応答の全体的なレベルに必ずしも影響を与えない要因を使用して、応答の低い変動性または一貫性を実現する機会が得られます。

3.1.17

交流

応答変数 ( 3.1.3) に対する 1 つ以上の他の因子の影響に対する 1 つの因子 (3.1.5) の影響

注記 1: 応答変数 (3.1.1) に対する 1 つの因子の明らかな影響が 1 つ以上の他の因子に依存する場合、交互作用が存在します。このような状況では、これら 2 つ (またはそれ以上) の要因が相互作用するといわれています。交互作用は、2 つ以上の因子の関数である新しい予測変数を定義することによって、モデル (3.1.2) に組み込むことができます。交互作用は、因子の各レベルに対する応答を他の因子の各レベルで差分比較することにより、ある因子のレベルの別の因子のレベルへの依存性を反映します。

注記 2: 相互作用は、応答変数に対するある因子の主効果 (3.1.15) が、別の因子のレベルに応じて矛盾することを示します。図 1 は、非常に強い相互作用から、限定的な相互作用、あるいは相互作用がないものまで、さまざまな現象を示しています。相互作用の存在は、適切な有意性検定を介して、推定された不確実性と関連して評価される必要があります。

注記 3:相互作用は、最初は 2 つの因子のみを含むと考えられ、二元相互作用または一次相互作用と呼ばれます。もちろん、 ab の一次相互作用が因子C のレベルに依存するという意味で、3 つの因子 (たとえばA 、 B 、およびC ) が相互作用する可能性があります。この場合、二次相互作用が存在します。同様の、3 次、4 次、および高次の相互作用が考えられます。一次相互作用は、高次相互作用に比べて、図や言葉で説明するのが比較的簡単です。

注記 4: 3.1.2 の例 3 は、2 つの因子とそれらの間の二元交互作用または一次交互作用τ_ijを使用した実験 (3.1.1) の正式なモデル表現を提供します。

図 1 —相互作用プロット

Key

Y	応答
X1	インタラクションなし
X2	限られたインタラクション
X3	非常に強力な相互作用

3.1.18

交絡効果

別の因子効果と区別できない因子効果 (3.1.14)

注記 1:交絡効果は、ブロック (3.1.25) に対応するため、または検討中の実験ユニット (3.1.24) の数を増やさずに別の要素を導入するために、設計段階で作成されることがあります。交絡効果は高次相互作用である可能性があります (3.1.17) 。

例：

3 つの因子A 、 B およびC を含む 2 ³完全実施要因計画 (3.2.5) を考えてみましょう。 4 番目の因子D 、その因子レベル (3.1.12) を A 、 B 、およびC のレベルの積に等しく設定することによって導入できます (各因子のレベルが -1 または +1 としてコード化されていると仮定します)ファクターD 2 日間にわたって実験を行うためのブロックとして使用できます。 D の値は、対応する 4 つの実行が実行される日を決定します ( D = −1 は月曜日を意味し、 D = +1 は火曜日を意味します)表 1 を参照してください。この場合、因子D は三者交互作用ABCと混同されます。

表 1 — 8 回の試行による当日のブロックの実験計画

A	B	C	A×B×C	D	裁判の日
-1	-1	-1	-1	-1	月曜日
+1	-1	-1	+1	+1	水曜日
-1	+1	-1	+1	+1	水曜日
+1	+1	-1	-1	-1	月曜日
-1	-1	+1	+1	+1	水曜日
+1	-1	+1	-1	-1	月曜日
-1	+1	+1	-1	-1	月曜日
+1	+1	+1	+1	+1	水曜日

3.1.19

混乱させる

2 つ以上の因子効果 (3.1.14) を互いに区別できないように同等にする

注記 1:設計段階では、交絡は、たとえば、一部の計画された実験 (3.1.27) で指定されたブロック (3.1.25) を効果的に使用できるようにする重要な手法です。これは、特定の因子効果 [通常は高次相互作用 (3.1.17) ] をあまり興味のないものとして意図的に事前に選択し、他のより重要な主効果を維持しながらブロック効果と混同するように設計を調整することによって実現されます。 (3.1.15) およびそのような複雑さのない主要なインタラクション。交絡は、実験計画の試行数を減らすために意図的に使用される場合があります (3.1.29) 。ただし、場合によっては、実験の実行中の設計への不注意な変更 (3.1.1) や設計の不完全な計画によって混乱が生じ、実験の有効性が低下したり無効になったりすることがあります。

注記 2: 交絡とは、分析段階において、より関連性の高いコントラストを推定できる精度を向上させるために、重要でない効果 (通常は高次相互作用に対応する) のコントラストに関する情報 (3.1.22 ) を犠牲にする装置です。。

例：

3.1.18 の例を続けると、因子D ( ABCに等しい) を代わりに使用して、合計 8 つの実験単位を使用して因子A 、 B 、およびC とともに調査できる新しい因子を決定することもできます (3.1.24) 。

3.1.20

エイリアス

別の因子効果または他の因子効果の関数と等しい因子効果 (3.1.14)

注記 1: 実験 (3.1.1 ) において別の因子効果 (高次交互作用 (3.1.17) など) と意図的に混同される主効果 (3.1.15 ) は、その因子効果の別名です。エフェクトはお互いのエイリアスです。

例：

3 つの因子 (3.1.5) A 、 B およびC を含む 2 ³実験 (3.2.5) を考え、すべての因子の水準が -1 または +1 でwhere D=ABおよびE=BCを設定します。 5 つの因子を使用すると、 A 、 B 、 C 、 D 、 E の 5 つの主効果があります。 10 個の一次相互作用があります — ab 、 AC 、 … 、 de ; 10 個の二次相互作用があります — ABC 、 ABD 、…、 CDE ; 3 次相互作用は 5 つあります — ABCD 、 ABCE 、 … 、 BCDE ; 4 次相互作用 ( ABCDE) が 1 つあります。因子のレベルは、実験におけるこれらの因子の設定を示します。特に、実験を行うには因子A 、 B 、 C 、 D およびE のレベルが必要です。表 2 は、これらの因子のレベルと、いくつかの選択された相互作用に対応する計算されたレベルを示しています。

表 2 —意図的に交絡させた因子の効果

A	B	C	D	E	I=ABD=BCE=ACDE	AC=BCD=ABE=DE	AE=CD=ABC=BDE
−1	−1	−1	1	1	1	1	-1
1	−1	−1	−1	1	1	−1	1
−1	1	−1	−1	−1	1	1	1
1	1	−1	1	−1	1	−1	-1
−1	−1	1	1	−1	1	−1	1
1	−1	1	−1	−1	1	1	-1
−1	1	1	−1	1	1	−1	-1
1	1	1	1	1	1	1	1

最初の 5 つの列には次のようなエイリアスがあります: A = BD = ABCE = CDE 、 B = AD = ce = ABCDE 、 C = ABCD = be = ADE 、 D = ab = BCDE = ACE 、およびE = ABDE = BC = ACD 。
表 2 の最初の 5 列は、実験を行うために必要なもののすべてです。分析段階では、エントリは効果を推定するために使用されます。たとえば、因子A の場合、 A の主効果は、 A の +1 のレベルでのy _i の平均 (2 ^nd 、4 ^th 、6 ^th 、および 8 ^th ) からy _i ' の平均を引いたものです。 s は A の -1 のレベル (1 ^st 、3^番目、5 ^th 、および 7 ^th )表 2 の同じエントリを持つ列は、同じ推定効果値を生成します。調べてみると、列D とab 、列E と列BCと同様に同一です。列D とE 意図的にこの方法で構築されました。列B 、 AD 、 ce およびABCDEには同一のエントリがあることがわかり、数値的に等しい効果推定値が生成されます。同様に、列AC 、 BCD 、 ABE 、およびde は同一であるため、これらの因子は相互のエイリアスになります。表 2 のエントリは、他のすべての可能な相互作用を含めるように拡張できます。検査すると、エイリアスは次のように 4 つのグループで発生していることがわかります: ( I 、 ABD 、 BCE 、 ACDE )、 ( A 、 BD 、 ABCE 、 CDE )、 ( B 、 AD 、 ce 、 ABCDE )、 ( ab 、 D 、 ACE 、 BCDE )、 ( C 、 ABCD 、 be 、 ADE )、 ( AC 、 BCD 、 ABE 、 de )、 ( BC 、 ACD 、 E 、 ABDE )、および ( ABC 、 CD 、 ae 、 BDE )
5 つの因子を使用すると、大文字の「 I 」で示される全体の平均項 (つまり、切片項) を含む、合計 32 の効果を推定できます。考えられる各エフェクトは、他の 3 つのエフェクトとエイリアスされます。

3.1.21

曲率

応答変数 (3.1.3) と予測変数 (3.1.4) の間の直線関係からの逸脱

注記 1:曲率は量的予測変数では意味を持ちますが、カテゴリー (名目) または非定量 (順序) 予測変数では意味がありません。曲率の検出には 2 レベル以上の因子が必要です。場合によっては、複製された中心点 (因子の高い設定と低い設定の中間に設定された因子) により、曲率の検出と評価が可能になることがあります。あるいは、曲率を観察するには、因子レベル (3.1.12) の範囲を拡大する必要がある場合があります。

注記 2:放物線曲率は、次のような形式で簡単にモデル化できます。

mml_m22

γ がゼロから逸脱すると、単純な線形関係に対して放物線状の曲率の証拠が存在します。より複雑なタイプの曲率を表現するには、より複雑なモデルが必要です。

3.1.22

対比

すべての係数がゼロに等しくない場合に係数の合計がゼロになる応答変数 (3.1.3) の値の一次関数

注記 1:観測値y ₁ , y ₂ ,…, y _n の場合、一次関数 mml_m24 すべてのa _i がゼロに等しいわけではありません。実験計画におけるコントラストの役割は、後続の例で示すように、効果を比較および調査することです。

例 1:

係数は 3 つのレベルで適用され、結果はy _1, y _2, およびy ₃で表されます。考えられる多くの質問の中で、実験の第 1 レベルと第 3 レベルでの応答の違いを考えてみましょう (中間レベルは一時的に無視します)このクエリを評価するための適切なコントラストは、– y ₁および + y ₃ (つまり、 y _{3 −}y ₁ ) です。レベルが等間隔である場合、応答パターンが単純な線形傾向ではなく二次曲率 (3.1.21) を示すという証拠があるかどうかについて 2 番目の質問が行われる可能性があります。ここで、 y ₁とy ₃の平均はy ₂ (つまり、 y ₁ − 2 y ₂ + y ₃ ) と比較できます。 (曲率がなかった場合、 y _{2 は}y ₁とy ₃を結ぶ線の近くにあるはずです。) この例は、連続変数の回帰タイプの研究を示しています。多くの場合、コントラスト係数には分数よりも整数を使用した方が便利です。このような場合、コントラスト 2 の係数は (-1, +2, -1) として表示されます。

例 2:

因子の離散レベルを扱う別の例では、別の質問のペアが生じる可能性があります。供給源が 3 つあり、そのうちの 1 つであるA ₁が新しい製造技術を使用し、他の 2 つであるA ₂とA ₃が従来の製造技術を使用しているとします。

質問 1: 新しい技術を使用するベンダーA ₁は、古い技術を使用するA ₂およびA ₃とは異なるように見えますか? y ₁をy ₂とy ₃の平均と比較します (つまり、2 y ₁ – y ₂ － y ₃ )
質問 2: 通常の手法を使用する 2 つのサプライヤーは異なりますか? y ₁とy ₃を対比します (つまり、 y ₂ – y ₃ )結果の解釈は異なりますが、コントラスト係数のパターンは前の問題のパターンと似ています。

3.1.23

直交コントラスト

対応するペアで乗算すると積の和がゼロになるという条件を満たす係数を持つコントラストのセット (3.1.22)

注記 1: 直交対比の目的は、実験状況における関心のある仮説の独立したテストを容易にすることです。例 1 では、2 つのテストを独立して実行できます (一方のテストは他方のテストとは関係がなく、さらに残留誤差が正規分布に従うという条件で)一方、例 2 では、テストは依存しており、たとえば、1 つのテストでの拒否は、他のテストでの拒否の可能性が高いことを示している可能性があります。

例 1:

次の 2 つの直交対比が与えられます。

	_y	_y	_y
a _i ₁コントラスト 1	-1	0	+1
a _i ₂コントラスト 2	-1	2	-1
a _{i 1}a _{i 2}	+1	0	-1
Σ a _i ₁a _i ₂ = 0, したがって直交

例 2:

次の 2 つのコントラストは直交していません。

	_y	_y	_y
a _i ₁コントラスト 1	-1	0	+1
a _i ₂コントラスト 2	0	-1	+1
a _i ₁a _{i 2}	0	0	+1
Σ a _i ₁a _i ₂ = +1, したがって直交ではない

3.1.24

実験ユニット

実験材料の基本単位

注記 1:実験単位は製造プロセス全体である可能性があり、その場合、実行はプロセスのセットアップに対応し、応答変数 (3.1.3) (おそらく多変量) は製品の特性に対応します。別の状況では、実験単位は、それぞれが特定の実験的治療を受ける研究の個々の被験者 (実験動物または人間の参加者) である可能性があります。

注記 2:農業環境では、実験材料の基本単位は土地区画である。農業という設定は、実験単位の自然な説明としてのプロットを示唆していますが、概念はより一般的であり、他の状況にも適用できます。その後、プロットは実験ユニットという一般的な用語に置き換えられます。

3.1.25

ブロック

実験ユニットのコレクション (3.1.24)

注記 1: 効果を発揮するには、ブロックはある意味で均質な実験単位のセットを表す必要があります。「ブロック」という用語は、風への曝露、地下水への近さ、耕作可能な層の厚さなどの共通の条件を持つセクションに畑を細分化する農業実験 (3.1.1) に由来しています。他の状況では、ブロックは原材料のバッチ、オペレーター、1 日に学習されるユニット数などに基づいています。より一般的には、ブロックは、国の地域、工場のグループ、時間枠 (製造施設のシフトなど) などで構成されます。

注記 2:一般に、ブロックの存在の認識は、対象となる実験処理 (3.1.13) が実験単位にどのように割り当てられるかに影響を与える可能性があります。操作上、ブロックへの割り当てを指定する追加の「人工」処理がモデル (3.1.2) 内で作成されます。したがって、治療は、研究で直接関心のある因子効果 (3.1.14) と割り当てのブロックに関連する治療で構成されます。このアイデアは、ブロックが確立されていないと不明瞭になるであろう重要な因子の効果を認識する可能性を高めることです。

3.1.26

ブロッキング

実験ユニットのブロックへの配置 (3.1.25)

注記 1:各ブロック内で、残差誤差 (3.1.6) は、ブロックに関係なく、同様の数のユニットが実験的処理 (3.1.13) にランダムに割り当てられた場合に予想される値よりも小さくなることが予想されます。

注記 2:ブロックは通常、研究対象の要因 (主な関心の要因) として導入された要因に加えて、割り当て可能な要因の影響を考慮して選択されますが、ブロックのすべてについて一定に保つことは困難、または不可能ですらあります。完全な実験内の実験ユニット。これらの割り当て可能な原因の影響はブロック内で最小限に抑えることができるため、より均質な実験サブ空間が得られます。実験結果の分析では、実験をブロックする効果を考慮する必要があります。

3.1.27

計画された実験

明確な目的と実装構造を備えた実験 (3.1.1)

注記 1:適切に設計された実験の目的は、実験から有効かつ適切な結論に達するための最も効率的かつ経済的な方法を提供することです。

注記 2: 計画された実験には、応答変数 (3.1.3) と所定の因子水準 ( 3.1.12) を伴う実験的処理 (3.1.13) を含む実験計画 (3.1.28 ) が関連付けられる。応答変数を予測変数に関連付けるモデルのクラスも想定できます。

3.1.28

実験計画

各実験ユニット (3.1.24) への実験処理 (3.1.13) の割り当て

注記 1: 実験的処理の割り当てには、処理が適用される時間順序またはランダム化された順序も含まれる可能性があります。

注記 2:実験計画は、実験に含まれる実験処理 (因子のレベルまたはそのようなレベルの組み合わせ) を実験単位に割り当てるスキームと考えることができます。計画行列 (3.2.25) には、実験における各因子の指定されたレベル (および分析段階で使用される予測変数の値を示す他の列) が含まれています。

注記 3:実験計画は、有効かつ効率的かつ経済的な方法で研究課題に答えるためにどのように観察/測定を行うべきかを決定するものです。

注記 4:この定義は、比較的非効率であることが知られている計画の可能性を排除するものではありません (たとえば、「一度に 1 要素ずつ」計画は、定義によれば実験計画ですが、それが効率的であることを示唆するものではありません)推奨されること。

3.1.29

実験計画

実験を実施するための意図された手順の仕様（3.1.1）

注記 1:実験計画は、理想的には、設計された実験から有効かつ適切な結論に達するための最も効率的かつ経済的な方法を提供する可能性を提供する必要があります (3.1.27) 。実験に適切な計画の選択は、対処する質問の種類、結論に付加する一般性の程度、高い検出確率 (検出力) をもたらす効果の大きさなどの考慮事項によって異なります。望ましいこと、実験ユニットの均一性（3.1.24）、実験のコストと方法。実験計画は、実験を実施するためのプロトコルを確立します。

注記 2:適切に計画された実験は、比較的単純な統計分析と結果の解釈から効果的な結果をもたらすことがよくあります。ただし、実験の計画が不十分な場合は、結果を高度に分析したにもかかわらず、実験の目的を達成できない可能性があります。

注記 3: 実験計画の策定は非常に困難な場合があります。プロセスの詳細な説明は付録 D に記載されています。

3.1.30

ランダム化

各実験ユニット (3.1.24) に特定の実験処理 (3.1.13) が割り当てられるチャンスが等しい戦略。

注記 1:ランダム化は、モデル (3.1.2) で明示的に考慮されていない原因によるバイアスから保護しようとします。ランダム化により、潜在的な時間的または空間的効果がさらに中和される可能性があります。割り当ての等しい確率は、実験単位のコレクションのサブセット内にある可能性があります。

注記 2:実用的な観点からは、抽出された最終的な実験単位がこの段階で「ランダムに」選択されたように見えないように、非置換のサンプリングが治療への実験単位の割り当てを決定する可能性があります。ただし、割り当ての開始時に各実験ユニットが選択されるチャンスは等しいため、最終的にはランダムな割り当てが行われます。

3.1.31

直交配列

実験的治療 (3.1.13) の組み合わせのセット。因子のすべてのペアについて、各治療の組み合わせが可能な因子レベル (3.1.12) 全体で同じ回数発生します。

注記 1:直交配列に関連する強度の概念は、直交配列の可能な使用法の 1 つであるスクリーニング計画 (3.2.8) で生じます。強度d の計画は、任意のd 因子における完全な要因計画です。強度 1 は、各因子のレベルが同じ回数発生することを意味します (バランス因子と呼ばれることもあります)直交配列の強度は 2 です。サブセットサイズd 強度として知られています。

3.1.32

自由度

推定できる線形独立効果の数

注記 1: 非公式には、自由度は、制限なく自由に変化できる量の数です。

注記 2:自由度は通常、分散計算の分母に関連付けられます。自由度の値は、サンプルサイズから、計算される量に関連付けられた制約の数を引いたものです。分散計算で標本平均によって母集団平均を推定すると、自由度が 1 減り、標本サイズをn として自由度n −1 が得られます。サンプル平均とサンプルからのn -1 個の値がわかれば残りのデータ値が確立されるため、1 つの自由度が削除されます。

注記 3: 自由度は、統計的推定および統計的検定において標本分布として発生する特定の理論的分布のパラメータです。たとえば、カイ二乗分布、 F 分布、 t 分布などです。

3.1.33

1因子実験

単一の因子 (3.1.5) が応答変数 (3.1.3 ) に及ぼす影響 (存在する場合) について調査される計画された実験 (3.1.27)

注記 1: 1 因子実験のモデルは次のとおりです。

mml_m25

どこ

y _ij	因子のi 番目のレベルでのj 番目の複製の応答変数です。
_μij	因子のi 番目のレベルでの平均応答です。
ε_ij	は、他のすべての効果と変動源を捕捉する確率変数です。

このモデルは、応答変数y _ijをµ_i (因子の対応するレベルに応じて) および残差ε_ij to 関連付けます。 μ_i の差は、応答変数 (この場合、因子のレベルの関数としての平均応答値) に対する因子の影響を反映します。
このモデルの別の表現は次のとおりです。

mml_m26

どこ

y _ij	は応答変数です。
μ	全体の平均応答です。
αi_i	因子のi 番目のレベルによる増分効果です。
ε_ij	は、他のすべての効果と変動源を捕捉する確率変数です。

上記の表現では、 α_i 値は合計がゼロになると仮定されています。

3.1.34

二要素実験

応答変数 (3.1.3) に対する考えられる影響について 2 つの異なる因子 (3.1.5) を同時に調査する計画された実験 (3.1.27 )

注記 1: 2 つの因子が相互作用せずに作用する場合でも、主効果 (3.1.15) という用語は必然的に適用されます。つまり、各因子の主効果は応答変数の平均への寄与です。 3.1.2 の例 2 を参照してください。

3.1.35

k ファクター実験

多要素実験

k 個の異なる因子 (3.1.5) が応答変数 (3.1.3) に及ぼす影響について同時に調査される計画された実験 (3.1.27)

3.1.36

レプリケーション

特定の治療の組み合わせまたは予測変数の設定が複数回発生する（3.1.4）

注 1: 実験上の制約により、レプリケーションがランダムな順序ではなく順次に実行されることが規定される場合があります。非公式には、そのような状況は繰り返しに相当しますが、この用語に関する普遍的な一致は存在しません。したがって、ISO 3534 のこの部分の目的では、レプリケーションは、予測変数の固定レベルのセットに対するさまざまな応答値の達成を含む用語になります。

注記 2:単一の観測から得られるよりも信頼性の高い推定値を得るために、実行は何度も繰り返されます。複製の機能には 2 つの機能があります。(a) 純粋な誤差の推定値を提供し、(b) 実験結果の信頼性を高めます。

注記 3: ISO 3534 のこの部分で使用される複製は、特に測定システム分析に関連する ISO 3534-2:2006) で与えられる再現性と再現性の概念とは区別されるべきです。 ISO 3534 のこの部分に関連する実験状況では、再現には測定の不確実性に加えてプロセス自体からの影響も含まれます。

3.1.37

キューブポイント

因子レベルのベクトル (3.1.12) 形式 ( a ₁ , a ₂ , …, a _k ) の設定where 各a _i 、因子のコード化されたレベルの表記法として +1 または -1 に等しい (3.1.5)

注記 1:これらの点は、まさに、 k 因子による 2 レベルの完全要因実験 (3.2.2) or 部分要因実験 (3.2.3) で見つかるタイプの点です。中心複合設計のコンテキストでは、最大 2,000 個の^k ポイントを使用できます [(3.2.19) の例 1 を参照

例：

v 因子の中心複合計画を構築するには、+1 または -1 としてコード化された階乗レベルを持つ 2 ^v 階乗配置、または少なくとも分解能V を持つ 2 ^vp分数階乗配置を取得します。

3.1.38

スターポイント

因子レベル (3.1.12) のベクトル設定 ( a ₁ , a ₂ , …, a _k )ここで, 一方のa _i α or −αに等しく、もう一方のa _i は 0 に等しく、因子のコード化レベル (3.1.5)

注記 1: すべての星点には、 +α or −αに等しい単一の非ゼロ成分があります。 k ファクター中心複合設計では、通常、合計 2 ^k 個のスターポイントが使用されます。

例：

スター点は、設計中心からβの距離にある各設計変数の軸上の 2 つの軸点です。 v 軸があるため、このプロセスでは次の形式の 2 つの^v スター点が生成されます: ( ±β , 0, …, 0), (0, ±β , 0, …, 0), …, (0, 0, …) 、 ±β ）。これらの点は軸点とも呼ばれ、サーフェスの曲率の推定に役立ちます。

3.1.39

中心点

因子レベルのベクトル (3.1.12) 形式 ( a ₁ , a ₂ , …, a _k )ここで, すべてのa _i 0 に等しい設定 (3.1.5)

注記 1:中心点のすべてのエントリはゼロであるため、ベクトルは、コード化された変数の実験計画 (3.1.28) の中心に対応する形式 (0, 0, ... , 0) になります。これらの点の数、たとえばn ₀は、応答曲面設計 (3.2.19) のさまざまな目的を達成するために選択されます。中心点は、調査中のプロセスの純粋な誤差 (3.1.9) の推定値を取得するために複製されることがあります (3.1.36) 。星形 (五芒星)、立方体形 (角の立方体)、および中心点 (⊕) を備えた 2 つのデザインのグラフィック表示を図 2 および 3 に示します。

図 2 —星形、立方体、中心点の 2 次元図

図 3 —星形、立方体、中心点の 3 次元図 (立方体の 1 点は非表示)

例：

実験領域の中心を表す点。コード化されたレベルでは、これらは通常 (0, 0, …, 0) として表されます。これらは、各因子の値が階乗部分で使用される値の中央値である点または実験の実行です。この点は、実験の精度を向上させるためによく再現されます。追加する中心点の数も、設計特性によって異なります。

3.1.40

回転性

適合モデル ( 3.1.2) から予測される応答変数 (3.1.3) が、計画の中心からすべての等距離で同じ分散を持つ計画実験 (3.1.27) の特性。

注記 1: 任意の点x における予測応答の分散が中心点からのx の距離のみに依存する場合、計画は回転可能です (3.1.39) 。この特性を持つ計画は、 x での予測分散を変更せずに、その中心点を中心に回転できます。

注記 2:回転可能性は、応答曲面設計 (3.2.19) にとって望ましい特性です。

3.2 実験の準備

3.2.1

要因実験

1 つ以上の因子 (3.1.5) を使用し、因子の 1 つに少なくとも 2 つの水準を適用した計画実験 (3.1.27)

注記 1: 「要因実験」という用語は、完全な要因実験 (3.2.2) よりも一般的です。

注記 2: 交差因子: 実験において一方のすべてのレベルが他方のすべてのレベルと出現する場合、2 つの因子は交差します (3.1.1)ネストされた因子: A レベルまたは値がA のレベルまたは値ごとに異なる場合、因子 A は別B 因子B 内にネストされます。ネストされた要因または効果には階層関係があります。 (3.2.21を参照)

3.2.2

完全要因実験

要因の水準のすべての可能な組み合わせからなる要因実験 (3.2.1) (3.1.5)

注記 1:すべての交互作用 (3.1.17) と主効果 (3.1.15) は、完全な要因実験から推定できます。

注記 2:完全要因実験は、通常、各要因の水準数の積として記号的に記述されます。たとえば、因子A の 3 レベル、因子B の 2 レベル、および因子C の 4 レベルに基づく実験は、3 × 2 × 4 階乗と呼ばれます。これらの数値の積 (この場合は 24) は、個別の実行の合計数を示します。

注記 3:完全要因実験にすべて同じレベル数を持つ因子が含まれる場合、その記述は通常、因子の数k に等しいレベル数で与えられます。したがって、それぞれ 3 水準の 2 つの因子を使用する実験は 3 ²完全階乗 ( k は 2 に等しい) と呼ばれ、異なる実験処理が与えられる 9 つの実験単位が必要です。

3.2.3

部分要因実験

完全な要因実験 (3.2.2) のサブセットで構成される要因実験 (3.2.1)

注記 1:通常、この割合は、可能な実験的治療の組み合わせ全体の単純な割合です。たとえば、2 分の 1 の分数、4 分の 1 の分数などが一般的です。

注記 2:すべての交互作用 (3.1.17) と主効果 (3.1.15) は部分要因実験から推定することはできません。

注記 3: 一部実施要因計画は、完全実施要因計画の実験実行の慎重に選択されたサブセット (一部) から構成される実験計画 (3.1.28) です。サブセットは、実験の実行とリソースの点で完全要因計画の労力の一部を使用しながら、主効果と低次相互作用を利用して研究対象の問題の最も重要な特徴に関する情報を明らかにするように選択できます。それによってスクリーニング計画が得られます (3.2.8) 。他の状況では、実験単位間の不均一性を考慮してサブセットを選択することもでき、これにより、例えばラテン方陣計画 (3.2.11) or グレコ・ラテン方陣計画 (3.2.12) が得られます。

3.2.4

2レベルの実験

すべての因子 (3.1.5) が最大 2 つの因子水準 (3.1.12) を仮定する完全要因実験 (3.2.2)

3.2.5

^k 階乗実験

k 因子 (3.1.5 ) を使用した完全要因実験 (3.2.2 )、それぞれ 2 つの因子水準 (3.1.12)

注記 1: 2 レベルの完全実施要因実験は、 p 個の利用可能な因子のそれぞれが 2 つのレベルのみで調査される完全実施要因実験です。通常、実験の初期段階では、「重要な少数」の要因を発見するために、多数の潜在的な要因を調査する必要があります。これらの段階では 2 レベルの要因実験が使用され、不要な影響がすぐに除去されるため、重要な影響に注意を集中できます。

例：

2 ⁴要因実験は、プロセス収率に対する 4 つの要因 (圧力、温度、触媒、オペレーター) の影響を調査するのに適している可能性があります。 A を圧力 (低または高)、 B を因子温度 (低または高)、 C を触媒 (存在または不在)、 D を演算子 (2 つのうちの 1 つ) に対応させます。

表 3 — 2 ⁴要因実験

実験ユニット	処理	A	B	C	D
1	(1)	−	−	−	−
2	a	+	−	−	−
3	b	−	+	−	−
4	ab	+	+	−	−
5	c	−	−	+	−
6	交流	+	−	+	−
7	紀元前	−	+	+	−
8位	ABC	+	+	+	−
9	d	−	−	−	+
10	広告	+	−	−	+
11	BD	−	+	−	+
12	アブド	+	+	−	+
13	CD	−	−	+	+
14	acd	+	−	+	+
15	BCD	−	+	+	+
16	あいうえお	+	+	+	+

2 ⁴要因実験は、表 3 にリストされているように、16 の異なる実験処理 (3.1.13) で構成されます。記号「-」と「+」は、各要因の 2 つの可能なレベルを示します。多くの場合、マイナスは因子の低レベルを指し、プラスは高レベルを意味します。ただし、レベルに対するシンボルの指定は任意です。
上の表に示されている次数は標準イェーツ次数として知られており、計算が手動で実行される場合、分析段階で役立つ可能性があります。これらの処理が実行される実際の順序は、ランダム化によって決定される必要があります (3.1.30) 。最初の因子A 、交互の符号 (–、+、–、+ など) でリストされます。 2 番目の要素B 、2 つのマイナスと 2 つのプラスを交互に配置します。ファクターC 4 つのマイナスと 4 つのプラスのセットを交互に配置します。最後に、係数D は、実験ユニット 1 ～ 8 ではマイナスに設定され、実験ユニット 9 ～ 16 ではプラスに設定されます。ISO 3534 のこの部分の後半では、マイナス記号は -1 として指定され、プラス記号は +1 として指定されます。。
上の表の 2 番目の列は、治療を説明するための代替表記を示しています。小文字の存在は、対応する大文字の因子のレベルが高いことを示します。さらに、文字がない場合は、対応する因子が低レベルにあることを意味します。すべての要因が低レベルの場合を「(1)」と表記します。
完全要因実験では、すべての主効果と相互作用を明確に推定できます (テストはできません) 2 ⁴の場合、4 つの主効果 ( A 、 B 、 C 、 D )、6 つの二元 (一次) 相互作用 ( ab 、 AC 、 AD 、 BC 、 BD 、 CD )、4 つの三元相互作用 ( 2 次) 交互作用 ( ABC 、 ABD 、 ACD 、 BCD ) と 1 つの 4 方向 (3 次) 交互作用 ( ABCD )実際には、3 方向および 4 方向の交互作用は無視できると想定される場合があるため、これらの自由度で残留誤差 (3.1.6) を推定する機会が提供されます。あるいは、一部の複製によってテストの機会が提供されることもあります。
それぞれの効果 (たとえば、 A による効果、 A とB の間の相互作用、さらにA 、 B 、 C およびD の間の 4 方向の相互作用) は、表 4 に示すコントラスト係数を使用して推定できます。
たとえば、 A の主効果を推定するための式は ( − 1) y ₁+ (1) y ₂+…+ (1) y ₁₆ここで, 応答は表に示されている順序に関連付けられています。

表 4 — 2 ⁴完全実施要因計画の計画行列

I	A	B	C	D	ab	交流	広告	紀元前	BD	CD	ABC	ABD	ACD	BCD	あいうえお	y
1	−1	−1	−1	−1	1	1	1	1	1	1	−1	−1	−1	−1	1	_y
1	1	−1	−1	−1	−1	−1	−1	1	1	1	1	1	1	−1	−1	_y
1	−1	1	−1	−1	−1	1	1	−1	−1	1	1	1	−1	1	−1	_y
1	1	1	−1	−1	1	−1	−1	−1	−1	1	−1	−1	1	1	1	_y
1	−1	−1	1	−1	1	−1	1	−1	1	−1	1	−1	1	1	−1	_y
1	1	−1	1	−1	−1	1	−1	−1	1	−1	−1	1	−1	1	1	_y
1	−1	1	1	−1	−1	−1	1	1	−1	−1	−1	1	1	−1	1	_y
1	1	1	1	−1	1	1	−1	1	−1	−1	1	−1	−1	−1	−1	_y
1	−1	−1	−1	1	1	1	−1	1	−1	−1	−1	1	1	1	−1	_y
1	1	−1	−1	1	−1	−1	1	1	−1	−1	1	−1	−1	1	1	_y
1	−1	1	−1	1	−1	1	−1	−1	1	−1	1	−1	1	−1	1	_y
1	1	1	−1	1	1	−1	1	−1	1	−1	−1	1	−1	−1	−1	_y
1	−1	−1	1	1	1	−1	−1	−1	−1	1	1	1	−1	−1	1	_y
1	1	−1	1	1	−1	1	1	−1	−1	1	−1	−1	1	−1	−1	_y
1	−1	1	1	1	−1	−1	−1	1	1	1	−1	−1	−1	1	−1	_y
1	1	1	1	1	1	1	1	1	1	1	1	1	1	1	1	_y

3.2.6

2 ^{k −p} 部分階乗実験

分数要因実験 (3.2.3) そのサイズは 2 ^k 要因実験 (3.2.5) のサイズの 2 ^−p分数です。

注記 1: 因子数が多い場合 (3.1.5) 、2 ^k では実現可能よりも多くの実行が必要になる場合があります。慎重に選択することにより、部分要因実験から完全要因実験とほぼ同じ量の情報を得ることができます (3.1.2) 。特に、選択は通常、実際上重要であると予想される主効果 (3.1.15) と交互作用 (3.1.17) が、無視できると予想される交互作用 (3.1.19) とのみ混同されるように行われます。

注記 2:p 1 に等しい場合、結果の分数実施要因実験は半分の分数になります。 p が 2 の場合、結果として得られる部分階乗実験は 4 分の 1 になります。など。

例：

6 つの因子と 16 回の実行による実験 (3.1.1) を考えてみましょう。表 5 を参照してください。この例は、16 の実験的治療の組み合わせで 6 つの因子すべてを調べるために交絡を使用する実験計画の構築を示しています。特に、2 ^6-2部分実施要因計画が構築されます。因子のうち 4 つ ( A 、 B 、 C およびD ) の水準は、完全な要因実験を実行するかのように設定できます。この完全な要因コンテキストでは、すべての主効果と高次交互作用 (たとえば、二元交互作用ab 、 AC 、 AD 、 BC 、 BD 、 CD 、三元交互作用ABC 、 ABD 、 ACDおよびBCD 、および4 方向のインタラクションABCD )実際には、3 方向および 4 方向の交互作用が重要になることはほとんどありませんが、データから確実に推定できます。これらの高次相互作用 (3 元および 4 元) の重要性が考えられないことを考慮して、研究者は、 E のレベルとして割り当てることによって、この設計段階で他の要因 (たとえばE およびF ) を実験に組み込むことができることに気づきました。およびF 、高次相互作用の特定の選択。たとえば、 E のレベルは 3 者間相互作用ABCに対応するように割り当てられ、 F のレベルは 3 者間相互作用BCDに割り当てられます。この割り当てにより、推定に同じコントラストが使用されるため、三者交互作用ABCの推定値が新しく割り当てられた因子E の推定値と同一であることが保証されます。しかし、 ABC がゼロに近い可能性が高いという実際的かつ一般的な出来事を考慮すると、研究者は、 ABCとE の推定値が実質的にE 単独の推定値であると推定または結論付けることができます (つまり、 ABC相互作用がゼロであると推定します)

表 5 — 4 分の 1 分数のレイアウト

	A	B	C	D	E=ABC	F=BCD
1	−1	−1	−1	−1	−1	−1
2	1	−1	−1	−1	1	−1
3	−1	1	−1	−1	1	1
4	1	1	−1	−1	−1	1
5	−1	−1	1	−1	1	1
6	1	−1	1	−1	−1	1
7	−1	1	1	−1	−1	−1
8位	1	1	1	−1	1	−1
9	−1	−1	−1	1	−1	1
10	1	−1	−1	1	1	1
11	−1	1	−1	1	1	−1
12	1	1	−1	1	−1	−1
13	−1	−1	1	1	1	−1
14	1	−1	1	1	−1	−1
15	−1	1	1	1	−1	1
16	1	1	1	1	1	1

式E=ABCおよびF=BCD は、因子A 、 B 、 C およびD に関して適切なレベルの因子E およびF を生成するため、関係を生成します。別の有用な構造は次のように与えられます。
I = ABCE = BCDF = ADEF
これは、この設計の定義関係として知られています。「 I 」という用語は ID 列を表します (すべてのエントリは +1 に等しい)すぐにわかるように、この定義関係には、この特定の実験計画に関連する交絡に関する情報がすべて含まれています。 ia = AI = A 、 IB = bi = B 、 I = A²= B²= C²などの規則を使用すると、生成関係E = ABC はEE = ABCE と等価になり、EE = ABCEはI =と等価になります。 ABCE 。同様に、 F = BCD はI = BCDFになります。関係の定義は、一般化交互作用ABCE × BCDF = ABCEBCDF = ABBCCDEF = AIIDEF = ADEFを評価することで完了します。したがって、定義関係はI = ABCE = BCDF = ADEF となります。

注記 3: 2 ^{k −p} 部分実施要因実験は、 k 因子が 2 つのグループ ( k-p因子を持つ一次グループとp 因子を持つ二次グループ) にあるとみなして構築されます。一次グループのk−p因子は、計画の実験単位数である 2 ^{k −p} 実験単位の完全階乗に割り当てられます。各実験単位の二次グループの各因子のレベルは、一次グループの因子のレベルによって定義されます。 1 次グループの因子に関して 2 次グループの因子を定義するp 個の方程式のセットは、計画を生成するため、生成関係と呼ばれます。生成関係のp 方程式を使用して、設計のプロパティを決定するために使用できる定義関係の 2 ^p − 1 方程式を計算できます。前の例では、 k= 6, p= 一次因子はA 、 B 、 C 、 D であり、二次グループはE とF でした。生成関係はE=ABCおよびF=BCDでした。定義関係は、 I = ABCE = BCDF = ADEF でした。

3.2.7

設計解像度

2 ^kp部分要因実験に関連付けられた定義関係内の最短の単語の長さ (3.2.6)

注記 1:設計解像度は、主効果 (3.1.15) および二元相互作用および高次相互作用 (3.1.17) の間のエイリアシング ( 3.1.20) の範囲を示します。

注記 2:設計解像度は、特定の実験計画におけるエイリアシングを表します。数値の長さは通常、大文字のローマ数字で表されます。最も一般的な実際の 3 つの状況は、決議III 、 IV 、およびV です。

解像度 III デザインの場合、主効果は他の主効果とエイリアス化されません。この観察は、定義関係の式を調べることによって行うことができます。たとえば、 I = ABD = BCE = ACDE には、式I 、 ABD 、 BCE 、 ACDEが含まれており、各用語の文字数 (この例ではそれぞれ 1, 3, 3, 4) がカウントされます。これらのうち、1 ( I に相当) を除いた最も短い長さは 3 であり、これが最も短い「文字列」の長さとして知られています。少なくとも 1 つの主効果は、双方向交互作用でエイリアスされます。たとえば、 I = ABDの場合、 A はBDと混同され、 B はADと混同され、 D はab と混同されます。
解像度 IV デザインの場合、主効果は他の主効果や双方向相互作用とエイリアス化されません。少なくとも 1 つの双方向インタラクションは、別の双方向インタラクションとエイリアスされます。たとえば、定義関係I = ABCE = BCDF = ADEF には、式I 、 ABCE 、 BCDFおよびADEFが含まれます。各用語の文字数を数えると、それぞれ 1, 4, 4, 4 になります。 1 ( I に対応) 以外の最小値は 4 で、これはこのデザインの最短の「文字列」の長さとして知られています。たとえば、 I = ABCEの場合、 ab はce と混同され、 ACはbe と混同され、 ae はBCと混同されますが、 A はBCEと混同され、 B はACEと混同され、 C はABEと混同され、 E は混同されます。 ABC付き。
解像度 V 設計の場合、主効果と二元交互作用は、他の主効果や他の二元交互作用とエイリアス化されません。たとえば、 I = ABCDEの場合、各主効果は 4 元交互作用 ( A はBCDEと交絡) と交絡し、各 2 元交互作用は 3 元交互作用 ( ab はCDEと交絡) であることが明らかです。）。

注記 3: 高次の相互作用が無視できる場合に限り、解像度が高くなるほど、より多くの効果 (主または相互作用) を明確に推定できます。同じ数の因子と実験単位を含む 2 つの潜在的な計画が選択された場合、より高い解像度を持つ計画を選択する必要があります。幸いなことに、実際に重要なk とp のほとんどのケースについて、最も適切な定義関係が記録されています (たとえば、参考文献 [2], p. 410, または参考文献 [3], p. 272 を参照)

注記 4: 完全要因計画 (3.2.1) には交絡がない。ほとんどの実用的な目的では、解像度 V 設計が優れており、解像度 IV 設計が適切である可能性があります。解像度 III の設計は、経済的なスクリーニング設計として役立ちます (3.2.8) 。

注記 5:因子名は、定義の目的上、単一の文字で表現されると想定されます。

例：

(3.1.20) の表 2 は、定義関係I=ABD=BCE=ACDEを持つ解像度 III 設計のケースを示しています。

3.2.8

スクリーニングデザイン

さらなる研究のために因子（3.1.5）のサブセットを特定する計画された実験（3.1.27）

注記 1:このような実験計画 (3.1.28) は通常、主効果 (3.1.15) の調査に焦点を当てますが、相互作用 (3.1.17) の存在により解析が複雑になり、場合によっては追加の効果が必要になります。曖昧さを解決するための実験的処理 (3.1.13) 。

注記 2:スクリーニング計画を用いた実験の主な目的は、多くのそれほど重要ではない主効果から少数の重要な主効果を選択するか、除外することです。スクリーニング計画は、主効果計画と呼ばれることもあります。

例 1:

2 ^{k - p} 部分要因計画 (3.2.6) (特に高度に分割された計画) は、スクリーニング計画とみなすことができます。 G=ACD 、 H=ABD 、およびJ=ABCDの生成関係が追加された 3.2.6 の例を考えてみましょう。対応する計画は、わずか 16 の治療組み合わせで 9 つの因子を研究することを目的とした 2 ^9-5部分要因計画です。分析段階では、さらなる調査のために主要な 2 つまたは 3 つの予測変数 (主効果に関連する) を特定することが望まれます。

例 2:

Plackett と Burman [4] は、実行数が 4 の倍数である 2 レベルのスクリーニング計画のコレクションを提案しました。それらの設計は一般に、調査対象の主効果の数が実行数に近づく状況で選択されます (3.1.13) 。たとえば、表 6 に示す 12 回のプラケット-バーマン計画を使用して、最大 11 個の主効果をスクリーニングすることができます。ただし、この設計では、二元交互作用 (たとえば、 ab ) の存在により、主効果C 、 D 、 ... 、 K の推定が損なわれます。

表 6 — Plackett-Burman 12 ラン計画のレイアウト

トライアル	A	B	C	D	E	F	G	H	I	J	K
1	1	−1	1	−1	−1	−1	1	1	1	−1	1
2	1	1	−1	1	−1	−1	−1	1	1	1	−1
3	−1	1	1	−1	1	−1	−1	−1	1	1	1
4	1	−1	1	1	−1	1	−1	−1	−1	1	1
5	1	1	−1	1	1	−1	1	−1	−1	−1	1
6	1	1	1	−1	1	1	−1	1	−1	−1	−1
7	−1	1	1	1	−1	1	1	−1	1	−1	−1
8位	−1	−1	1	1	1	−1	1	1	−1	1	−1
9	−1	−1	−1	1	1	1	−1	1	1	−1	1
10	1	−1	−1	−1	1	1	1	−1	1	1	−1
11	−1	1	−1	−1	−1	1	1	1	−1	1	1
12	−1	−1	−1	−1	−1	−1	−1	−1	−1	−1	−1

注記 3:プラケット・バーマン計画の多くはアダマール行列に関連しており、当初は理論的な文脈で開発されましたが、後に実験計画に有用であることが認識されました。アダマール行列は、単一の列 (または同等の行) のみの知識から容易に構築できます。考えられる構成の 1 つとして、一番下の行をマイナス記号だけで構成するとします。残りの列エントリは、最初の列を右にシフトして 1 行ドロップすることにより、最初の列から取得され、11 番目のエントリが行位置 1 に移動します。この手順は、マトリックスが完成するまで列全体で継続されます。行列を構築するための特別な最初の列の例を以下に示します。それぞれの構造について、次のように最初の列のプラス記号の位置を示すだけで十分です。

n	列 1 にプラス記号を含む行
1	21, 2, 4, 5, 6, 10
20	1, 2, 5, 6, 7, 8, 10, 12, 17, 18
24	1, 2, 3, 4, 5, 7, 9, 10, 13, 14, 17, 19

n = 12 について上に示した行は、例 2 で詳述した設計と一致することに注意してください。Plackett-Burman 設計の多くは、単一の列の要素をベースとして使用し、この一般的な方法で構築できます。 n = 28, 52, 76, 92, および 100 の場合は、プラス 1 とマイナス 1 の単一の特別な列から構築することはできません (詳細については、参考文献 [3] を参照)あるいは、一部の統計ソフトウェアパッケージは、Plackett-Burman 計画行列 (3.2.25) を提供できます。

注記 4: Aguchi の直交配列 (3.1.31) L12 は、上記の Plackett-Burman 12 ラン計画と同等です。 L20 は、Plackett-Burman 20 ラン設計と同等です。注意として、L 配列規則では通常、アダマール構造で提供される順序とは異なる順序で計画行列のエントリが与えられます。

注記 5:プラケット・バーマン計画は、過飽和設定 (つまり、実験実行よりも多くの因子を含むケース) での使用に適合させることができます。過飽和設計の詳細については、参考文献 [5] および [6] を参照してください。

3.2.9

ブロックデザイン

ブロック (3.1.25) を使用する計画された実験 (3.1.27 )

注記 1:ブロック設計は、実験ユニットのサブセットの均一性を利用します (3.1.24) 。実験単位間の不均一性が実験計画で無視されると、観察される変動が増加するため、実験 (3.1.1) から得られる情報量が減少する可能性があります。設計でこの状況を考慮すると、設計目標を達成する実験の能力を高めることができます。ブロックデザインは、既知の迷惑効果、または応答には寄与するものの、実験の目的に比べて本質的には重要ではない効果の影響を打ち消すことを目的としています。均一性の推定が正しくない場合、ブロック化により自由度が減少するというコストが発生します。

3.2.10

ランダム化されたブロック設計

各処理が各ブロックに 1 回だけ現れるように、 b ブロック (3.1.25 ) にν 実験処理 (3.1.13) を配置したブロック設計 (3.2.9)

注記 1:これらの設計は、不均一性の設定を一方向に排除する場合、つまり、実験ユニット内の不均一性が単一の要因によって引き起こされ、不均一性を引き起こすこれらの要因のレベルが処理数と同じである場合に役立ちます。

注記 2:各処理は各ブロックに現れるため、これは完全なブロック設計であり、これをより適切な用語で「ランダム化された完全なブロック設計」と呼びます。治療の反復数はブロック数と同じです。処理は各ブロックにランダムに割り当てられます。

注記 3:これは、フィッシャーによって与えられた 3 つの設計原則 (複製、ランダム化、およびローカル制御) をすべて使用した最も単純な設計です。

3.2.11

ラテンスクエアデザイン

すべてのシンボルが各行に 1 回、各列に 1 回正確に出現するように、 v 行とv 列に配置されたv ²位置でv ² 実行 (3.1.13) を配置した部分要因計画 (3.2.3 )

注記 1:ラテン方格計画では、最大 3 つの要素 (行、列、文字) を調査できます。記号v ラテン方陣の次数を表します。

注記 2:記号をA 、 B 、 C 、 D とすると、次数 4 のラテン方陣配置は次のとおりです。

A	B	C	D
B	C	D	A
C	D	A	B
D	A	B	C

ラテン方格計画は通常、実験材料の不均一性を 2 つの方向で除去する必要がwhere 実験で使用されます。つまり、実験材料の不均一性が 2 つの要因によって引き起こされ、それぞれのレベルが相互分類される実験状況のwhere です。他の。実験単位における不均一性を引き起こす要因のレベルは、治療のレベルと同じです。これらの設計では、複製の数が処理の数と等しいことが必要です。この例では、合計 16 の治療の組み合わせが発生しますが、完全要因計画 (3.2.1) では 64 の治療の組み合わせが存在します。

注記 3:割り当てによるランダム化。ラテン方格計画の定義によれば、処理はさまざまな方法でν²実験ユニットに割り当てることができます。したがって、特定の次数のラテン方陣が多数存在します。これらは、標準形式 (最初の行と最初の列のシンボルが自然な順序である場合) または準標準形式 (最初の行のシンボルが自然な順序である場合) にすることができます。ランダム化の目的は、これらの正方形の 1 つをランダムに選択することです。以下は、ラテン方陣をランダムに選択する方法の 1 つです。
ν × ν の標準的なラテン方陣配置を、ラテン文字A 、 B 、 C など、または数字 1, 2, 3 などで治療を示すことによって最初に記述します。このような配置は、統計学者および生体認証者用の表で簡単に入手できます^[7] 。これらの正方形のうちの 1 つは、次の 5 × 5 のラテン正方形のように体系的に書くことができます。

A	B	C	D	E
B	C	D	E	A
C	D	E	A	B
D	E	A	B	C
E	A	B	C	D

ランダム化の目的で、ラテン方陣の行と列がランダムに再配置されます。行や列内でランダム化はできません。
たとえば、次は上記の 5×5 ラテン方陣の行をランダム化した四角形です。

A	B	C	D	E
B	C	D	E	A
E	A	B	C	D
D	E	A	B	C
C	D	E	A	B

次に、上記の行のランダムな正方形の列がランダムに再配置され、次のランダムな正方形が得られます。

E	B	C	A	D
A	C	D	B	E
D	A	B	E	C
C	E	A	D	B
B	D	E	C	A

個々のユニットのランダム化ではなく、行と列のランダム化の結果、全体の配置はラテン方陣のままになります。

注記 4:基本的な仮定は、これらのブロック因子は研究対象の最も関心のある因子と相互作用しない (3.1.17) 、またはそれらの間で相互作用しないということです。この仮定が有効でない場合、残差誤差 (3.1.6) の尺度が増加し、因子の効果がそのような交互作用 (3.1.17) と混同されます。場合によっては、他の対象因子がブロック位置で使用されるため、ブロック因子であるという指定のない 3 つの因子が存在する可能性があります。これは、交互作用がないと仮定した部分要因実験 (3.2.3) と同等です。部分要因実験の計画によってはラテン方格を形成するものもあります。交互作用に関して行われている仮定をより完全に理解するには、部分要因実験の観点から問題にアプローチする方が望ましい場合があります。

3.2.12

グレコ・ラテン語の正方形のデザイン

部分実施要因計画 (3.2.3) には、それぞれh レベル (3.1.12) の 4 つの因子 (3.1.5 ) が含まれ、因子のいずれかの水準と他の 3 つの因子の水準の組み合わせが 1 回出現します。サイズh ²の実験で 1 回だけ

注記 1: グレコ・ラテン平方計画には4 つの因子が含まれており、3 つのブロック (3.1.25) 因子に従って分類されたh ² ( h ≥ 3, 正の整数) 実験単位 (3.1.24) があります (たとえば、行因子、列因子、ギリシャ文字）各因子はh レベルを持ちます。主に関心のある因子にはh レベルがあり、各実験処理 (3.1.13) がすべての行と列h 1 回正確に表示され、ギリシャ文字も正確に表示されるように、h ²実験単位にランダムに割り当てられます。一度。

注記 2: 2つのラテン語正方形は、一方の正方形の各文字がもう一方の正方形の各文字とちょうど 1 回一致する場合、直交していると言われます。直交するラテン方陣デザインのペアを組み合わせて、グレコラテン方陣デザインを作成できます。

注記 3: グレコ・ラテン方形計画では、3 つのブロック変数を組み込むことができます。これらの変数はすべて、主に関心のある因子の水準数と同じ水準数を持ちます。

例：

4 × 4 のグレコラテン正方形の例は次のとおりです。

αα	Bβ	Cγ	Dδ
Bδ	Aγ	Dβ	Cα
Cβ	Dα	Aδ	Bγ
Dγ	Cδ	Bα	Aβ

係数 1 は行によって与えられます。係数 2 は列で示されます。因子 3 はラテン文字で表されます。主 (および 4 番目) 因子はギリシャ文字で表されます。

3.2.13

不完全なブロック設計

実験的処理 (3.1.13) をすべて含まないブロック (3.1.25) を少なくとも 1 つ含むブロック設計 (3.2.9)

注記 1: 実際には、ランダム化ブロック設計 (3.2.10) は、必要な治療数に対応するのに十分な数の実験ユニットが各ブロック内で利用可能であることを強調する「完全な」ブロック設計として構築できます。

注記 2:材料がブロックに分割され、そのブロックに特定の処理を割り当てることが望ましい場合、処理が多すぎて各ブロックに表示できない可能性があります。ブロックに含まれる処理の複製が完全ではない場合、そのブロックは不完全と呼ばれます。

3.2.14

バランスの取れた不完全ブロック設計

BIBD

不完全なブロック設計 (3.2.13) 。各ブロック (3.1.25) には、主に関心のある因子 (3.1.5) のl 水準からの同じ数k の異なる水準 (3.1.12) が含まれており、すべてのペアが次のように配置されています。レベルの数は、 b 個のブロックから同じ数λ個のブロックで発生します。

注記 1: 「バランスの取れた」という用語は、一貫した数のペアリングを指し、「不完全」は、各ブロックで主に関心のある因子のすべてのレベルを検査できないことを指し、「ブロック」は、実験を実施する戦略を指します。 (3.1.1) 実験単位の同種セットに関する (3.1.24) 。

例 1:

4 つの実験処理と 6 つのブロック、ブロックごとに 2 つの処理 ( l = 4, k = 2, b = 6, λ = 1) がある状況を考えてみましょう。より具体的には、主に関心のある因子の 4 つのレベル ( T ₁ 、 T ₂ 、 T ₃ 、 T ₄ ) を研究する必要があるが、1 日に実行できるのは 2 つのレベルのみであるとします。実験の実施に 6 日間の時間があれば、次の計画が適切です。

日	_T	_T	_T	_T
1	∗	∗
2			∗	∗
3	∗		∗
4		∗		∗
5	∗			∗
6		∗	∗

この例では、考えられるすべての処理ペアが同じブロック内で 1 回発生します。

例 2:

主に関心のある因子が 6 レベル、ブロックごとに 3 レベルのブロックが 10 個ある状況を考えてみましょう ( l = 6, k = 3, b = 10, λ = 2)この場合、6 つのレベルに対して 20 個の可能なトリプレットがあるため、実際には 20 個のブロックが必要であると疑うのは自然なことです。以下の一連の処理をここで, 各ブロックはトリプレットによって与えられます。
( T ₁ 、 T ₂ 、 T ₃ )、 ( T ₁_、T ₂ 、 T ₄ )、 ( T ₁ 、 T 3 、 T ₅ )、 ( T ₁ 、 T ₄ 、 T ₆ )、 ( T ₁ 、 T _{5 )}_、T )
( T ₂ 、 T ₃ 、 T ₆ )、 ( T ₂ 、 T ₄ 、 T ₅ )、 ( T ₂ 、 T ₅ 、 T ₆ )、 ( T ₃ 、 T ₄ 、 T ₅ )、 ( T ₃ 、 T _{4 )}_、T )
ここで、レベルの各ペアは 10 ブロックのセット内で正確に 2 回発生しますが、同じブロック内では 1 回だけ発生するため、10 ブロックで十分であることがわかります。

例 3:

ブロックごとに 4 つのレベル ( l = 7, k = 4, b = 7, λ = 2) を持つ 7 つのレベルと 7 つのブロックがある状況を考えてみましょう。

	主に関心のある要因のレベル
ブロック	1	1	2	3	6
	2	2	3	4	7
	3	3	4	5	1
	4	4	5	6	2
	5	5	6	7	3
	6	6	7	1	4
	7	7	1	2	5

注記 2: バランスの取れた不完全ブロック計画は、主因子の各水準が実験で同じ回数 h 出現し、次の関係が成り立つことを意味します。
bk = lh 、 b ≥ l 、およびh ( k − 1) = λ ( l − 1)
上記の方程式の各文字は整数を表すため、バランスのとれた不完全なブロック設計を構築するには、限られた組み合わせセット ( l 、 k 、 b 、 h 、 λ ) のみが可能であることは明らかです。ただし、上記の 3 つの条件を満たす 5 つの整数 ( l 、 k 、 b 、 h 、 λ ) が与えられたとしても、必ずしも BIB が存在するとは限りません。

注記 3: ランダム化 (3.1.30) の場合、ブロックと各ブロック内のレベルを独立してランダムに配置します。

3.2.15

部分的にバランスが取れた不完全なブロック設計

PBIBD

不完全なブロック設計 (3.2.13) 各ブロック (3.1.25) には、主因子 (3.1.5) のl レベルからの異なるレベル (3.1.12) の同じ数k が含まれており、すべてのペアが一致しないように配置されています。同じ数のb ブロック内でレベルの数が同時に発生する

注 1:以下の条件が満たされる場合、 l レベルおよびb ブロックを持つ不完全なブロック設計は、 m (≥ 2) の関連クラスを持つ PBIBD です。

a)各ブロックにはk (< l ) 個の異なるレベルが含まれます。
b)各レベルはh ブロックに表示されます。
c) 以下を満たすレベル間に関係が存在します。
- 任意の 2 つのレベルは、1 番目、2 番目、...、 m 番目のアソシエートのいずれかであり、その関係は対称的です (レベル α がレベル β の i 番目のアソシエイトである場合、レベル β はレベル α の i 番目のアソシエートです)
- 各レベルには ni i 番目のアソシエイトがあり、i = 1, 2, ...、数 ni は選択されたレベルとは無関係です。
d)相互にi 番目に関連する任意の 2 つのレベルがλ_i ブロック ( i = 1, 2, ..., m ) に一緒に出現しますが、すべてのλ_i が等しいわけではありません。

注2:整数l 、 b 、 h 、 k 、 λ 、 λ 、 ...、 λ _m 、 n ₁ 、 n ₂ 、 ...、 n _m 、 mml_m29

, i , j , k = 1, 2, ..., m 次のように接続されます。

mml_m30

例：

次のように、l = 6, k = 4, b = 6, h = 4, n ₁= 1, n ₂= 4, λ₁= 4, λ₂= 2 である状況を考えてみましょう。

	主に関心のある要因のレベル
ブロック	1	1	4	2	5
	2	2	5	3	6
	3	3	6	1	4
	4	4	1	5	2
	5	5	2	6	3
	6	6	3	4	1

この設計では、すべてのレベルがh = 4 回発生し、任意のレベル (たとえば、レベル 1) から開始して、 n ₁= 1 つのレベル (たとえば、レベル 4) がλ₁= 4 のレベル 1 とともに出現することがわかります。ブロックとn ₂= 4 レベル (番号 2, 3, 5, および 6)、 λ₂= 2 ブロックのレベル 1 です。パラメータn ₁ 、 n ₂ 、 λ_{1 、}およびλ₂は、開始レベルに関係なく同じです。

3.2.16

ようでんスクエアデザイン

1 つのブロック (3.1.26) に関してランダム化されたブロックデザイン (3.2.10) を取得するために、行 (または列) を削除または追加することによって特定のラテン方格デザイン (3.2.11) から派生したブロックデザイン (3.2.9) 因子 (3.1.5) と、他の因子に関するバランスの取れた不完全ブロック設計 (3.2.14)

注記 1:ユーデン正方形は、主に関心のある因子のレベルを表すエントリをもつ行列の行と列に関連付けられた 2 つのブロック因子を持つ計画と考えることができます。たとえば、このレイアウトにはレベルと同じ数の列があり、列よりも行の数が少ないと仮定します。各レベルは各行に 1 回出現し、その結果、行ブロック係数に関してランダム化されたブロック設計が得られます。ただし、列ブロック要素に注目すると、バランスのとれた不完全なブロック設計が実現されます。 4 × 4 ラテン方陣の 4 行目を削除すると、3 × 4 の Youden 四角形が得られます。

例 1:

主に関心のある因子には 4 つのレベル ( A 、 B 、 C およびD ) があり、ブロック因子が 2 つあります。1 つは 4 レベル (列)、もう 1 つは 3 レベル (行) です。考えられるレイアウトの 1 つは次のとおりです。

	ブロック係数1 (列)
	1	A	D	C	B
ブロック係数 2 (行)	2	B	A	D	C
3	C	B	A	D
行は含まれていません		D C B A

ブロック因子 2 (列) A 、 B 、 C およびD 、主に対象となる因子の 4 つの水準を示します。

例 2:

次の配列は、4 × 7 の Youden 正方形を示しています。

3	4	5	6	7	1	2
5	6	7	1	2	3	4
6	7	1	2	3	4	5
7	1	2	3	4	5	6

この例では、行がランダム化されたブロック設計を形成し、列がパラメーターl = b = 7, h = k = 4, およびλ = 2 を持つ BIB 設計を形成していることがわかります。

3.2.17

分割プロット設計

設計された実験 (3.1.27) では、主因子に割り当てられた同じ因子水準 (3.1.12) が属する実験単位 (3.1.24) のグループ (「プロット」) が次のように細分化 (「分割」) されます。その因子の各水準内で 1 つ以上の追加の主因子を研究します

例：

因子A の 3 つのレベルを 2 セットの反復でテストします。図 4 に示すように、 A の各レベル内で因子B の同じ 2 つのレベルが研究されます。

図 4 —分割プロット設計

注記 1:この例では、複製 (3.1.36) は主に関心のある第 1 段階の因子 ( A ) に対するブロック (3.1.25) の役割を果たし、 A の 3 つのレベルのいずれかに割り当てられた各プロットはその役割を果たします。 A 内で検討される主な関心事B (プロット因子内) の追加の第 2 段階因子に対するブロックの役割。したがって、プロット内係数B の推定残差 (3.1.6) は、完全な実験 (3.1.1) の残差よりも小さくなるはずです。分割プロット計画では、プロット内の効果とプロット間の効果に対して異なる残差の尺度が取得されます。この設計をさらに拡張して、第 2 段階の因子のレベルに含まれる第 3 段階の因子を導入することが可能です。このタイプの計画は、因子に対して大きな系列または領域がwhere られ、その水準が簡単に変更されず、系列または領域内で他の因子を容易に変更できる場合によく使用されます。

注記 2:このタイプの取り決めは、農業だけでなく工業実験でも一般的です (この名前の由来です)多くの場合、ある一連の因子レベルには大きな実験単位が必要ですが、別の一連の因子レベルはより小さな実験単位で比較できます。たとえば、合金を準備するために使用されるさまざまな種類の炉を比較するには、合金を流し込むさまざまな種類の型を比較する場合よりも、より多くの量の合金が必要になります。炉の種類を第 1 段階の因子のレベル、金型の種類を第 2 段階（プロット内）の因子のレベルとみなします。
また、歯車列を交換するだけで速度を変えることができる大型機械もその例です。これは時間とコストがかかる作業であるため、この要素の変更は頻繁に行わないことが望ましいでしょう。それぞれの速度で製造された材料は、いくつかの技術で熱処理され、さまざまな圧力で成形され、さまざまな研磨剤を使用して平滑化され、これらの要素をあるレベルから別のレベルに移行するのは比較的簡単です。これらの後者はプロット内要因 (または第 2 段階要因) を構成し、速度変動はプロット間要因 (または第 1 段階要因) を構成します。

3.2.18

双方向分割プロット設計

スプリットブロックデザイン

分割プロット設計 (3.2.17) 第 2 段階の因子 (3.1.5) の水準が、各プロット内で独立してランダム化される (3.1.30) のではなく、各複製のプロット全体にわたってストリップに配置されます。したがって、これは 2 つの異なる方法で分割プロット計画と見なされます。

例：

3 × 4 デザインの場合、(ランダム化後の) 適切な配置は図 5 に示すとおりです。

図 5 —双方向の分割プロットレイアウト

注記 1: 二元分割プロット計画では、A とB の主効果 (平均効果) の精度は低くなりますが、交互作用 (微分効果) の精度は高くなります。これらの後者は一般に、ランダム化されたブロックや通常の分割プロット設計よりも正確に決定されます。工業実験では、実際的な考慮事項により、二元分割プロット設計の使用が必要になる場合があります。たとえば、繊維産業では、因子A は過酸化塩素による漂白の異なる手順であり、因子B は漂白剤中の異なる量の過酸化水素である可能性があります。冷却プロセス。

3.2.19

応答曲面設計

最適化される因子 (3.1.5) のサブセットを特定する計画された実験 (3.1.27)

注記 1:応答曲面設計は、応答曲面自体を推定することによって、応答変数と一連の予測変数 (3.1.4) の間の関係を特徴付けます。「応答曲面」は、応答変数の条件付き期待値と考えることができます (予測変数は固定されているが、設計空間全体で変化する可能性があると仮定しています)応答曲面設計の 1 つのタイプは、因子の範囲にわたってほぼ均一に効率的な応答曲面の推定値を与えることを目的とした階乗タイプの実験計画です。応答曲面設計には、応答曲面の曲率 (3.1.21) を調査する実験計画が含まれます。これらの計画では、要因計画 (3.2.1) で使用される回帰方程式の線形形式ではなく、二次回帰方程式を使用することでこれを実現します。応答曲面設計には、中心複合設計 (CCD) やボックスベンケン設計など、さまざまなタイプがあります^[9] 。

注記 2:応答曲面設計の明白な利点は、(連続的であると想定される) 予測変数に対する調整が提案され、応答の「改善」につながることです。

注記 3:応答曲面手法を予期した初期の先駆者は、進化的操作 (EVOP) でした。このプロセスは、最適なパフォーマンスを目指して設定を体系的に変更することで構成されます^[9] 。

例 1:

中心複合設計の例が示されています^[10] 。これは、効率的な設計 [通常は回転可能 (3.1 .40）】。 3 つの予測子変数 (3.1.4) の場合、表 7 に示すセットが中心となる複合計画を構成します。

表 7 —中心複合設計

実験ユニット	_x	_x	_x
1	−1	−1	−1
2	1	−1	−1
3	−1	1	−1
4	1	1	−1
5	−1	−1	1
6	1	−1	1
7	−1	1	1
8位	1	1	1
9	0	0	0
10	0	0	0
11	2	0	0
12	−2	0	0
13	0	2	0
14	0	−2	0
15	0	0	2
16	0	0	−2

実験単位 1 ～ 8 には、 2 ³要因計画 (3.2.5) に相当する計画の立方体点が含まれています。予測変数の水準はコード化された値として与えられます。実験ユニット 9 と 10 がセンターポイント、11 ～ 16 がスターポイントです。最初の 8 つの実験ユニットは、個別に実行できる 2 ³要因計画のユニットを構成します。その後、設計の残りの部分に着手し、その結果を 1 つの分析に組み合わせることができます。実際の治療順序は、全体またはサブグループ内でランダム化する必要があります (3.1.30) 。中央の複合設計により、設計コンポーネントのこの一連の組み立てが容易になります。設計から得られたデータに適合したモデルは、線形 ( x ₁ 、 x ₂ 、 x ₃ )、二次関数 ( x ₁² 、 x ₂² 、 x ₃² )、および双方向交互作用 ( x ₁x _{2 、} x 3 ) で構成できます。 x ₁x ₃ 、 x ₂x ₃ )目的が最適化である場合、推定モデルの最大値または最小値は数値検索によって求めることができます。目的がターゲット値を満たすことである場合、近似関数が評価されて、指定されたターゲットを生成する予測値のセットが特定されます。

注記 4:より少ない因子水準を使用する中心複合計画 (3.1.12) の一般的な変形は、すべての星点に対してα = 1 を設定することによって得られる面心中心複合計画です。因子のレベル数が少ないと、(因子の数に応じて) 回転性が犠牲になる可能性があります。

例 2:

ボックス・ベンケン計画は、バランスの取れた不完全ブロック計画と 2 ^k 要因計画を賢明に組み合わせて構築されます (3.2.14) 。表 8 に示すセットは、3 変数ボックス-ベンケン計画を構成します。

表 8 —ボックスベンケン計画

実験ユニット	_x	_x	_x
1	0	−1	−1
2	0	1	−1
3	0	−1	1
4	0	1	1
5	−1	0	−1
6	1	0	−1
7	−1	0	1
8位	1	0	1
9	−1	−1	0
10	1	−1	0
11	−1	1	0
12	1	1	0
13	0	0	0
14	0	0	0
15	0	0	0

例 3:

五角形計画は 2 要素計画であり、計画点は単位円上の 5 つの等間隔の位置 (コード化された変数の単位を使用) と、場合によっては複製された中心点で構成されます。定義を満たす 5 つの点のセットは、(1, 0)、(0.309, 0.951)、(−0.809, 0.588)、(−0.809, −0.588)、(0.309, −0.951) です。 cos(72°) = 0.309; であることに注意してください。 sin(72°) = 0.951 など

例 4:

六角形計画は 2 要素計画であり、計画点は単位円上の 6 つの等間隔の位置 (コード化された変数の単位を使用) と、場合によっては複製された中心点で構成されます。定義を満たす 6 点のセットは次のとおりです: (1, 0)、(0.5, 0.866)、(−0.5, 0.866)、(−1, 0)、(−0.5, −0.866)、(0.5, −0.866) 。 cos(60°) = 0.5; であることに注意してください。 sin(60°) = 0.866 など

注記 5: 単位円に内接する任意の規則的な幾何学的図形は、応答曲面設計のクラス内の回転可能な設計の基礎として機能できます。

3.2.20

混合設計

混合物を用いた実験計画法

1 つ以上の予測変数 (3.1.4) の合計が固定量になるように制約されている状況を処理するために構築された計画実験 (3.1.27)

注記 1:合金中の金属の割合を表す係数 (3.1.5) は、混合計画の典型的な例です。計画領域 (3.1.9) は、 X ₁+ X₂+ ... + X_k = 1 を満たす必要があります。選択した因子の最小比率など、さらなる制限が適用される場合は、特殊目的の計画も利用できます。混合設計の包括的な取り扱いは参考文献 [8] に記載されています。混合設計は、材料の混合が必要な場合に特に役立ちます。

3.2.21

入れ子になったデザイン

少なくとも 2 つの因子 (3.1.5) と因子間の階層関係を持つ計画された実験 (3.1.27)

注記 1:階層関係は、与えられた各因子レベル (3.1.12) が、順位順でそのすぐ上の別の因子の 1 つのレベルだけと一緒に出現するという意味で、因子間の一種の順位順序を意味します。これらの因子は、ネストされた因子と呼ばれます。言い換えれば、入れ子計画の因子は、下位因子の各レベルがその上位因子のそれぞれの単一レベルに最大で現れるように階層的に順序付けされます。

注記 2:この計画は、関係する因子の分散成分を評価するために使用できます。 3 つの因子A 、 B 、およびC の場合、因子B の各水準は因子A の単一水準のみとともに表示されます。同様に、因子C のすべてのレベルは、因子B の単一レベルのみとともに表示されます。 k 因子ネスト計画ここで, k ≥ 2 は、 k 段階ネスト計画と呼ばれることもあります。

注記 3: 入れ子計画は、要因間に従属的な関係があるプロセスにおける全体的な変動に対する主な要因を特定する機能を提供します。

注記 4: クロス設計 (3.2.1 の注 4 を参照) およびネスト設計は、測定システム分析アプリケーションのコンテキストで一般的に考慮されます。関連する分散成分を推定する際には、因子が入れ子になっているか交差しているかを認識することが重要です。

注5:一般に、入れ子計画は、応答レベルや予測モデルの差ではなく、分散成分(3.1.8) の観点から結果を評価するために使用されます。

例 1:

ネストされたデザインは畜産の状況で発生します。 2 つの農場があり、それぞれに 1 頭以上の雄牛と 2 頭以上の牛の群れがあり、そのうちの 1 頭の雄牛を父親とする子孫を産んでいる状況を考えてみましょう。図 6 のレイアウトは構造を示しています。

図 6 —動物育種における入れ子設計

例 2:

3 つの異なるサプライヤーが 4 つのバッチの原材料を 1 つの会社に提供し、その後そのバッチを分析して純度を決定する状況を考えてみましょう。

図 7 —製造におけるネストされた設計

図 7 に示すように、バッチは各サプライヤー内でネストされています。これは、たとえば、サプライヤー 1 のバッチ 1 は、サプライヤー 2 のバッチ 1 とは異なるためです。バッチの「ラベル」は同じですが、要素バッチとサプライヤーは異なります。交差した。この例は、サプライヤーがそれぞれ異なる数のバッチを提供した場合でも、ネストされたまたは階層的な設計を構成します。図 8 に示すセットアップも、入れ子または階層設計です。

図 8 —別のネストされたデザイン (アンバランス)

ただし、各サプライヤーからのバッチ数が同じであれば、分析はより簡単になります。

3.2.22

バランスの取れた入れ子デザイン

完全にネストされたデザイン

入れ子になった因子 (3.1.5 ) の因子水準 (3.1.12) の数が一定である入れ子になった計画 (3.2.21)

例：

図 9 は、バランスの取れたネストされた設計を示しています。

図 9 —バランスの取れた入れ子設計

この計画は、各研究室が 2 日間 (因子B のレベルの数は 2) を費やし、各研究室で毎日 2 つの測定結果が得られる (因子C のレベルの数は 2) ため、バランスのとれた入れ子になった計画です。）。研究所によって使用される日数は、おそらく特定のテスト期間にわたってランダムに選択されているため、異なる可能性があります。

注記 1: より有意義な情報を取得するために、因子の定義を他の因子と比較できるレベルに調整できる場合があります。 B _{1 は}月曜日に、 B _{2 は}金曜日に割り当てることができます。したがって、月曜日に得られた結果と金曜日に得られた結果を比較できます。したがって、これはすべての研究室に共通するものとなり、無関係な (研究室間で) 2 つの日の割り当てwhere あった以前の状況とは異なります。この構成は、ネストされた分類ではなく交差 (つまり、因子の各レベルが他の因子のすべてのレベルで使用される) を表すため、要因実験として見ることができます。

3.2.23

千鳥状に入れ子になったデザイン

2 番目のネストされた因子 (3.1.5) が最初のネストされた因子の 1 つのレベルに 2 つの因子レベル (3.1.12) を持つが、最初のネストされた因子の他のレベルには 1 つのレベルしかないネストされた計画 (3.2.21)

例：

図 10 は、千鳥状に入れ子になった設計を示しています。

図 10 —千鳥状に入れ子になった設計

注記 1: 交互に入れ子になった計画の場合、すべての因子効果 (3.1.14) は、ほぼ同じ数の自由度で推定されます。

3.2.24

最適な設計

計画された実験 (3.1.27) その因子水準 (3.1.12) 設定は、特定の基準 (通常は計画行列 (3.2.25) の関数) を最適化するように決定されます。

注記 1:特定の基準を最適化する場合、結果として得られる最適な設計は、正しいモデル (3.1.2) が存在することを前提として予測されることに注意してください。推定されたモデルが間違っている場合、最適設計は理論的に (つまり数学的に) 最適であっても、実際の目的には役に立たない可能性があります。それにもかかわらず、この節で前述した実験計画のいくつかは最適な計画と考えることができます。最適設計では、推定モデルのクラス内の指定された基準に対応する最良のパラメータ推定値を取得しようと努めます [例: 予測子変数の二次関数 (3.1.4)

注記 2: ISO 3534 のこの部分の「はじめに」に記載されているように、コストは実験計画において重要な役割を果たします。一定レベルの品質を達成しながらコストを最小限に抑えることが実験の目的となる場合があります。ただし、この条項では、統計的推定に関連する基準に限定されています。想定される形状モデル mml_m31 は、 β の推定を評価する設計を検索する際の開始点です。

3.2.25

デザインマトリックス

個々の実験的処理 (3.1.13) (仮定されたモデルに従って変換される可能性がある) を表す行を含む行列。因子レベルの他の関数 ( 相互作用 (3.1.17) 、二次項など) の推定レベルによって拡張できますが、計画された実験に依存 (3.1.27)

注記 1:与えられた実験計画 (3.1.29) については、想定されるモデルに応じていくつかの設計行列を想定できます。仮定されたモデルに関係なく、割り当て行列は、実験を行うために必要な予測変数 (3.1.4) の設定を表す列のサブセットです。 3.1.27 の注 2 を参照してください。

注記 2:設計またはモデルのマトリックスは、一般にX で表されます。 X の各行は 1 つの治療に対応します。全体の平均項 (たとえば、 µ ) がモデル内にある場合、 X の最初の列はすべて 1 で構成されます。他の列は、因子水準 (3.1.12) または予測子変数の関数を表すことができます。 2 ³要因計画 (3.2.5) の場合、次のように 1 つの可能な計画行列 (8 x 7) が与えられます。

mml_m32

列は順に、 I 、 A 、 B 、 C 、 ab 、 AC 、およびBCに対応します。対応機種は、 mml_m33

where y と ε は 8 × 1 次元のベクトル、 β は 7 × 1 次元のベクトルです。行列X は次元 8×7 で、エントリは与えられたとおりです。転置ベクトル β のエントリは、( μ , β₁ , β₂ , β₃ , β₄ , β₅ , β₆ ) として表すことができます。切片項µ は、エントリとしてすべて 1 を持つ列I に対応します。「 β₁ 」という用語は、因子A に関連付けられた回帰係数です。回帰係数β₁は、因子A に関連する主効果の半分に等しくなります。 β₂を因子B に関連付けたり、 β_{3 を}因子C に関連付けたり、 β₆を相互作用因子BCに関連付けたりするために、同様の記述を行うことができます。

3.2.26

D 最適設計

X ´ X の行列式を最大化する最適設計 (3.2.24) where X は設計行列 (3.2.25))

注記 1:X´Xという表記は、 X とX の転置の乗算を示します。たとえば、X の次元がn×pの場合、 X´Xの次元は次のようになります。 mml_m34 。

注記 2: D 最適計画の基準は、計画行列X に関連付けられた係数の信頼楕円体の体積に関連します。したがって、D 最適設計は、高精度の回帰パラメータ推定値を生成できる設計です。このような設計は多くの統計ソフトウェアパッケージ内で取得できるため、構築の負担が軽減されます。このような設計は、想定されたモデルが正しい限り「最適」ですが、これは設計に対する潜在的な批判を意味します。

例：

使用されている一般的な実験計画の多くはD 最適ですが、この特性は実践者にはすぐに明らかではない場合があります。たとえば、Plackett-Burman スクリーニング計画 (3.2.8) は、主効果モデルに関してD 最適です。

3.2.27

A な設計

X´Xのトレースを最大化する最適設計 (3.2.24) ( where X 設計行列 (3.2.25))

注記 1:A 最適基準は、さまざまな回帰係数の推定量の分散の合計を最小化することと同じです。同様に、最小化は、さまざまな回帰係数の推定量の分散の合計の算術平均です。

3.2.28

G 最適設計

設計領域 (3.1.11 ) にわたる予測の最大分散を最小化する最適設計 (3.2.24)

注記 1:G 最適基準は、設計領域の範囲にわたる回帰関数の最大分散を最小化することです。

注記 2:特定の数学的文脈では、 D 最適性基準とG 最適性基準が同等であることが示されるため、(最適化プロセスを容易にする) G 最適性基準を使用してD 最適設計を得ることができます^[9] 。

3.2.29

直交設計

計画行列 ( 3.2.25 ) X 計画された実験 (3.1.27 )

注記 1:直交計画では、計画行列の列の各ペアは直交します。さらに特定のモデルについては mml_m36 はσ² ( X ' X ) ⁻¹ = σ²I is

3.2.30

飽和したデザイン

計画行列 (3.2.25 ) の列数が実験の実行数と同じである計画実験 (3.1.27)

注記 1:設計内の実験単位よりも多くのパラメータをモデル内で明確に推定することは不可能です。この場合、各パラメータは推定できますが、残差誤差 (3.1.6) の分散を推定するための自由度は残っておらず、仮説検定がさらに不可能になります。

注記 2:過飽和計画 (3.2.8, 注 4 を参照) は、計画内の対象因子の数が実験単位 (3.1.24) where 数を超える状況を調査するメカニズムを提供します (ただし、図に示すように明確ではありません)前のメモ）。

3.2.31

完全にランダムなデザイン

実験単位 (3.1.24) to の実験処理 (3.1.13) の割り当てにおけるランダム化 (3.1.30) の制限のない、計画された実験 (3.1.27)

注記 1:実験単位間の不均一性の可能性に関する事前知識がない場合、完全にランダム化された計画が特に適切である可能性がある (つまり、そのような状況ではブロック (3.1.26) が適切なアプローチとなる)

注記 2:完全にランダム化された計画は、 1 因子実験 (3.1.33) のように他の因子 (変数) を考慮する必要がなく、1 つの主要因子 (3.1.5 ) の効果を研究するために使用されます。

3.3 分析方法

3.3.1

グラフィカルな方法

実験結果の絵による描写 (3.1.1)

注記 1:単純なプロットは、計画された実験の結果に関する初期の効果的な評価を提供できます。この規格のこのセクションに示されている例には、主効果プロット (3.3.2) 、交互作用プロット (3.3.3) 、効果の分位点プロット (3.3.4) 、および残差プロット (3.3.5) が含まれます。

3.3.2

主効果プロット

個々の因子のさまざまな因子 (3.1.5) レベルでの平均応答を示すプロット

例：

表 9 に、この例で使用したデータを示します。データは Box, Hunter, Hunter から引用されています^[2] 。

表 9 —図 3 に関連するデータ

#	触媒	温度	プレッシャー	集中	変換 %
1	−	−	−	−	71
2	+	−	−	−	61
3	−	+	−	−	90
4	+	+	−	−	82
5	−	−	+	−	68
6	+	−	+	−	61
7	−	+	−	−	87
8位	+	+	+	−	80
9	−	−	−	+	61
10	+	−	−	+	50
11	−	+	−	+	89
12	+	+	−	+	83
13	−	−	+	+	59
14	+	−	+	+	51
15	−	+	+	+	85
16	+	+	+	+	78

次の図は、参考文献 [2] の 10.8 から引用した例のプロットを示しています。応答変数は変換率であり、予測変数 (3.1.4) は触媒量 ( A )、温度 ( B )、圧力 ( C )、および濃度 ( D ) です。各予測変数は 2 つのレベルで与えられ、低い場合は「-」、高い場合は「+」で示されます。 2 ^{4 の}完全要因実験 (3.2.5) が実施されました。図 11 から、温度が転化率に最も大きな影響を及ぼし、触媒が 2 番目で、残りの 2 つの要因はかなり同等であることが明らかです。プロット内の接続された線の傾きがゼロから大きく異なるかどうかを評価するには、追加の分析が必要になります。

図 11 —主効果プロット

注記 1:主効果プロットは、各因子のさまざまな水準での応答変数 (3.1.3) の平均値を示します。応答に対する各要因の影響の性質と大きさは明らかです。相互作用 (3.1.17) の存在により、さまざまな要因の影響が隠蔽される可能性があります。

3.3.3

相互作用プロット

別の因子の各水準に対して作成された単一因子 (3.1.5) の主効果プロット (3.3.2)

注記 1: インタラクション (3.1.17) プロットは、インタラクションを解釈するためのグラフィカル検出ツールを提供します。プロットに並列性がないことは、交互作用効果を示しています。 (3.1.17) のプロットを参照してください。

3.3.4

効果の分位点プロット

完全実施要因計画 (3.2.1) or 部分実施要因計画 (3.2.3) における標準正規分位数と推定因子効果 (3.1.14) のプロット

例：

表 10 は、3.3.2 の例に対応する効果の推定値を示しています。

表 10 —推定された要因の影響

A	−8.00	ab	1.00	ABC	−0.75
B	24.00	交流	0.75	ABD	0.50
C	−2.25	広告	0.00	ACD	−0.25
D	−5.50	紀元前	−1.25	BCD	−0.75
		BD	4.50	あいうえお	−0.25
		CD	−0.25

図 12 は、3.3.2 の例から推定された因子効果のプロットを示しています。

図 12 —効果の分位プロット

注記 1: 反復のない実験 (3.1.1) の場合、このプロットは支配的な効果を示唆する可能性があります (つまり、プロットされた点の本体を通る「ガイドライン」線のはるか左またははるか右にある点) 。図 11 では、主効果 (3.1.15) が 24 に等しい右上の点が温度効果に対応します。図 12 では、推定要因効果 (3.1.14) が 24 に等しい右上の点は、表 10 から温度であるB の効果であることがわかります。したがって、これは推定主効果 (3.1.15) の 2 倍です。 ) の温度。

3.3.5

残差プロット

残差 (3.1.7) 対予測子変数の対応する値 (3.1.4) または特定の因子 ( 3.1.5) の因子水準 (3.1.12) のプロット

例：

3.3.2 で示した例は、図 13 に示すモデルとして 4 つの主効果 (3.1.15) とBD 相互作用 (3.1.17) を備えたモデルを使用して続けられます。

図 13 —残差プロット

注記 1:残差プロットは、モデル全体 (3.1.2) の適合度に対する外れ値 (極端な観測値) を特定するのに役立つ場合や、非線形性の指標を提供する場合があります。さらに、可能性のある非定常分散 (すなわち、不均一分散性) を明らかにすることもできます。残差プロット (例: 予測値が増加するための残差の分散の増加)モデルに含まれていない他の利用可能な変数に対する残差の追加プロットは、モデルを改良する必要があることを示す場合があります。

注記 2:残差の分位点プロット (3.3.4) を使用して、残差誤差 (3.1.6) の正規性の仮定からの逸脱を検出できます。

3.3.6

最小二乗法

∑ e ²ここで, e 観測値と仮定されたモデル (3.1.2) から導出された予測値の差であり、その和はすべての実験処理にわたって取得されます (3.1.13)

注記 1:個々の観測に関連する純粋なランダム誤差 (3.1.9) は、通常、独立していると想定されますが、相関誤差を含めるために推論手法を使用することもできます。通常の分散分析 (3.3.8) 、回帰分析 (3.3.7) 、および共分散分析 (3.3.12) はすべて最小二乗法に基づいており、実験結果内の特定のバランスから生じるさまざまな計算上および解釈上の利点を提供します。データの便利なグループ化を可能にする配置。

注記 2:最小二乗法は主に、パラメータが線形である線形モデルまたは線形化モデルに使用されます。

3.3.7

回帰分析

モデルの評価に関連する手順 (3.1.2) 予測変数 (3.1.4) を応答変数 (3.1.3) to 関連付ける

注記 1: 回帰分析は、一般に、目的関数の値を最適化する (たとえば、観察された応答とモデルによって予測された応答の間の差の二乗和を最小化する) ことによって、仮定されたモデルのパラメーターを推定するプロセスと関連付けられます。。統計ソフトウェアパッケージの存在により、パラメーター推定値とその標準誤差の取得が容易になり、豊富なモデル診断が含まれています。回帰分析により、他の対応策の検討も容易になります。たとえば、反復要因計画 (3.2.1 ) で分散効果 (3.1.16) に関心がある場合、 S _i ² ( S _i ²は反復された点の標本分散) の対数を使用した応答がより簡単になる可能性があります。分析され、応答そのものとして解釈されます。

注記 2: 回帰分析は分散分析 (3.3.8) と同様の役割を果たし、因子 (3.1.5) のレベルが連続的で、明示的な予測モデルに重点が置かれている場合に特に関連します。データ間のバランスが必要な分散分析とは異なり、回帰分析は欠損データを含む計画された実験にも使用できます。ただし、バランスが欠けていると、仮説検定の次数依存性 (共通の要素が相関する最初の項に含まれ、後続の項には含まれない) が増加するだけでなく、バランスのとれた実験の他の利点も失われます。バランスの取れた実験の場合、2 つの手法は最小二乗法の単なるバリエーションであり、同等の結果が得られます。

例：

2 ³要因計画 (3.2.5) に 3 つの定量的因子を持ち、単一の反復 (3.1.36) とi 番目の個々の実験単位の仮定モデル (3.1.24) のみを使用した直交計画 (3.2.29) を考えてみましょう。） is

mml_m37

どこ

x _{i 0}	1に等しい。
x _{i 1}	は因子A のレベルです。
x _{i 2}	は因子B のレベルです。
x _{i 3}	は因子C のレベルです。
εi_i	はランダムエラーです。

この定式化では、関連する計画行列 (3.2.25) は [ x _ij ] になります。 where i は行のラベル、 j は列のラベルを表します。このモデルは、-1 および +1 のコード化レベルを持つ 3 つの定性的要因にも適用できます。表 11 では、回帰係数の推定量は応答変数Y _i に関して与えられているため、確率変数です。実験 (3.1.1) が実行されると、応答変数y _i の実現によって式内の確率変数が置き換えられ、一部はb _i を使用して置き換えられます。 mml_m38

適切な表現で。

表 11 —例の回帰分析表

変動の原因	回帰係数の推定器	自由度 (DF)	二乗平均 (MS)
合計	−	8位	−
定数₍X )		1	_Sx0
の回帰 X ₍A )		1	_Sx1
の回帰 X ₍B )		1	_S×2
の回帰 X ₍C )		1	_Sx3
残留物	−	4	S _E 4

表 11 に示す式は、計画行列を使用した行列表記を使用すると、おそらくより簡単になります。一般に、 β_i の推定値は次から取得されます。 mml_m49

。
残差ベクトルは次のように与えられます。 mml_m51

。

注記 3: 2 ^{3 回}の実験が同じブロック内で繰り返された場合、「合計」(1 行目) の自由度は 16 になり、「残差」の自由度は 12 になります。次に、「複製」と「適合の欠如」に関連付けられた 2 つの要素に分割でき、それぞれ自由度は 8 と 4 になります。

表 12 —例 1 の回帰分析表 — 反復実験の補足

変動の原因	二乗和 (SS)	自由度 (DF)	二乗平均 (MS)
残留物	S _E	12	S _E 12
複製		8位	S _R /8
運動不足		4	S _L /4

上記の式に相当する一般的な行列を表 11 に示します。

注記 4:各情報源の統計的有意性は、適切な正規性の仮定の下で、その情報源の平均二乗のF 統計量と適切な残差誤差 (3.1.6) を使用してテストできます。単一の複製状況では、「回帰」項が「残差」項に対してテストされます。 2 つの反復の状況では、「適合不足」項が「反復」[「純粋なランダム誤差 (3.1.9) 」] 項に対してテストされ、モデルが不適切かどうかが判断されます。「反復」という用語は、「残差」という用語に含まれる、モデルの不適切性の潜在的な寄与を除いた実験誤差の尺度を表します [すなわち、仕様ミス (3.1.10)

3.3.8

分散分析

応答変数 (3.1.3) の合計変動を、定義された変動源に関連付けられたコンポーネントに細分化する手法。

注記 1: ANOVA は、分散成分の推定 (3.1.8) とモデルのパラメータに関する仮説の検定 (3.1.2) を容易にします。
分散分析テーブルには通常、次の列が含まれます。

変動の原因。
二乗和 (SS);
自由度 (DF);
平均二乗 (MS) (二乗和を自由度で割った値)
F (誤差に関連する行の平均二乗と平均二乗の比);
期待平均二乗 (モデルのパラメーターに関して与えられる二乗和の数学的期待)

表の行は、特定の因子の効果 (3.1.14) or 交互作用 (3.1.17) 、ブロック (3.1.25) [ 実験計画 (3.1.28) でブロック (3.1.26) が使用された場合], または残差誤差 (3.1.6) (モデルまたはブロックによって考慮されない残りの効果)通常、「合計」と指定された行が表示され、観測値の総数から 1 つ少ない自由度に基づいて、全体の平均に関する平方和の合計が表示されます。

例：

ランダム化されたブロック設計 (3.2.10) を考えてみましょう。ここでは、 h 個のブロックのj 番目の因子A のl 個のレベルのi 番目から得られた観測値がY _ij = ( i = 1, 2, ... で表されます) で表されます。 , l ; j = 1, 2, ..., h )主な関心の因子A 固定因子効果 (3.1.14) を表します。因子B 、ブロッキングに関連する因子効果を表します (3.1.26) 。次の ANOVA 表 (表 13) が計算されます。

表 13 —分散分析表

ソース	二乗平均 (MS)	F	期待平均二乗
合計	−	−	−
ファクターA （処理）
ファクターB （ブロック）
エラー		−	^σ2

ANOVA 表では次のようになります。

mml_m69

mml_m70

F ( ν₁ , ν₂ ) はF 統計量です。
観測に関連する 1 つのモデルは次のように与えられます。
mml_m71

; i = 1, 2, …、 l ; j = 1, 2, …、 h
と

mml_m72

mml_m73

どこ

μ	は一般平均です。
αi_i	i 番目の治療の効果です。
β_j	j 番目のブロックの効果です。
e _ij	は残留誤差です。

この例では、固定レベルが指定されているものとします。 μ 、 α_i 、 β_j およびσ²の最小二乗推定値は次のように取得されます。

mml_m74

mml_m75

mml_m76

mml_m77

ランダム化ブロック設計では各セルに同数の観測値が含まれている必要があるため、この例に示されている式は簡略化されています。上記の式に相当する一般的な行列は、(3.3.7) の表 12 に示されています。

注記 2: ANOVA の基本的な仮定は、変動のすべての原因による効果が相加的であり、純粋なランダム誤差 (3.1.9) が独立して正規分布し、平均がゼロで分散が等しい (等分散性) ということです。）。したがって、ここで挙げた例の場合のように、応答変数の期待値はパラメーター内で加算されると想定されます。この技術は、 F比と組み合わせて、これらの変動要因の影響の有意性検定を提供したり、これらの要因に起因する分散の推定値を取得したりするために使用されます。正規分布の仮定は、この有意性と信頼区間の検定にのみ必要です。平均と交互作用は通常、2 元 (またはk 元) 表に要約することによって検査されます。この例では、モデル 1 または固定効果モデル (3.3.9) を想定しています。誤差の正規分布を仮定できない場合は、応答変数の変換 (対数など) を使用したり、ノンパラメトリック手順を適用したりできる場合があります。

3.3.9

固定効果分散分析

各因子（3.1.5 ）の因子レベル（3.1.12）が因子の値の範囲にわたって事前に選択される分散分析（3.3.8）

注記 1:固定水準では、分散成分 (3.1.8) を計算するのは不適切です。このモデル (3.1.2) は、モデル 1 分散分析と呼ばれることもあります。

3.3.10

変量効果分散分析

各因子効果（3.1.14 ）の各因子レベル（3.1.12）が各因子のレベルの分布からサンプリングされると仮定される分散分析（3.3.8）

注記 1: ランダム水準の場合、主な関心は通常、分散成分 (3.1.8) の推定値を取得することです。このモデルは一般にモデル 2 分散分析と呼ばれます。選択した因子レベルの効果の推定値を計算することは不適切です。

例：

操作で原材料のバッチを処理する状況を考えてみましょう。すべてのバッチの母集団からいくつかのバッチがランダムに選択される場合、「バッチ」は実験におけるランダム要素とみなされる場合があります。

3.3.11

混合モデル分散分析

分散分析 (3.3.8) では、いくつかの因子 (3.1.5) のレベルが固定され、他の因子効果 (3.1.14) のレベルが因子のレベルの分布からサンプリングされます。

注記 1: 分散成分は、ランダム水準因子および固定効果因子との交互作用についてのみ意味を持ちます。さらに、効果の推定は固定要因にのみ適用されます。このモデルは、モデル 3 分散分析とも呼ばれます。

3.3.12

共分散分析

アンコバ

1 つ以上の観察可能だが制御不可能な要因 (3.1.5) が応答変数 (3.1.3) に影響を与える場合に、実験的治療 (3.1.13 ) の効果を推定およびテストするための手法

注記 1: 共分散分析は、回帰分析 (3.3.7) と分散分析 (3.3.8) の組み合わせとして見ることができます。

注記 2:通常、制御不能な要因は実験計画 (3.1.28) では説明できず、結果に対するそれらの影響は分析で説明されるべきである。たとえば、実験ユニット (3.1.24) は、各ユニットに存在する化学成分の量が異なる場合があり、これは測定できますが、容易に調整することはできません。

注記 3:これは、各実験単位から複数の変数について観測値が取得されるwhere をカバーするための分散分析手法の拡張です。

参考文献

1	ISO 10241-1:2011, 国際用語標準 - Part 1: 準備とレイアウト
2	Box 、GEP, H under 、WG および H under 、JS Statistics for Experimenter設計、データ分析、モデル構築の概要。ジョン・ワイリー&サンズ、ニューヨーク、1978年
3	Box 、GEP, H under 、WG および H under 、JS Statistics for Experimenterデザイン、イノベーション、発見。ジョン・ワイリー&サンズ、ニューヨーク、2005年
4	Plackett 、RL, Burman 、JP「最適な多因子実験の設計」、 Biometrika 、 33, 1946 、pp. 305-325
5	L in 、DKJ 新しいクラスの過飽和設計。テクノメトリクス、 35, 1993 、pp. 28-31
6	W u 、CFJ 部分的なエイリアス相互作用による過飽和設計の構築。バイオメトリカ、 8, 1993, pp. 661-669
7	Fisher 、RA, Yates 、F.生物学、農業、医学研究のための統計表。オリバーとボイド、エディンバラ、第 4 版、1953 年
8	JAコーネルは混合物を実験。 ^nd 、ジョン・ワイリー、ニューヨーク、1990 年
9	Box 、GEP およびDraper 、NR経験モデル — 構築および応答曲面。ジョン・ワイリー&サンズ、ニューヨーク、1987年
10	Box 、GEP およびWilson 、KB 最適条件の実験的達成について (議論あり)王立統計協会ジャーナル、シリーズ B, 13, 1951, 1-45 ページ

3 Terms and definitions

For the purposes of this document, the following terms and definitions apply.

3.1 General terms

3.1.1

experiment

purposive investigation of a system through selective adjustment of controllable conditions and allocation of resources

Note 1 to entry: A system is an interacting combination of elements, viewed in relation to function. Deliberate alterations or adjustments are made to a system in order to improve or to understand it. In other words, an experiment is a systematic and objective means of getting unambiguous and valid answers to the questions that the experimenter has in mind by varying controllable factors in a predetermined manner.

Note 2 to entry: A critical aspect to an experiment is control — the investigator has the capability to vary settings, input materials, assignment of procedures to individuals and so forth with the intention of obtaining an understanding of the system efficiently. By proper design and conduct of the experiment, it is possible to attribute causation to the impact of the settings.

Note 3 to entry: Experiments are different from observational studies where the investigators may determine which units are to be studied and the observational process to be observed, but the assignment of experimental treatments (3.1.13) is outside their control.

3.1.2

model

formalized representation of outcomes of an experiment (3.1.1)

Note 1 to entry: The model consists of three parts. The first part is the response variable (3.1.3) that is being modelled. The second part is the deterministic or the systematic part of the model that includes predictor variable(s) (3.1.4) . Finally, the third part is the residual error (3.1.6) that can involve pure random error (3.1.9) and misspecification error (3.1.10) . The model applies for the experiment as a whole and for separate outcomes denoted with subscripts. The model is a mathematical description that relates the response variable to predictor variables and includes associated assumptions. Outcomes refer to recorded or measured observations of the response variable.

Note 2 to entry: The model is a simplified representation of the actual system where only key or fundamental features are considered.

EXAMPLE 1:

The lifetime of a component is related to the environmental conditions that it experiences.

EXAMPLE 2:

A formal model including two factors (3.1.5) is:

mml_m1

where

y_ij	is the response variable at level i of factor A and level j of factor B;
µ	is the overall mean response;
α_i	is the incremental effect of factor A at level i;
β_j	is the incremental effect of factor B at level j;
ε_ij	is the residual error.

The response part of the model consists simply of y_ij . The predictive part of this model is µ + α_i+ β_j consisting of an overall mean response and two terms related to the effects of factors. The random or error part of this model consists of ε_ij that includes inherent variability in the process which produces the response.

EXAMPLE 3:

A commonly used model is:

mml_m2

where

y_ijk	is the response of the kth replicate;
α_i	is the adjustment due to factor 1;
β_j	is the adjustment due to factor 2;
τ_ij	is the adjustment due to interaction of the factors;
ε_ijk	is the residual error.

The terminology “adjustment” is used instead of “incremental effect” as in Example 2, as the formal mathematical model does not include an overall mean term. Furthermore, y_ijk (ε_ijk ) is used in this example rather than y_ij (ε_ij ) to acknowledge the potential existence of replicates.

EXAMPLE 4:

Another formal model is:

mml_m3

where

y_i	is the response corresponding to x_i ;
x_i	is the coded or numerical level of a single factor;
_eβ₀ + β₁x_i + β₂x_i²	represents the mean response corresponding to x_i ;
ε_i	is the residual error.

EXAMPLE 5:

The following model applies for a 2⁴factorial design (3.2.5) :

mml_m4

where the factor x_ji is the coded or numerical level of factor j (j = 1, 2, 3, 4) and observation i (i = 1, 2, …, n).
This model includes an intercept term (µ), four main effect terms (x_1i , x_2i , x_3i , x_4i ), six two-way interaction terms (x_1ix_2i , x_1ix_3i , x_1ix_4i , x_2ix_3i , x_2ix_4i , and x_3ix_4i ), four three-way interaction terms (x_1ix_2ix_3i , x_1ix_2ix_4i , x_1ix_3ix_4i , x_2ix_3ix_4i ), one four-way interaction term (x_1ix_2ix_3ix_4i ) and an residual error (ε_i ). Although the factors in the model can be multiplicative to represent interactions, the model itself is linear in the parameters.

Note 3 to entry: The above description of a model not only applies to the classical linear models with additive error but also to generalized linear models ここで, the error can be described by a variety of distributions including the binomial, Poisson, exponential, gamma and normal distributions. Linearity occurs in Example 4 with a logarithmic transformation applied to the deterministic part of the function. Although the examples given in this terminological entry are linear in the parameters, this is not intended to suggest that such a case will apply in all experimental design situations.

3.1.3

response variable

output variable

variable representing the outcome of an experiment (3.1.1)

Note 1 to entry: The term “dependent variable” is not recommended as a synonym due to potential confusion with “independence” (see ISO 3534-1:2006, 2.4).

Note 2 to entry: It may be that the response variable is vector-valued because several responses are recorded from each experimental unit (3.1.24) .

Note 3 to entry: The response variable is likely influenced by one or more predictor variables (3.1.4) , the nature of which can be useful in controlling or optimizing the response variable.

3.1.4

predictor variable

variable that can contribute to the explanation of the outcome of an experiment (3.1.1)

Note 1 to entry: A predictor variable can be used to model the impact of a categorical factor, e.g. at two levels. For multiple levels of a factor, two or more predictor variables can be devised to represent the distinct categorical levels.

Note 2 to entry: A predictor variable can include a random element in it or it can, for example, be from a set of qualitative classes which can be observed or assigned without random error.

Note 3 to entry: The term “predictor variable” is typically used in the development of a mathematical relationship among the response variable (3.1.3) and the available predictor variable(s) or functions of predictor variables. The term “factor” tends to be used operationally as a means to assess the response variable as particular factors vary.

Note 4 to entry: “Independent variable” is not recommended as a synonym due to potential confusion with “independence” (see ISO 3534-1:2006, 2.4). Other terms sometimes substituted for predictor variable include “input variable”, “descriptor variable” and “explanatory variable”.

3.1.5

factor

feature under examination as a potential cause of variation

Note 1 to entry: The extent to which a given factor can be controlled dictates its potential role in a designed experiment. Factors can be controllable (fixed), modifiable (controllable only for short duration or at considerable expense) or uncontrollable (random).

Note 2 to entry: A factor could be associated with the creation of blocks (3.1.25) .

3.1.6

residual error

error term

random variable representing the difference between the response variable (3.1.3) and its prediction based on an assumed model (3.1.2)

Note 1 to entry: The predicted value of the response variable is based upon an assumed model, the parameters of which are estimated from the data. The residual error is that part of the response variable that is unexplained by those predictor variables, which have been included in the model, and may be due to both systematic and chance causes.

Note 2 to entry: For the purpose of this definition, the term “predicted response value” is understood to be the estimated response for that experimental treatment (3.1.13) determined from the empirical model derived from the data of the experiment (3.1.1) using the assumed model.

Note 3 to entry: Residual error includes pure random error (3.1.9) and misspecification error (3.1.10) . The expectation of the residual error is assumed to be zero.

Note 4 to entry: The variance of the residual error is usually estimated in an experiment by subtracting the pooled sum of squares for terms included in the assumed model from the total sum of squares and dividing by the corresponding difference in degrees of freedom (3.1.32) .

Note 5 to entry: The term “residual error” is used in practice in two different ways. For this part of ISO 3534, the term is used as a random variable associated with the difference between the response variable which is a random variable and the prediction of the response variable which is based on an assumed model.

Note 6 to entry: In cases in which the residual error is estimated from data, the terms sample residual error or empirical residual error are used.

EXAMPLE:

Consider a simple model mml_m8 is the residual error given the predictor variable x.

3.1.7

residual

observed value of the residual error (3.1.6)

EXAMPLE 1:

mml_m9 is a residual corresponding to the model in Example 2 from 3.1.2.

EXAMPLE 2:

mml_m10 is a residual corresponding to the model in Example 3 from 3.1.2.

EXAMPLE 3:

mml_m11 is a residual corresponding to the model in Example 4 from 3.1.2.

EXAMPLE 4:

3.1.8

variance component

part of the total variance of a response variable (3.1.3)

Note 1 to entry: A variance component can be either an individual variance component that is part of the overall variability of the response variable or it could be due to a random variable when modelling the response variable as a sum of independent error terms.

Note 2 to entry: Other models can be envisaged that include nested (see 3.2.21) or crossed factors (see 3.2.1).

Note 3 to entry: In the simplest case, it is conceivable to imagine a model in which the variance of the residual error is the sole variance component (e.g. an experiment where no factors are varied and the experiment consists of repeated measurements on a single unit).

EXAMPLE:

In the model (3.1.2) , mml_m16 are the variance components of y_ij .

3.1.9

pure random error

pure error

part of the residual error (3.1.6) associated with replicated observations

Note 1 to entry: It is a common characteristic of experiments (3.1.1) that, when repeated, results vary from trial to trial, although the experimental materials, environmental conditions and the experimental operations have been carefully controlled. Thus, pure random error is a common occurrence in spite of the best efforts of the experimenter. This variation introduces a degree of uncertainty into conclusions drawn from the results, and consequently, should be considered when reaching decisions.

Note 2 to entry: If only the centre point (3.1.39) in an experimental design (3.1.28) were replicated, then the sample variance of responses at the centre point provides an estimate of the variance of the pure error. If replicates (3.1.36) took place at multiple treatment combinations, then an overall estimate of the variance of the pure error can be achieved by pooling the estimates at these experimental treatments (3.1.13) .

EXAMPLE:

Returning to Example 3 in 3.1.2, an estimate of the variance of pure error for fixed (i, j) is mml_m19

where i = 1,..., I; j = 1,..., J; k = 1,..., n_ij ; mml_m20

Note 3 to entry: The term “pure error” is used in practice in two different ways. For this part of ISO 3534, the term is used as a random variable and refers to a population variance (σ²) in association with a mathematical model. From the mathematical perspective, the pure error can be construed as ε_ij in Example 2, as ε_ijk in Example 3, and ε_i in Example 4 all from 3.1.2.

Note 4 to entry: In cases in which the pure error can be estimated from data (i.e. there are replicates), pure error actually refers to the “sample” or “empirical” pure error which in conjunction with the estimated residual error (3.1.6) provides the basis for a lack of fit test of the model. If the estimated residual error based on a model is reasonably close to the estimated pure error, then the model does not exhibit substantial lack of fit. Since the residual error includes all sources of variation for the difference between response variable and the predictive model, the residual error includes the contribution of pure error. Of the examples illustrating models in 3.1.2, only Example 3 having replicates would facilitate direct estimation of the pure error.

3.1.10

misspecification error

part of the residual error (3.1.6) not accounted for by pure random error (3.1.9)

Note 1 to entry: Misspecification error can be attributed to predictor variables (3.1.4) or a function of the predictor variables that are erroneously omitted from the model of the response variable (3.1.3) .

Note 2 to entry: There could be non-attributable factors including fixed or random factors that may not have been incorporated in the model. This occurs, for example, if the true model is quadratic but the fitted model is linear. As being unknown to the experimenter, these factors are effectively included in the variation from one trial to the next. Inherent factors that actually impact the response variable but were omitted in the model could lead to systematic errors in the experimental results. It may be possible to mitigate this problem through careful selection of the model and by randomization (3.1.30) .

Note 3 to entry: Of related interest to residual error (3.1.6) , pure random error (3.1.9) and misspecification error are the terms repeatability standard deviation (ISO 3534-2:2006, 3.3.7) and reproducibility standard deviation (ISO 3534-2:2006, 3.3.12) which apply in the experimental design context directly if the actual design of the experiment is in accordance with repeatability conditions (ISO 3534-2:2006, 3.3.6) or reproducibility conditions (ISO 3534-2:2006, 3.3.11), respectively.

EXAMPLE:

Returning to Example 2 of 3.1.2, which consists of a two-factor experiment involving factors A and B, experimental error will be manifested in the run-to-run variability at replicated treatment combinations and through the overall variability that could be impacted by systematic trends over time as the experiment is conducted.

3.1.11

design region

design space

set of allowable values for the predictor variables (3.1.4)

Note 1 to entry: In some situations, the design region is determined by the range of the individual predictor variables and is rectangular (or hyper-rectangular in higher dimensions). However, if the range of one predictor variable could influence the reasonable values of another predictor variable, then the region need not be rectangular. For example, in the simple case of an experiment (3.1.1) in baking a cake, the duration in the oven logically depends on the temperature setting.

3.1.12

factor level

setting, value or assignment of a factor (3.1.5)

Note 1 to entry: The factor levels can be represented through the values of the predictor variables (3.1.4) in the model.

Note 2 to entry: Responses observed at the various levels of a factor provide information for determining the effect of the factor within the range of levels of the experiment (3.1.1) . Extrapolation beyond the range of these levels is usually inappropriate without a firm basis for assuming model (3.1.2) relationships. Interpolation within the range may depend on the number of levels and the spacing of these levels. It is usually reasonable to interpolate, although it is possible to have discontinuous or multi-modal relationships that cause abrupt changes within the range of the experiment. The levels may be limited to certain selected fixed values (whether these values are or are not known) or they may represent purely random selection over the range to be studied.

EXAMPLE:

The nominal levels of a catalyst may be presence and absence. The nominal-scale variable for a laboratory can have levels A, B and C, corresponding to three facilities. Four levels of a heat treatment may be 100 °C, 120 °C, 140 °C and 160 °C.

3.1.13

run

experimental treatment

specific settings of every factor (3.1.5) used on a particular experimental unit (3.1.24)

Note 1 to entry: Ultimately, the impact of the factors will be captured through their representation in the predictor variables (3.1.4) and the extent to which the model matches the outcome of the experiment (3.1.1) .

EXAMPLE:

Consider a chemical process experiment (3.1.1) in which a high yield is the objective and the predictor variables are temperature, duration, and concentration of a catalyst. A run could be a setting of temperature of 350 °C, thirty minutes duration and 10 % concentration of the catalyst, assuming that all of these settings are permissible.

3.1.14

factor effect

factor (3.1.5) that influences the response variable (3.1.3)

Note 1 to entry: Factor effects can include main effects (3.1.15) , dispersion effects (3.1.16) and confounded effects (3.1.18) .

3.1.15

main effect

factor effect (3.1.14) applicable in the context of linearly structured models with respect to expectation

Note 1 to entry: Linear structured models include additive, linear models which in turn include the class of models related to factorial experiments (3.2.1) and fractional factorial experiments (3.2.3) . Main effects are mostly readily understood in a model with zero interactions.

Note 2 to entry: The main effect can be estimated by averaging the response variable over all other runs provided the experiment is fully balanced.

Note 3 to entry: For a factor with two levels, the main effect relates to the change in the response between levels. If the levels are designated –1 (for low) and +1 (for high), then the main effect of the factor is estimated as the average response when the factor level is +1, minus the average response when the factor level is –1. Consider the model:

mml_m21

where y, µ and ε are as in the Example to 3.1.6, X is +1 or –1 as just described, and β represents the adjustment for the factor X. Note that an estimate of β is equal to one half the main effect for the factor X. If β were equal to zero, then X does not affect the mean of the response variable (it is the same regardless of the level of X being +1 or –1) so that the main effect of X is zero.

3.1.16

dispersion effect

factor effect (3.1.14) in the context of linearly structured models with respect to variation

Note 1 to entry: It is important to recognize that a dispersion effect (3.1.16) could be significant, whereas the main effect corresponding to the same factor could have little impact. This situation affords the opportunity to achieve low variability or consistency in the responses by using a factor that does not necessarily impact the overall level of the response.

3.1.17

interaction

influence of one factor (3.1.5) on one or more other factors’ impact on the response variable (3.1.3)

Note 1 to entry: An interaction is present if the apparent influence of one factor on the response variable (3.1.1) depends upon one or more other factors. In such a situation, these two (or more) factors are said to interact. Interactions can be incorporated into the model (3.1.2) by defining a new predictor variable that is a function of two or more factors. An interaction reflects the dependence of the level of one factor on the level(s) of another or other factors by providing the differential comparison of the responses for each level of the factor on each of the levels of the other factor(s).

Note 2 to entry: Interaction indicates an inconsistency of the main effect (3.1.15) of a factor on the response variable depending on the level of another factor. Figure 1 indicates these phenomena ranging from very strong interaction, to limited interaction, to no interaction. The presence of an interaction ought to be assessed in relation its estimated uncertainty via an appropriate test of significance.

Note 3 to entry: Interactions are considered initially involving only two factors and are referred to as either two-way interactions or first order interactions. Of course, it is possible that three factors, for example A, B, and C, interact in the sense that the first order interaction of ab depends on the level of factor C. In this case, there is a second order interaction. Similarly, third, fourth, and higher order interactions can be conceived. First order interactions are relatively easy to explain graphically and in words compared to higher-order interactions.

Note 4 to entry: Example 3 in 3.1.2 provides a formal model representation of an experiment (3.1.1) with two factors and the two-way or first order interaction τ_ij between them.

Figure 1 — Interaction plots

Key

Y	Response
X1	No interaction
X2	Limited interaction
X3	Very strong interaction

3.1.18

confounded effect

factor effect (3.1.14) that is indistinguishable from another factor effect

Note 1 to entry: A confounded effect is sometimes created at the design stage in order to accommodate blocks (3.1.25) or to introduce another factor without increasing the number of experimental units (3.1.24) under consideration. A confounded effect could be a high order interaction (3.1.17) .

EXAMPLE:

Consider a 2³full factorial design (3.2.5) with three factors A, B and C. A fourth factor D can be introduced by setting its factor level (3.1.12) equal to the product of the levels of A, B, and C (assuming the levels of each factor are coded as −1 or +1). Factor D could be used as a block to conduct the experiment over a two day period. Values of D determine the day that the corresponding four runs will be conducted (D = −1 implies Monday; D = +1 implies Tuesday); see Table 1. In this case, the factor D is confounded with the three-way interaction ABC.

Table 1 — Experimental plan for blocking on day with 8 trials

A	B	C	A×B×C	D	Day of Trial
–1	–1	–1	–1	–1	Monday
+1	–1	–1	+1	+1	Tuesday
–1	+1	–1	+1	+1	Tuesday
+1	+1	–1	–1	–1	Monday
–1	–1	+1	+1	+1	Tuesday
+1	–1	+1	–1	–1	Monday
–1	+1	+1	–1	–1	Monday
+1	+1	+1	+1	+1	Tuesday

3.1.19

confounding

equating two or more factor effects (3.1.14) so as to be indistinguishable from each other

Note 1 to entry: At the design stage, confounding is an important technique which permits, for example, the effective use of specified blocks (3.1.25) in some designed experiments (3.1.27) . This is accomplished by deliberately pre-selecting certain factor effects [typically high-order interactions (3.1.17) ] as being of little interest, and arranging the design so that it confounds them with block effects, while keeping the other more important main effects (3.1.15) and key interactions free from such complications. Confounding may be deliberately used to diminish the number of trials of the experimental plan (3.1.29) . Sometimes, however, confounding results from inadvertent changes to a design during the running of an experiment (3.1.1) or from incomplete planning of the design, and it serves to diminish, or even to invalidate, the effectiveness of an experiment.

Note 2 to entry: At the analysis stage, confounding is a device that sacrifices information about contrasts (3.1.22) of insignificant effects (typically corresponding to high-order interactions) in order to improve the precision with which more relevant contrasts can be estimated.

EXAMPLE:

Continuing with the example of 3.1.18, factor D (equal to ABC) could alternatively be used to determine a new factor which could be investigated along with factors A, B and C using a total of eight experimental units (3.1.24) .

3.1.20

alias

factor effect (3.1.14) that is equal to another factor effect or a function of other factor effects

Note 1 to entry: A main effect (3.1.15) that is deliberately confounded in an experiment (3.1.1) with another factor effect (such as a high order interaction (3.1.17) is an alias of that factor effect; the effects are aliases of each other.

EXAMPLE:

Consider the 2³experiment (3.2.5) involving three factors (3.1.5) A, B and C and set D=AB and E=BCwhere all factors have levels of −1 or +1. With five factors, there are 5 main effects, A, B, C, D and E; there are 10 first-order interactions — ab, AC, …, de; there are 10 second-order interactions — ABC, ABD, …, CDE; there are 5 third-order interactions — ABCD, ABCE, …, BCDE; there is one fourth-order interaction — ABCDE. The levels of the factors indicate the settings of these factors in the experiment. In particular, the levels of the factors A, B, C, D and E are required to conduct the experiment. Table 2 provides the levels of these factors as well as the computed levels corresponding to some selected interactions.

Table 2 — Deliberately confounded factor effects

A	B	C	D	E	I=ABD=BCE=ACDE	AC=BCD=ABE=DE	AE=CD=ABC=BDE
−1	−1	−1	1	1	1	1	-1
1	−1	−1	−1	1	1	−1	1
−1	1	−1	−1	−1	1	1	1
1	1	−1	1	−1	1	−1	-1
−1	−1	1	1	−1	1	−1	1
1	−1	1	−1	−1	1	1	-1
−1	1	1	−1	1	1	−1	-1
1	1	1	1	1	1	1	1

The first five columns have aliases, as follows: A=BD=ABCE=CDE, B=AD=ce=ABCDE, C=ABCD=be=ADE, D=ab=BCDE=ACE, and E=ABDE=BC=ACD.
The first five columns of Table 2 are all that are needed to conduct the experiment. At the analysis stage, the entries are used to estimate the effects. For example, for factor A, the main effect of A is the average of the y_i ’s at the level of +1 for A (2^nd, 4^th, 6^th and 8^th) minus the average of the y_i ’s at the level of −1 for A (1^st, 3^rd, 5^th and 7^th). Columns in Table 2 with the same entries will produce the same value of an estimated effect. On inspection, columns D and ab are identical as well as columns E and BC. Columns D and E were deliberately constructed in this fashion. Columns B, AD, ce and ABCDE are seen to have identical entries and would produce numerically equal effect estimates. Likewise, columns AC, BCD, ABE and de are identical and consequently, these factors are aliases of each other. The entries in Table 2 could be augmented to include all other possible interactions. Upon inspection, it could be seen that aliases occur in groups of 4, as follows: (I, ABD, BCE, ACDE), (A, BD, ABCE, CDE), (B, AD, ce, ABCDE), (ab, D, ACE, BCDE), (C, ABCD, be, ADE), (AC, BCD, ABE, de), (BC, ACD, E, ABDE), and (ABC, CD, ae, BDE).
With five factors, there are a total of 32 effects to be estimated, including the overall mean term (i.e. intercept term) denoted by the capital letter “I”. Each possible effect is aliased with three other effects.

3.1.21

curvature

departure from a straight line relationship between the response variable (3.1.3) and a predictor variable (3.1.4)

Note 1 to entry: Curvature has meaning with quantitative predictor variables, but not with categorical (nominal) or non-quantitative (ordinal) predictor variables. Detection of curvature requires more than two levels of the factors. In some instances, replicated centre points (the factor set midway between the high and low settings of the factors) can allow the detection and assessment of curvature. Alternatively, an expanded range of the factor levels (3.1.12) can be necessary to observe curvature.

Note 2 to entry: Parabolic curvature can be readily modeled via a form such as:

mml_m22

If γ deviates from zero, there is evidence of parabolic curvature relative to the simple linear relation. More complex models are needed to express more complicated types of curvature.

3.1.22

contrast

linear function of the values of the response variables (3.1.3) for which the sum of the coefficients is zero when not all coefficients equal to zero

Note 1 to entry: With observations y₁, y₂,…, y_n , the linear function mml_m24 with not all a_i ’s equal to zero. The role of contrasts in the design of experiments is to compare and to investigate effects as given in the subsequent examples.

EXAMPLE 1:

A factor is applied at three levels and the results are represented by y_1,y₂ and y₃. Among the many possible questions to be asked, consider the difference in responses at the first and third level of the experiment (ignoring temporarily the intermediate level). The appropriate contrast for assessing this query is –y₁ and +y₃ (i.e. y_{3 −}y₁). If the levels are equally spaced, a second question may be asked as to whether there is evidence that the response pattern shows a quadratic curvature (3.1.21) rather than a simple linear trend. Here the average of y₁ and y₃ can be compared to y₂ (i.e. y₁ − 2 y₂ + y₃). (If there were no curvature, y₂ should fall close to the line connecting y₁ and y₃.) This example illustrates a regression type study of continuous variables. It is frequently more convenient to use integers rather than fractions for contrast coefficients. In such a case, the coefficients for Contrast 2 would appear as (−1, +2, −1).

EXAMPLE 2:

Another example dealing with discrete levels of a factor may lead to a different pair of questions. Let us suppose there are three sources of supply, one of which, A₁, uses a new manufacturing technique while the other two, A₂ and A₃, use the customary one.

Question 1: Does vendor A₁ with the new technique seem to differ from A₂ and A₃ using the old one? Contrast y₁ with the average of y₂ and y₃ (i.e. 2 y₁ – y₂ − y₃).
Question 2: Do the two suppliers using the customary technique differ? Contrast y₁ with y₃ (i.e. y₂ – y₃).The pattern of contrast coefficients is similar to that for the previous problem, though the interpretation of the results will differ.

3.1.23

orthogonal contrast

set of contrasts (3.1.22) whose coefficients satisfy the condition that, if multiplied in corresponding pairs, the sum of the products equals zero

Note 1 to entry: The purpose of orthogonal contrasts is to facilitate independent tests of hypotheses of interest in the experimental situation. In Example 1, the two tests can be conducted independently (one test has no bearing on the other and provided further that the residual error follows a normal distribution). In Example 2, on the other hand, the tests are dependent, indicating for instance, that a rejection in one test could suggest a more likely rejection of the other.

EXAMPLE 1:

The following two orthogonal contrasts are given:

	y₁	y₂	y₃
a_i₁ Contrast 1	–1	0	+1
a_i₂ Contrast 2	–1	2	–1
a_i1a_i2	+1	0	–1
Σ a_i₁a_i₂ = 0, therefore orthogonal

EXAMPLE 2:

The following two contrasts are not orthogonal:

	y₁	y₂	y₃
a_i₁ Contrast 1	–1	0	+1
a_i₂ Contrast 2	0	–1	+1
a_i₁a_i2	0	0	+1
Σ a_i₁a_i₂ = +1, therefore not orthogonal

3.1.24

experimental unit

basic unit of the experimental material

Note 1 to entry: The experimental unit could be the entire manufacturing process in which case a run corresponds to the set up of the process and the response variable (3.1.3) (possibly multivariate) corresponds to characteristics of the product. In another situation, the experimental unit could be individual subjects in a study (laboratory animals or human participants) each of which receives a particular experimental treatment.

Note 2 to entry: In agricultural settings, the basic unit of the experimental material is a plot of land. Although the agricultural setting suggests plot as a natural description of a experimental unit, the concept is more general and can be applied to other contexts. Plot is then replaced by the generic term experimental unit.

3.1.25

block

collection of experimental units (3.1.24)

Note 1 to entry: To be effective, the blocks should represent sets of experimental units that are homogeneous in some sense. The term “block” originated in agricultural experiments (3.1.1) in which a field was subdivided into sections having common conditions, such as exposure to the wind, proximity to underground water or thickness of the arable layer. In other situations, blocks are based on batches of raw material, operators, the number of units studied in a day, and so forth. More generally, blocks can consist of regions of a country, groups of factories, time frames (e.g. shifts in a manufacturing facility) and so forth.

Note 2 to entry: Generally, recognition of the existence of blocks may affect how the experimental treatments (3.1.13) of interest are assigned to experimental units. Operationally, additional “artificial” treatments are created within the model (3.1.2) that designate the assignment to blocks. The treatments thus consist of the factor effects (3.1.14) of direct interest in the study and the treatments that relate to blocking assignments. The idea is to enhance the possibility of recognizing important factor effects which would otherwise be obscured had the blocks not been established.

3.1.26

blocking

arrangement of experimental units into blocks (3.1.25)

Note 1 to entry: Within each block, the residual error (3.1.6) can be expected to be smaller than would be expected should a similar number of units be randomly assigned to the experimental treatment (3.1.13) without regard to blocks.

Note 2 to entry: Blocks are usually selected to allow for the effects of assignable causes, in addition to those introduced as factors to be studied (factor of primary interests), which it may be difficult, or even impossible to keep constant for all of the experimental units in the complete experiment. The effect of these assignable causes may be minimized within blocks, thus a more homogeneous experiment sub-space is obtained. The analysis of the experimental results must account for the effect of blocking the experiment.

3.1.27

designed experiment

experiment (3.1.1) with an explicit objective and structure of implementation

Note 1 to entry: The purpose of a properly designed experiment is to provide the most efficient and economical method of reaching valid and relevant conclusions from the experiment.

Note 2 to entry: Associated with a designed experiment is an experimental design (3.1.28) that includes the response variable (3.1.3) or variables and the experimental treatments (3.1.13) with prescribed factor levels (3.1.12) . A class of models that relates the response variable to the predictor variables could also be envisaged.

3.1.28

experimental design

assignment of experimental treatments (3.1.13) to each experimental unit (3.1.24)

Note 1 to entry: The assignment of experimental treatments could also include the time order or randomized order in which the treatments are applied.

Note 2 to entry: An experimental design can be considered as a scheme assigning experimental treatments (levels of a factor or a combination of such levels) involved in an experiment to the experimental units. The design matrix (3.2.25) includes the specified levels of each factor in the experiment (as well as other columns which give values of the predictor variables used at the analysis stage).

Note 3 to entry: An experimental design provides the determination as to how the observations/measurements should be taken to answer a research question in a valid, efficient and economical way.

Note 4 to entry: This definition does not preclude the possibility of designs that are known to be relatively inefficient (e.g. “one-factor-at-a-time” designs are experimental designs according to the definition but that does not suggest that they are to be recommended.

3.1.29

experimental plan

specification of the intended procedure for the implementation of an experiment (3.1.1)

Note 1 to entry: The experimental plan should ideally offer the possibility of providing the most efficient and economical method of reaching valid and relevant conclusions from a designed experiment (3.1.27) . The selection of an appropriate design for an experiment will depend on considerations such as the type of questions to be addressed, the degree of generality to be attached to the conclusions, the magnitude of the effect from which a high probability of detection (power) is desired, the homogeneity of the experimental units (3.1.24) and the cost and the method of performing the experiment. The experimental plan establishes a protocol for the conduct of the experiment.

Note 2 to entry: A properly designed experiment will frequently lead to effective results from relatively simple statistical analysis and interpretation of the results. However, a poorly designed experiment may not meet the experimental objectives in spite of sophisticated analyses of the results.

Note 3 to entry: The development of an experimental plan can be quite arduous. A detailed description of the process is given in Annex D.

3.1.30

randomization

strategy in which each experimental unit (3.1.24) has an equal chance of being assigned a particular experimental treatment (3.1.13)

Note 1 to entry: Randomization attempts to protect against biases due to causes not taken into account explicitly in the model (3.1.2) . Randomization may further neutralize potential temporal or spatial effects. The equal chance of assignment could be within a subset of the collection of experimental units.

Note 2 to entry: From a practical standpoint, sampling without replacement may govern the allocation of experimental units to treatments so that the final experimental unit drawn seemingly is not “randomly” chosen at this stage. However, at the onset of the allocation, each experimental unit had an equal chance of being chosen so that ultimately, a random allocation has occurred.

3.1.31

orthogonal array

set of experimental treatment (3.1.13) combinations, in which for every pair of factors, each treatment combination occurs the same number of times across the possible factor levels (3.1.12)

Note 1 to entry: The concept of strength in relation to orthogonal arrays arises with screening designs (3.2.8) , which is one possible use of orthogonal arrays. A design of strength d is a complete factorial design in any d factors. Strength 1 implies that the levels of each factor occur the same number of times (which is sometimes called a balanced factor). An orthogonal array has strength 2. The subset size d is known as the strength.

3.1.32

degrees of freedom

number of linearly independent effects that can be estimated

Note 1 to entry: Informally, the degrees of freedom are the number of quantities that are free to vary without restriction.

Note 2 to entry: Degrees of freedom is commonly associated with the denominator of a variance calculation. The value of the degrees of freedom is the sample size minus the number of constraints associated with the quantity being computed. Estimating the population mean by the sample mean in a variance calculation reduces the degrees of freedom by 1, yielding degrees of freedom of n−1, with n as the sample size. One degree of freedom is eliminated since knowing the sample mean and n−1 values from the sample establishes the remaining data value.

Note 3 to entry: Degrees of freedom are the parameters of certain theoretical distributions that occur as sample distributions in statistical estimation and testing; for example, the chi-square distribution, the F-distribution, and the t-distribution.

3.1.33

one-factor experiment

designed experiment (3.1.27) in which a single factor (3.1.5) is investigated as to its effect (if any) on the response variable (3.1.3)

Note 1 to entry: A model for a one-factor experiment is

mml_m25

where

y_ij	is the response variable for the jth replicate at the ith level of the factor;
µ_ij	is the mean response at the ith level of the factor;
ε_ij	is a random variable capturing all other effects and sources of variation.

This model relates the response variable y_ijtoµ_i (depending on the corresponding level of the factor) and a residual error ε_ij . Differences in the µ_i reflect the influence of the factor on the response variable (in this case the mean response value as a function of the level of the factor).
An alternative representation of this model is

mml_m26

where

y_ij	is the response variable;
µ	is the overall mean response;
α_i	is the incremental effect due to the ith level of the factor;
ε_ij	is a random variable capturing all other effects and sources of variation.

In the above representation, the α_i values are assumed to sum to zero.

3.1.34

two-factor experiment

designed experiment (3.1.27) in which two distinct factors (3.1.5) are simultaneously investigated for possible effects on the response variable (3.1.3)

Note 1 to entry: If the two factors operate without interacting with each other, the term main effect (3.1.15) necessarily still applies. Namely, for each factor the main effect is its contribution to the mean of the response variable. See Example 2 of 3.1.2.

3.1.35

k-factor experiment

multi-factor experiment

designed experiment (3.1.27) in which k distinct factors (3.1.5) are simultaneously investigated for possible effects on the response variable (3.1.3)

3.1.36

replication

multiple occurrences of a given treatment combination or setting of predictor variables (3.1.4)

Note 1 to entry: Experimental constraints may dictate that replications take place sequentially rather than in a randomized order. Informally, such a situation would correspond to repetition, but universal concurrence on this term does not exist. Hence, for purposes of this part of ISO 3534, replication will be the term involving the attainment of different response values for a fixed set of levels of predictor variables.

Note 2 to entry: A run is repeated a number of times in order to obtain a more reliable estimate than is possible from a single observation. The function of replication is two-fold: (a) it provides an estimate of the pure error, and (b) it adds to the confidence in the experimental results.

Note 3 to entry: Replication as used in this part of ISO 3534 should be distinguished from the concepts of repeatability and reproducibility given in ISO 3534-2:2006) which relate particularly to measurement systems analysis. In experimental situations relevant to this part of ISO 3534, replication includes contributions from the process itself in addition to measurement uncertainty.

3.1.37

cube point

vector of factor level (3.1.12) settings of the form (a₁, a₂, …, a_k ) where each a_i equals +1 or −1 as a notation for the coded levels of the factors (3.1.5)

Note 1 to entry: These points are precisely the type of points found in a two-level full factorial experiment (3.2.2) or fractional factorial experiment (3.2.3) with k factors. As many as 2 ^k cube points can be used in the context of a central composite design [see Example 1 in (3.2.19)].

EXAMPLE:

For constructing a central composite design for v-factors, take a 2 ^v factorial arrangement with factorial levels coded as +1 or −1 or a 2 ^v-p fractional factorial arrangement having at least resolution V.

3.1.38

star point

vector of factor level (3.1.12) settings of the form (a₁, a₂, …, a_k ) ここで, one a_i equals αor−α and the other a_i ’s equal 0, as notation for the coded levels of the factors (3.1.5)

Note 1 to entry: All star points have a single non-zero component equal to +αor−α. In k-factor central composite designs, typically a total of 2 ^k star points are employed.

EXAMPLE:

Star points are 2 axial points on the axis of each design variable at a distance of β from the design centre. As there are v-axes, this process yields 2 ^v star points of the form: (±β, 0, …, 0), (0, ±β, 0, …, 0), …, (0, 0, …, ±β). These points are also known as axial points and assist in the estimation of curvature of the surface.

3.1.39

centre point

vector of factor level (3.1.12) settings of the form (a₁, a₂, …, a_k ) ここで, all a_i equal 0, as notation for the coded levels of the factors (3.1.5)

Note 1 to entry: All entries of centre points are zero, so the vectors are of the form (0, 0, ... , 0) corresponding to the centre of the experimental design (3.1.28) in the coded variables. The number of these points, for example n₀, is chosen to achieve various objectives in response surface designs (3.2.19) . Centre points are sometimes replicated (3.1.36) to obtain an estimate of the pure error (3.1.9) of the process under investigation. Graphical depictions of two designs with star (five pointed star), cube (cubes on the corners) and centre points (⊕) are given in Figures 2 and 3.

Figure 2 — Two-dimensional illustration of star, cube and centre points

Figure 3 — Three-dimensional illustration of star, cube and centre points (one cube point hidden)

EXAMPLE:

The points which which represents the centre of experimental region. In coded levels these are generally denoted as (0, 0, …, 0). These are the points or experimental runs whose values of each factor are the medians of the values used in the factorial portion. This point is often replicated in order to improve the precision of the experiment. The number of centre points to be added also depends upon the design properties.

3.1.40

rotatability

characteristic of a designed experiment (3.1.27) for which the response variable (3.1.3) that is predicted from a fitted model (3.1.2) has the same variance at all equal distances from the centre of the design

Note 1 to entry: A design is rotatable if the variance of the predicted response at any point x depends only on the distance of x from the centre point (3.1.39) . A design with this property can be rotated around its centre point without changing the prediction variance at x .

Note 2 to entry: Rotatability is a desirable property for response surface designs (3.2.19) .

3.2 Arrangements of experiments

3.2.1

factorial experiment

designed experiment (3.1.27) with one or more factors (3.1.5) and with at least two levels applied for one of the factors

Note 1 to entry: The term “factorial experiment” is more general than full factorial experiment (3.2.2).

Note 2 to entry: Crossed factors: two factors are crossed if every level of one occurs with every level of the other in the experiment (3.1.1). Nested factors: factor A is nested within another factor B if the levels or values of A are different for every level or value of B. Nested factors or effects have a hierarchical relationship. (See 3.2.21).

3.2.2

full factorial experiment

factorial experiment (3.2.1) consisting of all possible combinations of the levels of the factors (3.1.5)

Note 1 to entry: All interactions (3.1.17) and main effects (3.1.15) can be estimated from a full factorial experiment.

Note 2 to entry: A full factorial experiment is usually described symbolically as the product of the number of levels of each factor. For example, an experiment based on 3 levels of factor A, 2 levels of factor B and 4 levels of factor C would be referred to as a 3 × 2 × 4 factorial. The product of these numbers (24 in this case) indicates the total number of distinct runs.

Note 3 to entry: When a full factorial experiment includes factors all having the same number of levels, the description is usually given in terms of the number of levels raised to a power equal to the number of factors, k. Thus, an experiment with two factors each at three levels would be referred to as a 3² full factorial (k being equal to 2) and requires 9 experimental units which are given different experimental treatments.

3.2.3

fractional factorial experiment

factorial experiment (3.2.1) consisting of a subset of the full factorial experiment (3.2.2)

Note 1 to entry: Typically, the fraction is a simple proportion of the full set of possible experimental treatment combinations. For example, half-fractions, quarter-fractions, and so forth are common.

Note 2 to entry: All interactions (3.1.17) and main effects (3.1.15) cannot be estimated from a fractional factorial experiment.

Note 3 to entry: Fractional factorial designs are experimental designs (3.1.28) consisting of a carefully chosen subset (fraction) of the experimental runs of a full factorial design. The subset may be chosen so as to exploit the main effects and low-order interactions to expose information about the most important features of the problem studied, while using a fraction of the effort of a full factorial design in terms of experimental runs and resources and thereby yielding screening designs (3.2.8) . In other situations, the subset may be chosen to account for inhomogeneity among the experimental units, thereby yielding, for example, Latin square (3.2.11) or Graeco-Latin square designs (3.2.12) .

3.2.4

two-level experiment

full factorial experiment (3.2.2) in which all factors (3.1.5) assume at most two factor levels (3.1.12)

3.2.5

2 ^k factorial experiment

full factorial experiment (3.2.2) with k factors (3.1.5) , each at two factor levels (3.1.12)

Note 1 to entry: Two level full factorial experiments are full factorial experiments in which each of the p available factors is investigated at only two levels. The early stages of experimentation usually involve the investigation of a large number of potential factors to discover the “vital few” factors. Two level factorial experiments are used during these stages to quickly filter out unwanted effects so that attention can then be focused on the important ones.

EXAMPLE:

A 2⁴ factorial experiment may be appropriate for investigating the effect of four factors on the process yield: pressure, temperature, catalyst and operator. Let A be the pressure (low or high), B be the factor temperature (low or high), C represent the catalyst (presence or absence) and D correspond to the operator (one of two).

Table 3 — 2⁴ factorial experiment

Experimental unit	Treatment	A	B	C	D
1	(1)	−	−	−	−
2	a	+	−	−	−
3	b	−	+	−	−
4	ab	+	+	−	−
5	c	−	−	+	−
6	ac	+	−	+	−
7	bc	−	+	+	−
8	abc	+	+	+	−
9	d	−	−	−	+
10	ad	+	−	−	+
11	bd	−	+	−	+
12	abd	+	+	−	+
13	cd	−	−	+	+
14	acd	+	−	+	+
15	bcd	−	+	+	+
16	abcd	+	+	+	+

A 2⁴ factorial experiment consists of 16 different experimental treatments (3.1.13) , as listed in Table 3. The symbols “–” and “+” denote the two possible levels for each factor. Frequently, minus refers to a low level of a factor, while plus implies the high level; however, the specification of symbols to levels is arbitrary.
The order presented in the table above is known as standard Yates' order, which may be useful at the analysis stage, if calculations are performed manually. The actual order in which these treatments are performed should be determined by randomization (3.1.30) . The first factor A is listed with alternating signs (–,+,–,+ and so forth). The second factor B alternates two minuses and two pluses. Factor C alternates sets of four minuses and four pluses. Finally, factor D is set at minus for experimental units 1 through 8, and plus for experimental units 9 through 16. In the latter part of this part of ISO 3534, the minus sign is designated as −1 and the plus sign as +1.
The second column of the table above illustrates an alternative notation for describing treatments. The presence of a lower-case letter indicates that the level of the corresponding upper-case factor is at the high level; furthermore, absence of a letter implies the corresponding factor is at the low level. The case in which all factors are at the low level is denoted “(1)”.
A full factorial experiment allows the estimation (but not testing) of all main effects and interactions unambiguously. In the 2⁴ case, there are four main effects (A, B, C, D), six two-way (first-order) interactions (ab, AC, AD, BC, BD, CD), four three-way (second-order) interactions (ABC, ABD, ACD, BCD) and one four-way (third-order) interaction (ABCD). In practice, the three-way and four-way interactions are sometimes assumed to be negligible and thus offer the opportunity for estimating the residual error (3.1.6) with these degrees of freedom. Alternatively, some replication could also provide the opportunity for testing.
Each of the effects (for example, effect due to A, interaction between A and B, even four-way interaction among A, B, C and D), can be estimated using the contrast coefficients as given in Table 4.
For example, to estimate the main effect of A, the formula is (−1)y₁+(1)y₂+…+ (1)y₁₆ここで, the responses have been associated with the order given in the table.

Table 4 — Design matrix for a 2⁴ full factorial design

I	A	B	C	D	ab	AC	AD	BC	BD	CD	ABC	ABD	ACD	BCD	ABCD	y
1	−1	−1	−1	−1	1	1	1	1	1	1	−1	−1	−1	−1	1	y₁
1	1	−1	−1	−1	−1	−1	−1	1	1	1	1	1	1	−1	−1	y₂
1	−1	1	−1	−1	−1	1	1	−1	−1	1	1	1	−1	1	−1	y₃
1	1	1	−1	−1	1	−1	−1	−1	−1	1	−1	−1	1	1	1	y₄
1	−1	−1	1	−1	1	−1	1	−1	1	−1	1	−1	1	1	−1	y₅
1	1	−1	1	−1	−1	1	−1	−1	1	−1	−1	1	−1	1	1	y₆
1	−1	1	1	−1	−1	−1	1	1	−1	−1	−1	1	1	−1	1	y₇
1	1	1	1	−1	1	1	−1	1	−1	−1	1	−1	−1	−1	−1	y₈
1	−1	−1	−1	1	1	1	−1	1	−1	−1	−1	1	1	1	−1	y₉
1	1	−1	−1	1	−1	−1	1	1	−1	−1	1	−1	−1	1	1	y₁₀
1	−1	1	−1	1	−1	1	−1	−1	1	−1	1	−1	1	−1	1	y₁₁
1	1	1	−1	1	1	−1	1	−1	1	−1	−1	1	−1	−1	−1	y₁₂
1	−1	−1	1	1	1	−1	−1	−1	−1	1	1	1	−1	−1	1	y₁₃
1	1	−1	1	1	−1	1	1	−1	−1	1	−1	−1	1	−1	−1	y₁₄
1	−1	1	1	1	−1	−1	−1	1	1	1	−1	−1	−1	1	−1	y₁₅
1	1	1	1	1	1	1	1	1	1	1	1	1	1	1	1	y₁₆

3.2.6

2 ^{k −p} fractional factorial experiment

fractional factorial experiment (3.2.3) the size of which is a 2 ^−p fraction of the size of the 2 ^k factorial experiment (3.2.5)

Note 1 to entry: For a large number of factors (3.1.5) , 2 ^k may require more runs than are feasible. Through careful selection, nearly the same amount of information can be obtained from the fractional factorial experiment as the full factorial experiment (3.1.2) . In particular, the selection is typically made so that main effects (3.1.15) and interactions (3.1.17) that are expected to be of practical importance are confounded (3.1.19) only with interactions expected to be negligible.

Note 2 to entry: For p equal to 1, the resulting fractional factorial experiment is a half-fraction; for p equal to 2, the resulting fractional factorial experiment is a quarter-fraction; and so forth.

EXAMPLE:

Consider an experiment (3.1.1) with 6 factors and with 16 runs. See Table 5. This example illustrates the construction of an experimental design that uses confounding in order to examine all 6 factors with 16 experimental treatment combinations. In particular, a 2^6-2 fractional factorial design is constructed. The levels of four of the factors (A, B, C and D) can be set as if a full factorial experiment were to be run. In this full factorial context, all main effects and high order interactions can be estimated (e.g. the two-way interactions ab, AC, AD, BC, BD, CD; the three-way interactions ABC, ABD, ACD and BCD; and the four-way interaction ABCD). In practice, the three- and four-way interactions are rarely important, but they can certainly be estimated from the data. In light of the unlikely importance of these higher order interactions (three-way and four-way), investigators have realized that the other factors (say E and F) could be incorporated into the experiment at this design stage by assigning as levels for E and F, particular choices of high order interactions. For example, the level of E could be assigned to correspond to the three-way interaction ABC while the level of F could be assigned to the three-way interaction BCD. This assignment ensures that the estimate of the three-way interaction ABC is identical to the estimate of the newly-assigned factor E, since they use the same contrast for estimation. However, in light of the practical and common occurrence that ABC is likely to be near zero, the investigator could presume or conclude that the estimate of ABC and E is effectively an estimate of E alone (i.e. presumes that the ABC interaction is zero).

Table 5 — One-quarter fraction layout

	A	B	C	D	E=ABC	F=BCD
1	−1	−1	−1	−1	−1	−1
2	1	−1	−1	−1	1	−1
3	−1	1	−1	−1	1	1
4	1	1	−1	−1	−1	1
5	−1	−1	1	−1	1	1
6	1	−1	1	−1	−1	1
7	−1	1	1	−1	−1	−1
8	1	1	1	−1	1	−1
9	−1	−1	−1	1	−1	1
10	1	−1	−1	1	1	1
11	−1	1	−1	1	1	−1
12	1	1	−1	1	−1	−1
13	−1	−1	1	1	1	−1
14	1	−1	1	1	−1	−1
15	−1	1	1	1	−1	1
16	1	1	1	1	1	1

The expressions E=ABC and F=BCD are generating relations, because they generate the appropriate levels of the factors E and F in terms of the factors A, B, C and D. Another useful construction is given as
I = ABCE = BCDF = ADEF
that is known as the defining relation for this design. The term “I” represents the identity column (all entries equal to +1). As will be seen shortly, this defining relation contains all of the information on confounding associated with this particular experimental design. Using the conventions ia = AI = A, IB = bi = B, I = A²= B²= C² and so forth, the generating relation E = ABC is equivalent to EE = ABCE, which in turn is equivalent to I = ABCE. Similarly, F = BCD leads to I = BCDF. The defining relation is completed by evaluating the generalized interaction ABCE × BCDF = ABCEBCDF = ABBCCDEF = AIIDEF = ADEF. Hence, the defining relation is I = ABCE = BCDF = ADEF.

Note 3 to entry: A 2 ^k−p fractional factorial experiment is constructed by considering the k factors to be in two groups, a primary one with k−p factors and a secondary one with p factors. The k−p factors in the primary group are allocated to a full factorial with 2 ^k−p experimental units which are the number of experimental units of the design. The levels of each of the factors of the secondary group for each experimental unit are defined in terms of levels of factors of the primary group. The set of p equations that define the factors of the secondary group in terms of the factors of the primary group is called the generating relation, because it generates the design. The p equations of the generating relation can be used to calculate the 2 ^p − 1 equations of the defining relation that can be used to determine properties of the design. In the previous example, k=6, p=2; the primary factors were A, B, C and D and the secondary group were E and F; the generating relations were E=ABC and F=BCD; the defining relation was I = ABCE = BCDF = ADEF.

3.2.7

design resolution

length of the shortest word in the defining relation associated with a 2 ^k-p fractional factorial experiment (3.2.6)

Note 1 to entry: Design resolution indicates the extent of aliasing (3.1.20) among main effects (3.1.15) and two-way and higher order interactions (3.1.17) .

Note 2 to entry: The design resolution describes the aliasing in a particular experimental design. The numerical length is generally given by upper case Roman numerals. The three most common practical situations are resolutions III, IV and V.

For a resolution III design, main effects are not aliased with other main effects. This observation can be made by examining the expressions in the defining relation. For example, I = ABD = BCE = ACDE includes the expressions I, ABD, BCE and ACDE and counting the number of letters for each term (1, 3, 3, and 4, respectively for this example). The shortest length of these aside from 1 (corresponding to I) is 3, which is known as the length of the shortest “character string”. At least one main effect is aliased with a two-way interaction. For example, for I = ABD, it would be the case that A is confounded with BD, B is confounded with AD and D is confounded with ab.
For a resolution IV design, main effects are not aliased with other main effects or with any two-way interactions. At least one two-way interaction is aliased with another two-way interaction. For example, the defining relation I = ABCE = BCDF = ADEF includes the expressions I, ABCE, BCDF and ADEF; counting the number of letters for each term gives 1, 4, 4, and 4, respectively. The smallest value aside from 1 (corresponding to I) is 4, which is known as the length of the shortest “character string” for this design. For example, for I = ABCE, it would be the case that ab is confounded with ce, AC with be, ae with BC, while A is confounded with BCE, B is confounded with ACE, C is confounded with ABE and E is confounded with ABC.
For a resolution V design, main effects and two-way interactions are not aliased with any other main effects or with any other two-way interactions. For example, with I = ABCDE, it is clear that each main effect is confounded with a 4-way interaction (A is confounded with BCDE) and each two-way interaction is confounded with a 3-way interaction (ab is confounded with CDE).

Note 3 to entry: The higher the resolution, the more effects (main or interactions) can be estimated unambiguously provided the higher order interactions are negligible. Given a choice of two potential designs involving the same number of factors and experimental units, the design with the higher resolution should be selected. Fortunately, for most cases of k and p of practical interest, the most appropriate defining relations are recorded (see, for example, Reference [2], p. 410; or Reference [3], p. 272).

Note 4 to entry: Full factorial designs (3.2.1) have no confounding. For most practical purposes, a resolution V design is excellent and a resolution IV design may be adequate. Resolution III designs are useful as economical screening designs (3.2.8) .

Note 5 to entry: Factor names are presumed to be expressed by single letters for purposes of the definition.

EXAMPLE:

Table 2 in (3.1.20) illustrated a case with a Resolution III design having defining relation I=ABD=BCE=ACDE.

3.2.8

screening design

designed experiment (3.1.27) that identifies a subset of factors (3.1.5) for further study

Note 1 to entry: Such experimental designs (3.1.28) generally focus on the investigation of main effects (3.1.15) , with the presence of interactions (3.1.17) leading to complications in the analysis, and possibly the need for additional experimental treatments (3.1.13) to resolve ambiguities.

Note 2 to entry: The primary purpose of an experiment with a screening design is to select or to screen out the few important main effects from the many less important ones. Screening designs are also sometimes termed main effects designs.

EXAMPLE 1:

The 2 ^k-p fractional factorial designs (3.2.6) (especially those that are highly fractionated) can be viewed as screening designs. Consider the example in 3.2.6 with additional generating relations of G=ACD, H=ABD and J=ABCD. The corresponding design is a 2⁹⁻⁵ fractional factorial design intended to study 9 factors in only 16 treatment combinations. It would be hoped at the analysis stage to identify the leading 2 or 3 predictor variables (associated with main effects) for further investigation.

EXAMPLE 2:

Plackett and Burman [4] proposed a collection of two-level screening designs with the number of runs as a multiple of four. Their designs are commonly chosen for situations in which the number of main effects under investigation approaches the number of runs (3.1.13) . For example, the 12-run Plackett-Burman design given in Table 6 can be used to screen for as many as 11 main effects. However, for this design the presence of two-way interactions (for example, ab) undermines the estimation of main effects C, D, ..., K.

Table 6 — Layout of Plackett-Burman 12-run design

Trial	A	B	C	D	E	F	G	H	I	J	K
1	1	−1	1	−1	−1	−1	1	1	1	−1	1
2	1	1	−1	1	−1	−1	−1	1	1	1	−1
3	−1	1	1	−1	1	−1	−1	−1	1	1	1
4	1	−1	1	1	−1	1	−1	−1	−1	1	1
5	1	1	−1	1	1	−1	1	−1	−1	−1	1
6	1	1	1	−1	1	1	−1	1	−1	−1	−1
7	−1	1	1	1	−1	1	1	−1	1	−1	−1
8	−1	−1	1	1	1	−1	1	1	−1	1	−1
9	−1	−1	−1	1	1	1	−1	1	1	−1	1
10	1	−1	−1	−1	1	1	1	−1	1	1	−1
11	−1	1	−1	−1	−1	1	1	1	−1	1	1
12	−1	−1	−1	−1	−1	−1	−1	−1	−1	−1	−1

Note 3 to entry: Many of the Plackett-Burman designs are related to Hadamard matrices, developed originally in a theoretical context, but later recognized to be useful in experimental designs. The Hadamard matrices are readily constructed from the knowledge of a single column (or equivalently, row) alone. As one possible construction, let the bottom row consist solely of minus signs. The remaining column entries are obtained from the first column by shifting it to the right and dropping it one row, the eleventh entry moving up to row position one. This procedure is continued across the columns until the matrix is completed. Examples of some of these special first columns to construct the matrices are given below. For each construction, it is sufficient to indicate the location of plus signs in the first column, as follows:

n	rows in column one containing plus sign
1	21, 2, 4, 5, 6, 10
20	1, 2, 5, 6, 7, 8, 10, 12, 17, 18
24	1, 2, 3, 4, 5, 7, 9, 10, 13, 14, 17, 19

Note that the rows indicated above for n = 12 agree with the design detailed in Example 2. Many of the Plackett-Burman designs can be constructed in this general fashion using the elements of a single column as the basis. The cases with n = 28, 52, 76, 92 and 100 cannot be constructed from a single special column of plus ones and minus ones (for details, see Reference [3]). Alternatively, some statistical software packages can provide Plackett-Burman design matrices (3.2.25) .

Note 4 to entry: Aguchi’s orthogonal array (3.1.31) L12 is equivalent to the Plackett-Burman 12-run design given above. L20 is equivalent to the Plackett- Burman 20-run design. As a word of caution, the L array convention typically gives the entries of the design matrix in a different order than that provided by the Hadamard construction.

Note 5 to entry: Plackett-Burman designs can be adapted for use in a supersaturated setting (i.e. cases with more factors than experimental runs). For further details on supersaturated designs, see References [5] and [6].

3.2.9

block design

designed experiment (3.1.27) that uses blocks (3.1.25)

Note 1 to entry: A block design takes advantage of the homogeneity of subsets of the experimental units (3.1.24) . Inhomogeneity among experimental units, if ignored in the experimental design, can reduce the amount of information obtained from the experiment (3.1.1) by increasing the observed variation. Accounting for this situation in the design can increase the experiment's capability of meeting the design objective. Block designs are intended to counteract the impact of known nuisance effects or effects that contribute to the response but are not inherently of interest relative to the objective of the experiment. If the presumption of homogeneity is incorrect, then blocking incurs a cost in reducing the degrees of freedom.

3.2.10

randomized block design

block design (3.2.9) with an arrangement of ν experimental treatments (3.1.13) in b blocks (3.1.25) such that each treatment appears exactly once in each block

Note 1 to entry: These designs are useful for one-way elimination of heterogeneity setting, i.e. when the heterogeneity in the experimental units is caused by a single factor and levels of this factors causing heterogeneity is same as the number of treatments.

Note 2 to entry: Since each of the treatments appear in each block, therefore, it is a complete block design and a better term for this is “randomized complete block design”. The number of replications of treatments are same as number of blocks. The treatments are randomly allocated to each block.

Note 3 to entry: This is the simplest design using all the three principles of design as given by Fisher (replication, randomization and local control).

3.2.11

Latin square design

fractional factorial design (3.2.3) with an arrangement of v² runs (3.1.13) in v² positions arranged in v rows and v columns, such that every symbol occurs precisely once in each row and precisely once in each column

Note 1 to entry: Latin square designs can investigate at most three factors (rows, columns and letters). The symbol v represents the order of the Latin square.

Note 2 to entry: If the symbols are taken as A, B, C, D, a Latin square arrangement of order 4 is as follows:

A	B	C	D
B	C	D	A
C	D	A	B
D	A	B	C

Latin square designs are normally used in experiments where it is required to remove the heterogeneity of experimental material in two directions, i.e.., for the experimental situations where the heterogeneity in the experimental material is caused by two factors levels of which are cross classified with each other. The levels of the factors causing heterogeneity in the experimental units are same as that of treatments. These designs require that the number of replications equal the number of treatments . In this example a total of 16 treatment combinations occur whereas there would be 64 treatment combinations for a full factorial design (3.2.1)

Note 3 to entry:Randomization by allocation. According to the definition of a Latin square design, treatments can be allocated to the ν² experimental units in a number of ways. There are, therefore, a number of Latin squares of a given order. They can be in standard form (if the symbols in the first row and first column are in natural order) or semi-standard form (if the symbols of the first row are in natural order). The purpose of randomization is to select one of these squares at random. The following is one of the methods of random selection of Latin squares.
Let a ν × ν Standard Latin square arrangement be first written by denoting treatments by Latin letters A, B, C, etc. or by numbers 1, 2, 3, etc. Such arrangements are readily available in the Tables for Statisticians and Biometricians^[7] . One of these squares of any order can be written systematically as shown below for a 5×5 Latin square:

A	B	C	D	E
B	C	D	E	A
C	D	E	A	B
D	E	A	B	C
E	A	B	C	D

For the purpose of randomization rows and columns of the Latin square are rearranged randomly. There is no randomization possible within the rows and/or columns.
For example, the following is a row randomized square of the above 5×5 Latin square;

A	B	C	D	E
B	C	D	E	A
E	A	B	C	D
D	E	A	B	C
C	D	E	A	B

Next, the columns of the above row randomized square have been rearranged randomly to give the following random square:

E	B	C	A	D
A	C	D	B	E
D	A	B	E	C
C	E	A	D	B
B	D	E	C	A

As a result of row and column randomization, but not the randomization of the individual units, the whole arrangement remains a Latin square.

Note 4 to entry: A basic assumption is that these block factors do not interact (3.1.17) with the factor of most interest under study, or among themselves. If this assumption is not valid, the measure of residual error (3.1.6) is increased and the effect of the factor confounded with such interactions (3.1.17) . Sometimes other factors of interest are used in the block positions so that there may be three factors without any designation of being block factors. This is equivalent to a fractional factorial experiment (3.2.3) with the assumption of no interactions. Some designs of fractional factorial experiments form Latin squares and it may be more desirable to approach the problem from the fractional factorial experiment viewpoint in order to more fully understand the assumptions being made concerning interactions.

3.2.12

Graeco-Latin square design

fractional factorial design (3.2.3) involving 4 factors (3.1.5) , each at h levels (3.1.12) , in which the combination of the levels of any one of the factors with the levels of the other three factors appears once and only once in an experiment of size h²

Note 1 to entry: A Graeco-Latin square design involves four factors and there are h² (h ≥ 3, a positive integer) experimental units (3.1.24) classified according to three block (3.1.25) factors (for example, row factor, column factor and Greek letter) each factor having h levels. There are h levels of the factor of primary interest which are allocated to the h² experimental units at random in such a manner that each experimental treatment (3.1.13) appears in every row and column precisely once and also appears with a Greek letter precisely once.

Note 2 to entry: Two Latin squares are said to be orthogonal, if each letter in one square coincides exactly once with each letter of the other square. Pairs of Latin square designs that are orthogonal can be combined to produce Graeco-Latin square designs.

Note 3 to entry: Graeco-Latin square designs allow the incorporation of three blocking variables, all of which have the same number of levels as the number of levels of the factor of primary interest.

EXAMPLE:

An example of a 4 × 4 Graeco-Latin Square is as follows:

Aα	Bβ	Cγ	Dδ
Bδ	Aγ	Dβ	Cα
Cβ	Dα	Aδ	Bγ
Dγ	Cδ	Bα	Aβ

Factor 1 is given by the rows. Factor 2 is given by the columns. Factor 3 is represented by the Latin letters. The principal (and fourth) factor is represented by the Greek letters.

3.2.13

incomplete block design

block design (3.2.9) with at least one block (3.1.25) containing less than all of the experimental treatments (3.1.13)

Note 1 to entry: In effect, a randomized block design (3.2.10) can be construed as a “complete” block design emphasizing that a sufficient number of experimental units are available within each block to accommodate the required number of treatments.

Note 2 to entry: If material is divided into blocks and it is desired to allocate certain treatments to the block, the treatments may be too numerous for them to appear in each block. When a block contains less than a complete replication of the treatments, it is called incomplete.

3.2.14

balanced incomplete block design

BIBD

incomplete block design (3.2.13) in which each block (3.1.25) contains the same number k of different levels (3.1.12) from the l levels of the factor of primary interest (3.1.5) arranged so that every pair of levels occurs in the same number λ of blocks from the b blocks

Note 1 to entry: The term “balanced” refers to consistent number of pairings, “incomplete” refers to the inability to examine every level of the factor of primary interest in each block, and “block” refers to the strategy of conducting the experiments (3.1.1) on homogeneous sets of experimental units (3.1.24) .

EXAMPLE 1:

Consider a situation with 4 experimental treatments and 6 blocks, 2 treatments per block (l = 4, k = 2, b = 6, λ = 1). More specifically, suppose four levels (T₁, T₂, T₃, T₄) of the factor of primary interest are to be studied, but only two levels can be done in one day. If six days are available to conduct the experiment, then the following plan is appropriate:

Day	T₁	T₂	T₃	T₄
1	∗	∗
2			∗	∗
3	∗		∗
4		∗		∗
5	∗			∗
6		∗	∗

In this example, all possible pairs of treatments occur once in the same block.

EXAMPLE 2:

Consider a situation with 6 levels of the factor of primary interest and 10 blocks with 3 levels per block (l = 6, k = 3, b = 10, λ = 2). In this case, it is natural to suspect that 20 blocks are in fact necessary since there are 20 possible triplets for 6 levels. Consider the following set of treatments ここで, each block is given by a triplet:
(T₁,T₂,T₃), (T₁,T₂,T₄), (T₁,T₃,T₅), (T₁,T₄,T₆), (T₁,T₅,T₆)
(T₂,T₃,T₆), (T₂,T₄,T₅), (T₂,T₅,T₆), (T₃,T₄,T₅), (T₃,T₄,T₆)
Here, each pair of levels occurs exactly twice in the set of 10 blocks, but only once in the same block, indicating that 10 blocks may suffice.

EXAMPLE 3:

Consider a situation with 7 levels and 7 blocks with 4 levels per block (l = 7, k = 4, b = 7, λ = 2).

	Levels of the factor of primary interest
Block	1	1	2	3	6
	2	2	3	4	7
	3	3	4	5	1
	4	4	5	6	2
	5	5	6	7	3
	6	6	7	1	4
	7	7	1	2	5

Note 2 to entry: The balanced incomplete block design implies that every level of the principal factor appears the same number of times h in the experiment and that the following relations hold true:
bk = lh, b ≥ l and h(k − 1) = λ(l − 1).
Since each letter in the above equations represents an integer, it is clear that only a restricted set of combinations (l, k, b, h, λ) is possible for constructing balanced incomplete block designs. However, given five integers (l, k, b, h, λ) satisfying the above three conditions, it is not necessarily the case that a BIB exists.

Note 3 to entry: For randomization (3.1.30) , arrange the blocks and levels within each block independently at random.

3.2.15

partially balanced incomplete block design

PBIBD

incomplete block design (3.2.13) in which each block (3.1.25) contains the same number k of different levels (3.1.12) from the l levels of the principal factor (3.1.5) , arranged so that not all pairs of levels occur together in the same number of the b blocks

Note 1 to entry: An incomplete block design with l levels and b blocks is a PBIBD with m (≥ 2) associate classes, if the following conditions are satisfied:

a) each block contains k (< l) distinct levels;
b) each level appears in h blocks;
c) there exists a relation among levels satisfying:
- any two levels are either 1st, 2nd, ..., mth associates, the relation being symmetric (if level α is the ith associate of level β, then level β is the ith associate of level α);
- each level has ni ith associates, i = 1, 2, ..., m; the number ni being independent of the level chosen;
d) any two levels that are mutually ith associates appear together in λ_i blocks (i = 1, 2, ..., m) not all λ_i s are equal.

Note 2 to entry: The integers l, b, h, k, λ, λ, ..., λ_m , n₁, n₂, ..., n_m , mml_m29

, i, j, k = 1, 2, ..., m are connected by the following:

mml_m30

EXAMPLE:

Consider a situation having l = 6, k = 4, b = 6, h = 4, n₁= 1, n₂= 4, λ₁= 4, λ₂= 2 as depicted, as follows:

	Levels of the factor of primary interest
Block	1	1	4	2	5
	2	2	5	3	6
	3	3	6	1	4
	4	4	1	5	2
	5	5	2	6	3
	6	6	3	4	1

In this design, every level occurs h = 4 times and starting with any level (for example, level 1), n₁= 1 level (for example, level 4) is found appearing together with level 1 in λ₁= 4 blocks and n₂= 4 levels (numbers 2, 3, 5 and 6) and with level 1 in λ₂= 2 blocks. Parameters n₁, n₂, λ₁ and λ₂ are the same regardless of the starting level.

3.2.16

Youden square design

block design (3.2.9) derived from certain Latin square designs (3.2.11) by deleting or adding rows (or columns) so as to obtain a randomized block design (3.2.10) with respect to one blocking (3.1.26) factor (3.1.5) and a balanced incomplete block design (3.2.14) with respect to the other factor

Note 1 to entry: A Youden square can be considered as a design with two blocking factors associated with the rows and columns of a matrix whose entries represent the levels of the factor of primary interest. Assume, for example, that this layout has the same number of columns as levels, but fewer rows than columns. Each level would appear once in each row resulting in a randomized block design with respect to the row block factor. However, by focusing attention on the column block factor, a balanced incomplete block design would occur. The elimination of the 4th row of the 4 × 4 Latin square yields a 3 × 4 Youden square.

EXAMPLE 1:

The factor of primary interest has four levels (A, B, C and D) and there are two block factors—one with 4 levels (columns) and one with three levels (rows). One possible layout is as follows:

	Block Factor 1 (columns)
	1	A	D	C	B
Block Factor 2 (rows)	2	B	A	D	C
3	C	B	A	D
Row not included		D C B A

Block factor 2 (columns) A, B, C and D indicate the four levels of the factor of primary interest.

EXAMPLE 2:

The following array depicts a 4 × 7 Youden square:

3	4	5	6	7	1	2
5	6	7	1	2	3	4
6	7	1	2	3	4	5
7	1	2	3	4	5	6

For this example, it can be seen that the rows form a randomized block design and the columns form a BIB design with parameters l = b = 7, h = k = 4, and λ = 2.

3.2.17

split-plot design

designed experiment (3.1.27) in which the group of experimental units (3.1.24) (“plot”) to which the same factor level (3.1.12) assigned to the principal factor is subdivided (“split”) so as to study one or more additional principal factors within each level of that factor

EXAMPLE:

Three levels of factor A are tested in two sets of replicates. Within each level of A, the same two levels of factor B are studied, as shown in Figure 4.

Figure 4 — Split-plot design

Note 1 to entry: In the example, replicates (3.1.36) serve the role of blocks (3.1.25) to the first-stage factor of primary interest (A) and each plot assigned to one of the three levels of A serves the role of blocks for the additional second-stage factor of primary interest B (within plot factor) studied within A. Thus, the estimated residual error (3.1.6) for the within-plot factor B should be smaller than that for the full experiment (3.1.1) . In a split-plot design, different measures of residual error are obtained for the within-plot and the between-plot effects. It is possible to extend this design further in order to introduce a third-stage factor included in the levels of the second-stage factor. This type of design is frequently used where large series or areas are obtainable for a factor, the levels of which are not easily changed, and the other factors can be varied readily within the series or areas.

Note 2 to entry: This type of arrangement is common in industrial experimentation as well as in agriculture (whence the name is derived). Frequently, one series of factor levels requires a large experimental unit, while another series of factor levels can be compared with smaller experimental units. For instance, it would require larger amounts of alloy to compare different types of furnaces used to prepare an alloy than it would to compare different types of molds into which the alloy is poured. The types of furnaces are regarded as the levels of the first-stage factor and the types of molds as the levels of the second-stage (within-plot) factor.
Another example is a large machine the speed of which can be changed only by replacing the gear train. This is a time-consuming and expensive task and it would be preferable to make infrequent changes to this factor. The material manufactured at each speed can be heat-treated by several techniques, shaped under varying pressures and smoothed using different polishing agents with relative ease of these shifting from one level of factors to another. These latter constitute the within-plot factors (or second-stage factors) while the speed variations constitute the between plot factor (or first-stage factor).

3.2.18

two-way split-plot design

split-block design

split-plot design (3.2.17) in which the levels of the second stage factor (3.1.5) , instead of being randomized (3.1.30) independently within each plot, are arranged in strips across plots in each replication; thus, it is considered as a split-plot design in two different ways

EXAMPLE:

For a 3 × 4 design, the appropriate arrangements (after randomization) may be as shown in Figure 5.

Figure 5 — Two-way split-plot layout

Note 1 to entry: The two-way split-plot design results in lower precision for the main effects (average effects) of A and B but provides higher precision for the interactions (differential effects). These latter are generally more accurately determined than in either randomized blocks or the ordinary split-plot design. In industrial experimentation, practical considerations sometimes necessitate the use of two-way split-plot designs, for example, in the textile industry, factor A may be different procedures of bleaching by chlorine peroxide and factor B may be different amounts of hydrogen peroxide in the cooling process.

3.2.19

response surface design

designed experiment (3.1.27) that identifies a subset of factors (3.1.5) to be optimized

Note 1 to entry: Response surface designs characterize the relationship between the response variable and a set of predictor variables (3.1.4) by estimating the response surface itself. The “response surface” can be thought of as a conditional expectation of the response variables (assuming the predictor variables are fixed but can vary through the design space). One type of response surface design is an experimental design of factorial type intended to give approximately uniformly efficient estimates of the response surface over the range of the factors. Response surface designs include experimental designs that explore the curvature (3.1.21) of the response surface. These designs achieve this by using a quadratic regression equation rather than the linear form of the regression equation used in factorial designs (3.2.1) . There are various types of response surface designs including the central composite designs (CCD) and Box-Behnken designs ^[9] .

Note 2 to entry: An obvious benefit of response surface designs is the suggested adjustments to predictor variables (assumed to be continuous) that lead to “improved” responses.

Note 3 to entry: An early precursor that anticipated response surface methodology was Evolutionary Operation (EVOP). This process consists of systematic changes to the settings in striving for optimal performance ^[9] .

EXAMPLE 1:

An example of a central composite design is given ^[10] . It is a set of experimental treatments (3.1.13) consisting of cube points (3.1.37) , star points (3.1.38) and centre points (3.1.39) chosen so as to produce an efficient design [typically rotatable (3.1.40) ]. For three predictor variables (3.1.4) , the set shown in Table 7 constitutes a central composite design:

Table 7 — Central composite design

Experimental unit	x₁	x₂	x₃
1	−1	−1	−1
2	1	−1	−1
3	−1	1	−1
4	1	1	−1
5	−1	−1	1
6	1	−1	1
7	−1	1	1
8	1	1	1
9	0	0	0
10	0	0	0
11	2	0	0
12	−2	0	0
13	0	2	0
14	0	−2	0
15	0	0	2
16	0	0	−2

Experimental units 1 to 8 include the cube points of the design, equivalent to a 2³factorial design (3.2.5) . The levels of the predictor variables are given as coded values. Experimental units 9 and 10 are the centre points and 11 to 16 are the star points. The first eight experimental units constitute the units in a 2³ factorial design, which could be run separately. Subsequently, the remainder of the design could be undertaken and the results combined in one analysis. The actual order of treatments, overall or within a subgroup, should be randomized (3.1.30) . The central composite design facilitates this sequential assembly of design components. A fitted model to data obtained from the design can consist of linear (x₁, x₂, x₃), quadratic (x₁², x₂², x₃²), and two-way interactions (x₁x₂, x₁x₃, x₂x₃). The maximum or minimum of the estimated model can be sought via a numerical search, if the objective is an optimization. If the objective is to meet a target value, then the fitted function is evaluated to locate the set of predictor values that produce the specified target.

Note 4 to entry: A common variant of the central composite design using fewer factor levels (3.1.12) is the face-centered central composite design obtained by setting α = 1 for all of the star points. The fewer number of levels of the factors may sacrifice rotatability (depending on the number of factors).

EXAMPLE 2:

A Box-Behnken design is constructed by judicious combination of 2 ^k factorial designs with balanced incomplete block designs (3.2.14) . The set shown in Table 8 constitutes the three-variable Box-Behnken design:

Table 8 — Box-Behnken design

Experimental unit	x₁	x₂	x₃
1	0	−1	−1
2	0	1	−1
3	0	−1	1
4	0	1	1
5	−1	0	−1
6	1	0	−1
7	−1	0	1
8	1	0	1
9	−1	−1	0
10	1	−1	0
11	−1	1	0
12	1	1	0
13	0	0	0
14	0	0	0
15	0	0	0

EXAMPLE 3:

A pentagon design is a two-factor design in which the design points consist of five equally spaced locations on the unit circle (using units of the coded variables), and possibly replicated centre points. A set of five points satisfying the definition is: (1, 0), (0,309, 0,951), (−0,809, 0,588), (−0,809, −0,588), (0,309, −0,951). Note that cos(72°) = 0,309; sin(72°) = 0,951, etc.

EXAMPLE 4:

A hexagon design is a two-factor design in which the design points consist of six equally spaced locations on the unit circle (using units of the coded variables), and possibly replicated centre points. A set of six points satisfying the definition is: (1, 0), (0,5, 0,866), (−0,5,0,866), (−1, 0), (−0,5, −0,866), (0,5, −0,866). Note that cos(60°) = 0,5; sin(60°) = 0,866, etc.

Note 5 to entry: Any regular geometric figure inscribed in the unit circle can serve as the basis of a rotatable design within the class of response surface designs.

3.2.20

mixture design

design of experiments with mixtures

designed experiment (3.1.27) constructed to handle the situation in which one or more predictor variables (3.1.4) are constrained to sum to a fixed quantity

Note 1 to entry: Factors (3.1.5) representing proportions of metals in an alloy are a typical example of a mixture design. The design region (3.1.9) must satisfy X₁+ X₂+ ... + X_k= 1. Special purpose designs are also available if further restrictions apply, such as a minimal proportion for selected factors. A comprehensive treatment of mixture designs is given in Reference [8]. Mixture designs are particularly useful for situations involving the blending of materials.

3.2.21

nested design

designed experiment (3.1.27) with at least two factors (3.1.5) and with a hierarchical relationship among the factors

Note 1 to entry: Hierarchical relationships imply a type of rank order among the factors in the sense that each given factor level (3.1.12) occurs together with only a single level of another factor immediately above it in the rank order. These factors are referred to as nested factors. In other words, the factors in a nested design are hierarchically ordered in such a way that each level of a subordinate factor appears at most in a single level of each of its superordinate factors.

Note 2 to entry: This design can be used to evaluate the variance components of the factors involved. For the case of three factors, A, B, and C, every level of factor B appears with only a single level of factor A; similarly, every level of factor C appears with only a single level of factor B. The k-factor nested design ここで,k ≥ 2, is sometimes referred to as a k-stage nested design.

Note 3 to entry: Nested designs provide the capability of identifying the primary contributors to the overall variability in a process in which the contributing factors have subordinate relationships among themselves.

Note 4 to entry: Crossed designs (see Note 4 of 3.2.1) and nested designs are commonly considered in the context of measurement systems analysis applications. It is critical to recognize if factors are nested or crossed in estimating associated variance components.

Note 5 to entry: Generally, nested designs are used to evaluate results in terms of variance components (3.1.8) rather than in terms of differences in response levels or prediction models.

EXAMPLE 1:

Nested designs occur in animal husbandry situations. Consider a situation with two farms, each with respective herds of one or more bulls and two or more cows producing offspring fathered by one of the bulls. The layout in Figure 6 illustrates the structure:

Figure 6 — Nested design in animal breeding

EXAMPLE 2:

Consider a situation in which three different suppliers provide four batches of raw material to a company that will subsequently assay the batches to determine purity.

Figure 7 — Nested design in manufacturing

As depicted in Figure 7, the batches are nested within each supplier, since, for example, batch 1 from supplier 1 is distinct from batch 1 from supplier 2. Although the batch “label” is the same, the factors batch and supplier are not crossed. This example would still constitute a nested or hierarchical design in the event that the suppliers each provided a different number of batches. The setup shown in Figure 8 is also a nested or hierarchical design:

Figure 8 — Another nested design (unbalanced)

However, the analysis would be much more straightforward if the number of batches from each supplier were the same.

3.2.22

balanced nested design

fully nested design

nested design (3.2.21) in which the number of factor levels (3.1.12) of the nested factors (3.1.5) is constant

EXAMPLE:

Figure 9 depicts a balanced nested design:

Figure 9 — Balanced nested design

This design is a balanced nested design because every laboratory expends two days (the number of levels of factor B is two), and two measurement results are obtained on every day at each of the laboratories (the number of the levels of factor C is two). The days used by the laboratories are likely to be different because they presumably have been chosen randomly over a given testing window.

Note 1 to entry: It is sometimes possible to adjust the definition of a factor into levels that can be compared across the other factors so as to obtain more meaningful information. B₁ can be allocated to Monday and B₂ can be allocated to Friday. Therefore, results obtained on Mondays can be compared with those obtained on Fridays. All of the laboratories would thus have this in common, unlike the previous situation where they were two unrelated (across laboratories) day allocations. This configuration would now represent a crossed (i.e. each level of a factor is used with all levels of the other factors) rather than a nested classification and hence, can be viewed as a factorial experiment.

3.2.23

staggered nested design

nested design (3.2.21) in which the second nested factor (3.1.5) has two factor levels (3.1.12) in one level of the first nested factor but has only one level in the other level of the first nested factor

EXAMPLE:

Figure 10 depicts a staggered nested design:

Figure 10 — Staggered nested design

Note 1 to entry: For staggered nested designs, all factor effects (3.1.14) are estimated with approximately the same number of degrees of freedom.

3.2.24

optimal design

designed experiment (3.1.27) whose factor level (3.1.12) settings are determined to optimize a particular criterion, typically a function of the design matrix (3.2.25)

Note 1 to entry: In optimizing a particular criterion, note that the resulting optimal design is predicated upon having the correct model (3.1.2) . If the presumed model is incorrect, then the optimal design may be theoretically optimal (i.e. mathematically) but may not be useful for practical purposes. Nonetheless, several of the experimental designs mentioned earlier in this sub-clause can be considered as optimal designs. An optimal design strives to obtain the best possible parameter estimates corresponding to a specified criterion within a class of presumed models [e.g. quadratic function of the predictor variables (3.1.4) ].

Note 2 to entry: As noted in the Introduction to this part of ISO 3534, costs play an important role in experimental design. Minimization of cost while attaining a certain level of quality can be an objective in experimentation. However, for this clause, attention is restricted to criteria related to statistical estimation. An assumed model of the form mml_m31 is the starting point in searching for designs which assess the estimation of β .

3.2.25

design matrix

matrix with rows representing individual experimental treatments (3.1.13) (possibly transformed according to the assumed model) which can be extended by deduced levels of other functions of factor levels ( interactions (3.1.17) , quadratic terms, etc.) but are dependent upon the designed experiment (3.1.27)

Note 1 to entry: For a given experimental plan (3.1.29) , several design matrices can be envisaged depending upon the assumed model. Regardless of the assumed model, the assignment matrix is a subset of the columns representing the settings of the predictor variables (3.1.4) , necessary to conduct the experiment. See Note 2 of 3.1.27.

Note 2 to entry: The design or model matrix is commonly denoted X. Each row of X corresponds to a single treatment. The first column of X may consist of all 1's if an overall mean term, for example µ, is in the model. Other columns can represent factor levels (3.1.12) or functions of the predictor variables. For a 2³factorial design (3.2.5) , one possible design matrix (8 x 7) is given, as follows:

mml_m32

The columns in order correspond to I, A, B, C, ab, AC, and BC. The corresponding model is mml_m33

where y and ε are vectors of dimension 8×1 and β is a vector of dimension 7×1. The matrix X is dimension 8×7 with entries as just given. The entries of the transposed vector β can be represented as (µ,β₁,β₂,β₃,β₄,β₅,β₆). The intercept term µ corresponds to column I having all 1’s as entries. The term “ β₁” is the regression coefficient associated with factor A. The regression coefficient β₁ is equal to one half of the main effect associated with factor A. Analogous statements can be made to relate β₂ to factor B, to relate β₃ to factor C, and so forth to relate β₆ to the interaction factor BC.

3.2.26

D-optimal design

optimal design (3.2.24) that maximizes the determinant of X´XwhereX is the design matrix (3.2.25)

Note 1 to entry: The notation X´X indicates the multiplication of the transpose of X with X. For example, if X has dimension n×p, then X´X has dimension mml_m34 .

Note 2 to entry: The criterion for the D-optimal design relates to the volume of the confidence ellipsoid of the coefficients associated with the design matrix X. Thus, a D-optimal design is one that can produce high precision regression parameter estimates. Such designs can be obtained within many statistical software packages, thus relieving the burden of their construction. Such designs are “optimal” provided the assumed model is correct, which represents a potential criticism of the designs

EXAMPLE:

Many common experimental designs in use are D-optimal, although this property may not be immediately evident to the practitioner. For example, the Plackett-Burman screening designs (3.2.8) are D-optimal with respect to a main effects model.

3.2.27

A-optimal design

optimal design (3.2.24) that maximizes the trace of X´XwhereX is the design matrix (3.2.25)

Note 1 to entry: The A-optimal criterion is equivalent to minimizing the sum of the variances of the estimators for the different regression coefficients. Equivalently, the minimization is of the arithmetic mean of the sum of the variances of the estimators for the different regression coefficients.

3.2.28

G-optimal design

optimal design (3.2.24) that minimizes the maximum variance of prediction over the design region (3.1.11)

Note 1 to entry: The G-optimal criterion is to minimize the maximum variance of the regression function over the range of the design region.

Note 2 to entry: It can be shown that in certain mathematical contexts the D- and G-optimality criteria are equivalent, so that one can use the G-optimality criterion (which facilitates the optimization process) to obtain a D-optimal design ^[9] .

3.2.29

orthogonal design

designed experiment (3.1.27) with a design matrix (3.2.25) X such that X′X is a diagonal matrix

Note 1 to entry: In an orthogonal design, each pair of columns of the design matrix are orthogonal. Further for the specific model mml_m36 isσ² ( X′X )⁻¹ = σ²I .

3.2.30

saturated design

designed experiment (3.1.27) whose design matrix (3.2.25) has the same number of columns as the number of runs of the experiment

Note 1 to entry: It is impossible to estimate unambiguously more parameters in the model than experimental units in the design. Although each parameter could be estimated in this case, there would be no degrees of freedom remaining to estimate the variance of the residual error (3.1.6) , which further precludes hypothesis testing.

Note 2 to entry: Supersaturated designs (see 3.2.8, Note 4) provide a mechanism to investigate situations where the number of factors of interest exceeds the number of experimental units (3.1.24) in the designs (but not unambiguously as indicated in the previous note).

3.2.31

completely randomized design

designed experiment (3.1.27) with no restriction on randomization (3.1.30) in the assignment of experimental treatments (3.1.13) to experimental units (3.1.24)

Note 1 to entry: A completely randomized design may be particularly appropriate, if there were no prior knowledge of possible heterogeneity among the experimental units (i.e. blocking (3.1.26) would be an appropriate approach in such a circumstance).

Note 2 to entry: Completely randomized designs are used for studying the effects of one primary factor (3.1.5) without the need to take other factors (variables) into account as in a one-factor experiment (3.1.33) .

3.3 Methods of analysis

3.3.1

graphical method

pictorial depiction of results from an experiment (3.1.1)

Note 1 to entry: Simple plots can provide an initial, effective assessment as to the outcome of a designed experiment. Examples given in this section of the standard include main effects plots (3.3.2) , interaction plots (3.3.3) , quantile plots of effects (3.3.4) , and residual plot (3.3.5) .

3.3.2

main effects plot

plot giving the average responses at the various factor (3.1.5) levels of individual factors

EXAMPLE:

Table 9 presents data used in this example. The data is drawn from Box, Hunter and Hunter ^[2] .

Table 9 — Data associated with Figure 3

#	Catalyst	Temperature	Pressure	Concentration	Conversion %
1	−	−	−	−	71
2	+	−	−	−	61
3	−	+	−	−	90
4	+	+	−	−	82
5	−	−	+	−	68
6	+	−	+	−	61
7	−	+	−	−	87
8	+	+	+	−	80
9	−	−	−	+	61
10	+	−	−	+	50
11	−	+	−	+	89
12	+	+	−	+	83
13	−	−	+	+	59
14	+	−	+	+	51
15	−	+	+	+	85
16	+	+	+	+	78

The following figure gives such a plot for the example taken from 10.8 of Reference [2]. The response variable is the conversion percentage and the predictor variables (3.1.4) are the catalyst charge (A), the temperature (B), the pressure (C), and the concentration (D). Each predictor variable was given at two levels, denoted “−” for low, and “+” for high. A 2⁴full factorial experiment (3.2.5) was conducted. From Figure 11, it is apparent that temperature appears to have the most substantial effect on conversion, with the catalyst second and the remaining two factors fairly comparable. Additional analyses would be necessary to assess whether the slopes of the connected lines in the plot are significantly different from zero.

Figure 11 — Main effects plot

Note 1 to entry: A main effects plot gives the average value of the response variable (3.1.3) at the various levels of each factor. The nature and magnitude of the effect of each factor on the response is apparent. The presence of interactions (3.1.17) can hide the effects of various factors.

3.3.3

interaction plot

main effects plot (3.3.2) for a single factor (3.1.5) constructed for each level of another factor

Note 1 to entry: Interaction (3.1.17) plots provide a graphical detection tool for interpreting interactions. Lack of parallelism in the plot is an indication of interaction effects. See the plots in (3.1.17).

3.3.4

quantile plot of effects

plot of the standard normal quantiles versus the estimated factor effects (3.1.14) in a full factorial (3.2.1) or fractional factorial design (3.2.3)

EXAMPLE:

Table 10 provides the estimates of the effects corresponding to the example in 3.3.2.

Table 10 — Estimated factor effects

A	−8,00	ab	1,00	ABC	−0,75
B	24,00	AC	0,75	ABD	0,50
C	−2,25	AD	0,00	ACD	−0,25
D	−5,50	BC	−1,25	BCD	−0,75
		BD	4,50	ABCD	−0,25
		CD	−0,25

Figure 12 shows a plot of the estimated factor effects from the example in 3.3.2.

Figure 12 — Quantile plot of effects

Note 1 to entry: For experiments (3.1.1) without replication, this plot may suggest dominant effects (i.e. those points far to the left or far to the right of a “guide”-line through the main body of the plotted points). In Figure 11, the upper right-hand point with a main effect (3.1.15) equal to 24 corresponds to the temperature effect. In Figure 12, the upper right-hand point with an estimated factor effect (3.1.14) equal to 24 is seen from Table 10 to be the effect of B which is temperature, so it is twice the estimated main effect (3.1.15) of temperature.

3.3.5

residual plot

plot of the residuals (3.1.7) versus the corresponding values of the predictor variable (3.1.4) or versus the factor levels (3.1.12) of a particular factor (3.1.5)

EXAMPLE:

The example given in 3.3.2 is continued using the model with the four main effects (3.1.15) and the BD interaction (3.1.17) as the model shown in Figure 13.

Figure 13 — Residual plot

Note 1 to entry: Residual plots may assist in identifying outliers (extreme observations) relative to the overall model (3.1.2) fit or provide an indication of non-linearity Moreover, a possible non-constant variance (i.e. heteroscedasticity) can be revealed as well with a residual plot (e.g. an increasing dispersion of residuals for increasing predicted values.) Additional plots of the residuals versus other available variables not included in the model may indicate the need to refine the model.

Note 2 to entry: A quantile plot (3.3.4) of the residuals can be used to detect a deviation from the assumption of normality of the residual error (3.1.6) .

3.3.6

method of least squares

technique of parameter estimation which minimizes ∑e²ここで,e is the difference between the observed value and the predicted value derived from the assumed model (3.1.2) , and the sum is taken over all experimental treatments (3.1.13)

Note 1 to entry: Pure random error (3.1.9) associated with individual observations are ordinarily assumed to be independent, although inferential methods can be employed to include correlated errors. The usual analysis of variance (3.3.8) , regression analysis (3.3.7) and analysis of covariance (3.3.12) are all based on the method of least squares and provide different computational and interpretative advantages stemming from certain balances within the experimental arrangements which permit convenient groupings of the data.

Note 2 to entry: The method of least squares is mainly used for linear or linearized models, which are linear in the parameters.

3.3.7

regression analysis

procedures associated with assessing models (3.1.2) relating predictor variables (3.1.4) to response variables (3.1.3)

Note 1 to entry: Regression analysis is commonly associated with the process of estimating the parameters of an assumed model by optimizing the value of an objective function (for example, minimizing the sum of squared differences between the observed responses and those predicted by the model). The existence of statistical software packages has facilitated obtaining parameter estimates, their standard errors, and contains a wealth of model diagnostics. Regression analysis also facilitates consideration of other measures for the response. For example, if dispersion effects (3.1.16) are of interest in a replicated factorial design (3.2.1) , the response using logarithm of S_i² (where S_i² is the sample variance of replicated points) may be more easily analyzed and interpreted than the responses themselves.

Note 2 to entry: Regression analysis plays a role similar to the analysis of variance (3.3.8) and is particularly relevant when the levels of the factors (3.1.5) are continuous and emphasis is on an explicit predictive model. Regression analysis can also be used in designed experiments with missing data unlike the analysis of variance which requires balance between data. However, lack of balance increases the order-dependency (common elements are included in the first correlated term and not included in subsequent terms) of the hypothesis tests as well as losing other advantages of balanced experiments. For balanced experiments, the two techniques are simply variations of the method of least squares and produce comparable results.

EXAMPLE:

Consider an orthogonal design (3.2.29) having three quantitative factors in a 2³factorial design (3.2.5) , with only a single replicate (3.1.36) and the assumed model for the ith individual experimental unit (3.1.24) is

mml_m37

where

x_i0	is equal to 1;
x_i1	is the level of factor A;
x_i2	is the level of factor B;
x_i3	is the level of factor C;
ε_i	is the random error.

With this formulation, the associated design matrix (3.2.25) is [x_ij ] wherei labels the rows and j labels the columns. This model can also apply for three qualitative factors having coded levels of −1 and +1. In Table 11, the estimators of the regression coefficients are given in terms of the response variables Y_i and hence are random variables. Once the experiment (3.1.1) has been conducted, the realizations of the response variables y_i replace the random variables in the formulae and some use b_i to replace mml_m38

in the appropriate expression.

Table 11 — Regression analysis table for the example

Source of variation	Estimators of regression coefficients	Degrees of freedom (DF)	Mean square (MS)
Total	−	8	−
Constant (X₀)		1	Sx₀
Regression for X₁(A)		1	Sx₁
Regression for X₂(B)		1	Sx₂
Regression for X₃(C)		1	Sx₃
Residual	−	4	S_E/4

The expressions given in Table 11 are perhaps simpler if the matrix notation employing the design matrix is used. In general, the estimates of the β_i ′s are obtained from mml_m49

.
The residual vector is given by mml_m51

Note 3 to entry: If the 2³ experiments were replicated within the same block, the degrees of freedom for the “total” (line 1) would become 16 and for the “residual” would become 12. The “residual” sum of squares may then be partitioned into two elements associated with “replicates” and “lack of fit”, with 8 and 4 degrees of freedom respectively.

Table 12 — Regression analysis table for Example 1 — Addenda for replicated experiment

Source of variation	Sum of squares (SS)	Degrees of freedom (DF)	Mean square (MS)
Residual	S_E	12	S_E/12
Replicates		8	S_R/8
Lack of fit		4	S_L/4

The general matrix equivalents of the above expressions are given following Table 11.

Note 4 to entry: The statistical significance of each source can be tested using the F-statistic for the mean square of that source and the appropriate residual error (3.1.6) under suitable normality assumptions. For the single replicate situation, the “regression” terms would be tested against the “residual” term. For the two replicates situation, the “lack of fit” term would be tested against the “replicates” [“ pure random error (3.1.9) ”] term to determine whether the model is inadequate. The “replicates” term represents a measure of experimental error free of the potential contribution of model inadequacy [i.e. misspecification error (3.1.10) ] which would be included in the “residual” term.

3.3.8

analysis of variance

ANOVA

technique which subdivides the total variation of a response variable (3.1.3) into components associated with defined sources of variation.

Note 1 to entry: ANOVA facilitates the estimation of variance components (3.1.8) and the testing of hypotheses on the parameters of a model (3.1.2) .
An analysis of variance table usually contains columns for

source of variation;
sum of squares (SS);
degrees of freedom (DF);
mean square (MS) (sum of squares divided by degrees of freedom);
F (ratios of mean squares for the row to the mean square associated with error);
expected mean squares (mathematical expectation of the sum of squares given in terms of the parameters of the model).

The rows of the table represent specific factor effects (3.1.14) or interactions (3.1.17) , blocks (3.1.25) [if blocking (3.1.26) were employed in the experimental design (3.1.28) ], or residual error (3.1.6) (the remaining effects not accounted for by the model or the blocks). A row designated “Total” is usually given which provides the total sum of squares about the overall average and based on the degrees of freedom which is one less than the total number of observations.

EXAMPLE:

Consider a randomized block design (3.2.10) , in which the observation obtained from the ith of l levels of a factor A in the jth of h blocks is denoted by Y_ij = (i = 1, 2, ..., l; j = 1, 2, ..., h). The factor of primary interest A represents a fixed factor effect (3.1.14) ; factor B represents a factor effect associated with blocking (3.1.26) . The following ANOVA table (Table 13) is computed:

Table 13 — Analysis of variance table

Source	Mean square (MS)	F	Expected mean square
Total	−	−	−
Factor A (Treatment)
Factor B (Block)
Error		−	σ²

In the ANOVA table:

mml_m69

mml_m70

F(ν₁,ν₂) is the F-statistic.
One model associated with the observations is given as
mml_m71

; i = 1, 2,…,l; j = 1, 2,…,h
with

mml_m72

mml_m73

where

µ	is the general mean;
α_i	is the effect of the ith treatment;
β_j	is the effect of the jth block;
e_ij	is the residual error.

For this example, it is assumed that fixed levels are designated. The least squares estimates of µ, α_i , β_j and σ² are obtained by

mml_m74

mml_m75

mml_m76

mml_m77

The formulae shown in this example are simplified because in a randomized block design each cell should contain an equal number of observations. The general matrix equivalents of the above expressions are given following Table 12 in (3.3.7).

Note 2 to entry: The basic assumptions of ANOVA are that the effects due to all the sources of variation are additive and that the pure random errors (3.1.9) are independently and normally distributed, with a zero mean and have equal variances (homoscedasticity). Hence, it is assumed that the expectation of the response variable is additive in the parameters, which is the case in the examples given here. The technique, in conjunction with the F-ratio, is used to provide a test of significance for the effects of these sources of variation and/or to obtain an estimate of the variances attributable to these sources. The assumption of a normal distribution is required only for this test of significance and confidence intervals. Averages and interactions are usually examined by summarizing them in 2-way (or k-way) tables. This example assumes a model 1 or fixed effects model (3.3.9) . When the assumption of normal distribution of errors cannot be made, it is sometimes possible to use transformations (for example, logarithms) of the response variable or to apply a nonparametric procedure.

3.3.9

fixed effects analysis of variance

analysis of variance (3.3.8) in which the factor levels (3.1.12) of each factor (3.1.5) are preselected over the range of values of the factors

Note 1 to entry: With fixed levels, it is inappropriate to compute variance components (3.1.8) . This model (3.1.2) is sometimes referred to as a model 1 analysis of variance.

3.3.10

random effects analysis of variance

analysis of variance (3.3.8) in which each factor level (3.1.12) of each factor effect (3.1.14) is assumed to be sampled from the distribution of levels of each factor

Note 1 to entry: With random levels, the primary interest is usually to obtain estimates of variance components (3.1.8) . This model is commonly referred to as a model 2 analysis of variance. It is inappropriate to compute estimates of the effects of the selected factor levels.

EXAMPLE:

Consider a situation in which an operation processes batches of raw material. “Batch” may be considered a random factor in an experiment when a few batches are randomly selected from the population of all batches.

3.3.11

mixed model analysis of variance

analysis of variance (3.3.8) in which the levels of some factors (3.1.5) are fixed and the levels of other factor effects (3.1.14) are sampled from the distribution of levels of the factors

Note 1 to entry: Components of variance are meaningful only for the random level factors and their interactions with fixed effect factors. Moreover, estimates of effects apply only for fixed factors. This model is also referred to as a model 3 analysis of variance.

3.3.12

analysis of covariance

ANCOVA

technique for estimating and testing the effects of experimental treatments (3.1.13) when one or more observable but uncontrollable factors (3.1.5) influence the response variable (3.1.3)

Note 1 to entry: The analysis of covariance can be viewed as a combination of regression analysis (3.3.7) and the analysis of variance (3.3.8) .

Note 2 to entry: Usually the uncontrollable factors cannot be accounted for in the experimental design (3.1.28) and their effects on the results should be accounted for in the analysis. For example, the experimental units (3.1.24) may differ in the amount of some chemical constituent present in each unit, which can be measured, but not readily adjusted.

Note 3 to entry: This is an extension of the analysis of variance technique to cover the cases where observations are taken on more than one variable from each experimental unit.

Bibliography

1	ISO 10241-1:2011, International terminology standards — Part 1: Preparation and layout
2	Box, G.E.P., Hunter, W. G. and Hunter, J. S. Statistics for Experimenters. An Introduction to Design, Data Analysis, and Model Building. John Wiley & Sons, New York, 1978
3	Box, G.E.P., Hunter, W. G. and Hunter, J. S. Statistics for Experimenters. Design, Innovation, and Discovery. John Wiley & Sons, New York, 2005
4	Plackett, R. L. and Burman, J. P. The design of optimum multifactorial experiments, Biometrika, 33 , 1946, pp. 305-325
5	Lin, D.K.J. A new class of supersaturated designs. Technometrics, 35 , 1993, pp. 28-31
6	Wu, C.F.J. Construction of supersaturated designs through partially aliased interactions. Biometrika, 80 (3), 1993, pp. 661-669
7	Fisher, R.A. and Yates, F. Statistical Tables for Biological, Agricultural, and Medical Research. Oliver and Boyd, Edinburgh, 4th edition, 1953
8	Cornell, J. A. Experiments With Mixtures. 2^nd ed. John Wiley, New York, 1990
9	Box, G.E.P. and Draper, N.R. Empirical Model — Building and Response Surfaces. John Wiley & Sons, New York, 1987
10	Box, G.E.P. and Wilson, K.B. On the experimental attainment of optimum conditions (with discussion). Journal of the Royal Statistical Society, Series B, 13, 1951, pp. 1-45

ISO 3534-3:2013 統計—用語, 語彙と記号—パート3：実験計画 | ページ 6

3 用語と定義

3.1 一般用語

3.2 実験の準備

3.3 分析方法

参考文献

3 Terms and definitions

3.1 General terms

3.2 Arrangements of experiments

3.3 Methods of analysis

Bibliography

ISO PDF プレビュー