-prof-gen-sampling compiler option
    using with profrun
    using with SSP
-prof-gen compiler option
    using with SSP
-prof-genx compiler option
    code-coverage tool
    test-priorization tool
-prof-use compiler option
    code-coverage tool
    profmerge utility
    using with profrun
    using with SSP
-ssp compiler option
    using with SSP
.dpi file
   Code-coverage Tool
   Profmerge and Proforder Utilities
   Test-prioritization Tool
.dyn file
   Code-coverage Tool
   Profmerge and Proforder Utilities
   Test-prioritization Tool
.hpi file
.spi file
   Code-coverage Tool
   Generating Function Order Lists
   Test-prioritization Tool
.tb5 file
/Qprof-gen-sampling compiler option
    using with profrun
    using with SSP
/Qprof-gen compiler option
    using with SSP
/Qprof-genx compiler option
    code-coverage tool
    test-priorization tool
/Qprof-use compiler option
    code-coverage tool
    profmerge utility
    using with profrun
    using with SSP
/Qssp compiler option
    using with SSP



accessing arrays efficiently
accuracy
    controlling
advanced PGO options
   Advanced PGO Options
   Example of Profile-Guided Optimization
aliases
   Stacks: Automatic Allocation and Checking
   Vectorization Examples
aligning data
   Aligning Data
   Setting Data Type and Alignment
alignment
   Alignment Options
   Efficient Compilation
   Multifile IPO Overview
   Setting Data Type and Alignment
   Understanding Data Alignment
   Vectorization Examples
    example
    options
    strategy
       Alignment Options
       Vectorization Examples
ALLOCATABLE
   Stacks: Automatic Allocation and Checking
   Using Arrays Efficiently
allocating temporary arrays
analyzing
    code for OpenMP*
    data dependency
    data flow
    effects of multifile IPO
    performance
    programming
analyzing applications
   Performance Enhancement Strategies
   Using a Performance Methodology
   Using Intel Performance Analysis Tools
    IntelŪ Debugger
    IntelŪ Threading Tools
    VTune(TM) Analyzer
analyzing hotspots
application
    basic block
    code coverage
    OpenMP*
    pipelining
    tests
    visual presentation
application characteristics
application performance
application tests
architectures
    coding guidelines for
argument aliasing
arithmetic precision
    improving
    restricting
arrays
    alignment in vectorization
    efficient compilation of
    loop blocking
    operations in a loop body
    rules for improving I/O performance
    using efficiently
assumed-shape arrays
ATOMIC
auto-parallelization
   Auto-parallelization Overview
   Parallelism Overview
    diagnostic
    enabling
    environment variables
    guidelines
    overview
    programming with
    threshold
auto-parallelized loops
auto-parallelizer
   Auto-parallelization Overview
   Parallelism: an Overview
    controls
       Auto-parallelization Threshold Control and Diagnostics
       Parallelism Overview
    enabling
auto-vectorization
   Coding Guidelines for IntelŪ Architectures
   Parallelism Overview
auto-vectorizer
automatic
    allocation of stacks
       Optimizing the Compilation Process Overview
       Stacks: Automatic Allocation and Checking
    checking of stacks
    optimization for IA-32 systems
avoid
    EQUIVALENCE statements
    inefficient data types
    mixed arithmetic expressions
    slow arithmetic operators
    small integer data items
    unnecessary operations in DO loops



BARRIER
basic PGO options
big-endian data
    conversion of little-endian data to
browsing frames using the coverage tool
buffers
    UBC system



cache size intrinsic
changing number of threads
checking
    floating-point stacks
    stacks
chunk size
    specifying
clauses
    in parallel region directives
    in worksharing construct directives
    summary table of
cleanup of loops
code-coverage tool
    coloring scheme for
    dynamic counters in
    export data
    options
    options in
    syntax of
    visual presentation of
code design considerations
    preparing for OpenMP* programming
coding
    for efficiency in Intel Fortran
    guidelines for Intel Architectures
combined parallel and worksharing constructs
compilation
    efficient
    optimizing
    phase
compiler
    intermediate language files produced by
compiler reports
    High-Level Optimization (HLO)
    Interprocedural Optimizations (IPO)
    report generation
    software pipelining
    vectorization
computing denormals
conditional parallel region execution
controlling
    auto-parallelizer diagnostics
       Auto-parallelization Threshold Control and Diagnostics
       Parallelism Overview
    data scope attributes
    inline expansion
    rounding
correct usage of countable loop
   Loop Constructs
   Loop Exit Conditions
    countable loop
COS
countable loop
    correct usage of
       Loop Constructs
       Loop Exit Conditions
counters for dynamic profile
CPU time
create libraries using IPO
creating
    DPI list
    multifile IPO executable
       Creating a Multifile IPO Executable
       Creating a Multifile IPO Executable Using a Project Makefile
    multithreaded applications
criteria
    for inline function expansion
cross-iteration dependencies



data alignment
data format
    alignment
       Alignment Options
       Vectorization Examples
    dependence
       Auto-parallelization: Threshold Control and Diagnostics
       Loop Transformations
       Pipelining for ItaniumŪ-based Applications
       Scalar Replacement
    options
    partitioning
    prefetching
       HLO Overview
       Loop Count and Loop Distribution
    scope attribute clauses
    sharing
    structure
    type
       Improving Run-time Efficiency
       Parallelism Overview
       Vectorization Overview
data prefetches
data scope attribute clauses
data types
dataflow analysis
   Auto-parallelization Overview
   Parallelism Overview
DEFAULT
    for record buffers
    level optimization
    listing
    name
    value
deferred-shape arrays
denormal exceptions
denormal numbers
    flush-to-zero
denormals
denormals-are-zero
   Coding Guidelines for IntelŪ Architectures
   Processor-specific Run-time Checks for IA-32 Systems
dependence of data
dequeuing
derived-type components
determining parallelization
device-specific blocksize
diagnostic reports
   Auto-parallelization Threshold Control and Diagnostics
   Compiling with OpenMP*, Directive Format, and Diagnostics
diagnostics
   Parallelism: an Overview
   Vectorization Key Programming Guidelines
    auto-parallelizer
       Auto-parallelization Threshold Control and Diagnostics
       Parallelism Overview
    OpenMP*
difference operators
differential coverage
directives
    commons
    dcommons
    records
    sequence
    structure
directives for OpenMP*
   Combined Parallel and Worksharing Constructs
   Worksharing Construct Directives
    ATOMIC
    BARRIER
    CRITICAL
    DO
    END DO
    END PARALLEL
    END PARALLEL DO
    END PARALLEL SECTIONS
    END SECTIONS
    END SINGLE
    FLUSH
    MASTER
    ORDERED
    PARALLEL
    PARALLEL DO
    PARALLEL SECTIONS
    PARALLEL WORKSHARE
    SECTION
    SECTIONS
    SINGLE
    WORKSHARE
directory
    specifying for dynamic information files
disabling
    function splitting
    inlining
disk I/O
    efficient use of
DISTRIBUTE POINT
division-to-multiplication optimization
DO constructs
    order of
double-precision
    numbers
       Coding Guidelines for Intel Architectures
       Improving or Restricting FP Arithmetic Precision
       Improving Run-time Efficiency
       OpenMP* Run-time Library Routines
dummy arguments
   Improving I/O Performance
   Using Arrays Efficiently
dumping profile information
   Dumping Profile Information
   Interval Profile Dumping
dyn files
   Advanced PGO Options
   Basic PGO Options
   Dumping and Resetting Profile Information
   Dumping Profile Information
   Example of Profile-Guided Optimization
   PGO Environment Variables
dynamic-information files
dynamic counters
dynamic information
   PGO Environment Variables
   Profile-guided Optimizations Overview
    directory for files
    dumping profile information
    files
    resetting profile counters
    threads


>>