-prof-gen-sampling compiler option
using with profrun
using with SSP
-prof-gen compiler option
using with SSP
-prof-genx compiler option
code-coverage tool
test-prioritization tool
-prof-use compiler option
code-coverage tool
profmerge utility
using with profrun
using with SSP
-ssp compiler option
using with SSP
.dpi file
Code-coverage Tool
Profmerge and Proforder Utilities
Test-prioritization Tool
.dyn file
Code-coverage Tool
Profmerge and Proforder Utilities
Test-prioritization Tool
.hpi file
.spi file
Code-coverage Tool
Generating Function Order Lists
Test-prioritization Tool
.tb5 file
/Qprof-gen-sampling compiler option
using with profrun
using with SSP
/Qprof-gen compiler option
using with SSP
/Qprof-genx compiler option
code-coverage tool
test-prioritization tool
/Qprof-use compiler option
code-coverage tool
profmerge utility
using with profrun
using with SSP
/Qssp compiler option
using with SSP
accessing arrays efficiently
accuracy
controlling
advanced PGO options
Advanced PGO Options
Example of Profile-Guided Optimization
aliases
Stacks: Automatic Allocation and Checking
Vectorization Examples
aligning data
Aligning Data
Setting Data Type and Alignment
alignment
Alignment Options
Efficient Compilation
Multifile IPO Overview
Setting Data Type and Alignment
Understanding Data Alignment
Vectorization Examples
example
options
strategy
Alignment Options
Vectorization Examples
ALLOCATABLE
Stacks: Automatic Allocation and Checking
Using Arrays Efficiently
allocating temporary arrays
analyzing
code for OpenMP*
data dependency
data flow
effects of multifile IPO
performance
programming
analyzing applications
Performance Enhancement Strategies
Using a Performance Methodology
Using Intel Performance Analysis Tools
Intel® Debugger
Intel® Threading Tools
VTune(TM) Analyzer
analyzing hotspots
application
basic block
code coverage
OpenMP*
pipelining
tests
visual presentation
application characteristics
application performance
application tests
architectures
coding guidelines for
argument aliasing
arithmetic precision
improving
restricting
arrays
alignment in vectorization
efficient compilation of
loop blocking
operations in a loop body
rules for improving I/O performance
using efficiently
assumed-shape arrays
ATOMIC
auto-parallelization
Auto-parallelization Overview
Parallelism Overview
diagnostic
enabling
environment variables
guidelines
overview
programming with
threshold
auto-parallelized loops
auto-parallelizer
Auto-parallelization Overview
Parallelism: an Overview
controls
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
enabling
auto-vectorization
Coding Guidelines for Intel® Architectures
Parallelism Overview
auto-vectorizer
automatic
allocation of stacks
Optimizing the Compilation Process Overview
Stacks: Automatic Allocation and Checking
checking of stacks
optimization for IA-32 systems
avoid
EQUIVALENCE statements
inefficient data types
mixed arithmetic expressions
slow arithmetic operators
small integer data items
unnecessary operations in DO loops
BARRIER
basic PGO options
big-endian data
conversion of little-endian data to
browsing frames using the coverage tool
buffers
UBC system
cache size intrinsic
changing number of threads
checking
floating-point stacks
stacks
chunk size
specifying
clauses
in parallel region directives
in worksharing construct directives
summary table of
cleanup of loops
code-coverage tool
coloring scheme for
dynamic counters in
export data
options
options in
syntax of
visual presentation of
code design considerations
preparing for OpenMP* programming
coding
for efficiency in Intel Fortran
guidelines for Intel Architectures
combined parallel and worksharing constructs
compilation
efficient
optimizing
phase
compiler
intermediate language files produced by
compiler reports
High-Level Optimization (HLO)
Interprocedural Optimizations (IPO)
report generation
software pipelining
vectorization
computing denormals
conditional parallel region execution
controlling
auto-parallelizer diagnostics
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
data scope attributes
inline expansion
rounding
correct usage of countable loop
Loop Constructs
Loop Exit Conditions
countable loop
COS
countable loop
correct usage of
Loop Constructs
Loop Exit Conditions
counters for dynamic profile
CPU time
create libraries using IPO
creating
DPI list
multifile IPO executable
Creating a Multifile IPO Executable
Creating a Multifile IPO Executable Using a Project Makefile
multithreaded applications
criteria
for inline function expansion
cross-iteration dependencies
data alignment
data format
alignment
Alignment Options
Vectorization Examples
dependence
Auto-parallelization: Threshold Control and Diagnostics
Loop Transformations
Pipelining for Itanium®-based Applications
Scalar Replacement
options
partitioning
prefetching
HLO Overview
Loop Count and Loop Distribution
scope attribute clauses
sharing
structure
type
Improving Run-time Efficiency
Parallelism Overview
Vectorization Overview
data prefetches
data scope attribute clauses
data types
dataflow analysis
Auto-parallelization Overview
Parallelism Overview
DEFAULT
for record buffers
level optimization
listing
name
value
deferred-shape arrays
denormal exceptions
denormal numbers
flush-to-zero
denormals
denormals-are-zero
Coding Guidelines for Intel® Architectures
Processor-specific Run-time Checks for IA-32 Systems
dependence of data
dequeuing
derived-type components
determining parallelization
device-specific blocksize
diagnostic reports
Auto-parallelization Threshold Control and Diagnostics
Compiling with OpenMP*, Directive Format, and Diagnostics
diagnostics
Parallelism: an Overview
Vectorization Key Programming Guidelines
auto-parallelizer
Auto-parallelization Threshold Control and Diagnostics
Parallelism Overview
OpenMP*
difference operators
differential coverage
directives
commons
dcommons
records
sequence
structure
directives for OpenMP*
Combined Parallel and Worksharing Constructs
Worksharing Construct Directives
ATOMIC
BARRIER
CRITICAL
DO
END DO
END PARALLEL
END PARALLEL DO
END PARALLEL SECTIONS
END SECTIONS
END SINGLE
FLUSH
MASTER
ORDERED
PARALLEL
PARALLEL DO
PARALLEL SECTIONS
PARALLEL WORKSHARE
SECTION
SECTIONS
SINGLE
WORKSHARE
directory
specifying for dynamic information files
disabling
function splitting
inlining
disk I/O
efficient use of
DISTRIBUTE POINT
division-to-multiplication optimization
DO constructs
order of
double-precision
numbers
Coding Guidelines for Intel Architectures
Improving or Restricting FP Arithmetic Precision
Improving Run-time Efficiency
OpenMP* Run-time Library Routines
dummy arguments
Improving I/O Performance
Using Arrays Efficiently
dumping profile information
Dumping Profile Information
Interval Profile Dumping
dyn files
Advanced PGO Options
Basic PGO Options
Dumping and Resetting Profile Information
Dumping Profile Information
Example of Profile-Guided Optimization
PGO Environment Variables
dynamic-information files
dynamic counters
dynamic information
PGO Environment Variables
Profile-guided Optimizations Overview
directory for files
dumping profile information
files
resetting profile counters
threads