old-cross-binutils/gdb/doc/LRS

196 lines
6.9 KiB
Text
Raw Normal View History

What's LRS?
===========
LRS, or Live Range Splitting is an optimization technique which allows a user
variable to reside in different locations during different parts of a function.
For example, a variable might reside in the stack for part of a function and
in a register during a loop and in a different register during another loop.
Clearly, if a variable may reside in different locations, then the compiler
must describe to the debugger where the variable resides for any given part
of the function.
This document describes the debug format for encoding these extensions in
stabs.
Since these extensions are gcc specific, these additional symbols and stabs
can be disabled by the gcc command option -gstabs.
GNU extensions for LRS under stabs:
===================================
range symbols:
-------------
A range symbol will be used to mark the beginning or end of a live range
(the range which describes where a symbol is active, or live).
These symbols will later be referenced in the stabs for debug purposes.
For simplicity, we'll use the terms "range_start" and "range_end" to
identify the range symbols which mark the beginning and end of a live
range respectively.
Any text symbol which would normally appear in the symbol table (eg. a
function name) can be used as range symbol. If an address is needed to
delimit a live range and does not match any of the values of symbols
which would normally appear in the symbol table, a new symbol will be
added to the table whose value is that address.
The three new symbol types described below have been added for this
purpose.
For efficiency, the compiler should use existing symbols as range symbols
whenever possible; this reduces the number of additional symbols which
need to be added to the symbol table.
New debug symbol type for defining ranges:
------------------------------------------
range_off - contains PC function offset for start/end of a live range.
Its location is relative to the function start and therefore
eliminates the need for additional relocation.
This symbol has a values in the text section, and does not have a name.
NOTE: the following may not be needed but are included here just
in case.
range - contains PC value of beginning or end of a live range
(relocs required).
NOTE: the following will be required if we desire LRS debugging
to work with old style a.out stabs.
range_abs - contains absolute PC value of start/end of a live
range. The range_abs debug symbol is provided for
completeness, in case there is a need to describe addresses
in ROM, etc.
Live range:
-----------
The compiler and debugger view a variable with multiple homes as a primary
symbol and aliases for that symbol. The primary symbol describes the default
home of the variable while aliases describe alternate homes for the variable.
A live range defines the interval of instructions beginning with
range_start and ending at range_end-1, and is used to specify a range of
instructions where an alias is active or "live". So, the actual end of
the range will be one less than the value of the range_end symbol.
Ranges do not have to be nested. Eg. Two ranges may intersect while
each range contains subranges which are not in the other range.
There does not have to be a 1-1 mapping from range_start to
range_end symbols. Eg. Two range_starts can share the same
range_end, while one symbol's range_start can be another symbol's
range_end.
When a variable's storage class changes (eg. from stack to register, or
from one register to another), a new symbol entry will be added to
the symbol table with stabs describing the new type, and appropriate
live ranges refering to the variable's initial symbol index.
For variables which are defined in the source but optimized away, a symbol
should be emitted with the live range l(0,0).
Live ranges for aliases of a particular variable should always be disjoint.
Overlapping ranges for aliases of the same variable will be treated as
an error by the debugger, and the overlapping range will be ignored.
If no live range information is given, the live range will be assumed to
span the symbol's entire lexical scope.
New stabs string identifiers:
-----------------------------
"id" in "#id" in the following section refers to a numeric value.
New stab syntax for live range: l(<ref_from>,<ref_to>)
<ref_from> - "#id" where #id identifies the text symbol (range symbol) to
use as the start of live range (range_start). The value for
the referenced text symbol is the starting address of the
live range.
<ref_to> - "#id" where #id identifies the text symbol (range symbol) to
use as the end of live range (range_end). The value for
the referenced text symbol is ONE BYTE PAST the ending
address of the live range.
New stab syntax for identifying symbols.
<def> - "#id="
Uses:
<def><name>:<typedef1>...
When used in front of a symbol name, "#id=" defines a
unique reference number for this symbol. The reference
number can be used later when defining aliases for this
symbol.
<def>
When used as the entire stab string, "#id=" identifies this
nameless symbol as being the symbol for which "#id" refers to.
<ref> - "#id" where "#id" refers to the symbol for which the string
"#id=" identifies.
Uses:
<ref>:<typedef2>;<liverange>;<liverange>...
Defines an alias for the symbol identified by the reference
number ID.
l(<ref1>,<ref2>)
When used within a live range, "#id" refers to the text
symbol identified by "#id=" to use as the range symbol.
<liverange> - "l(<ref_from>,<ref_to>)" - specifies a live range for a
symbol. Multiple "l" specifiers can be combined to represent
mutiple live ranges, separated by semicolons.
Example:
========
Consider a program of the form:
void foo(){
int a = ...;
...
while (b--)
c += a;
..
d = a;
..
}
Assume that "a" lives in the stack at offset -8, except for inside the loop where
"a" resides in register "r5".
The way to describe this is to create a stab for the variable "a" which describes
"a" as living in the stack and an alias for the variable "a" which describes it
as living in register "r5" in the loop.
Let's assume that "#1" and "#2" are symbols which bound the area where "a" lives
in a register.
The stabs to describe "a" and its alias would look like this:
.stabs "#3=a:1",128,0,8,-8
.stabs "#3:r1;l(#1,#2)",64,0,0,5
This design implies that the debugger will keep a chain of aliases for any
given variable with aliases and that chain will be searched first to find
out if an alias is active. If no alias is active, then the debugger will
assume that the main variable is active.