1 files changed, 170 insertions, 0 deletions
diff --git a/order-files/HOW TO BUILD b/order-files/HOW TO BUILD
new file mode 100644
index 00000000000..7cca046f37a
--- /dev/null
+++ b/order-files/HOW TO BUILD
@@ -0,0 +1,170 @@
+       Creating Order files, i.e., Scatter Loading the Compiler
+       --------------------------------------------------------
+
+This is a brief description on how to generate the cc1*.order files.
+The order files are intended to minimize the number of page-ins of
+the compiler as it is loaded.  If there is enough memory this
+benifits only the first load of the compiler since it will stay
+resident after that.
+
+Unfortunately it's a manual process since one of the tools requires
+an explicit interrupt from the terminal.  You should only need to
+(re)do the order files if there's any major reorganizations or
+additions to the compilers.
+
+There are five steps involved to genrrate the order files.
+
+1. Select test cases.
+
+   These should be "average" compilations to exercise each of the
+   cc1* compilers.  They should be large enough to take enough time
+   to generate acceptible results.  As of this writing the
+   following cases were chosen:
+   
+   For cc1        - gcc/c-decl.c (from the compiler sources)
+   For cc1plus    - Finder_FE/AboutWindow/AboutWindow.cp
+   For cc1obj     - MailViewer/Compose.subproj/MessageEditor.m
+   For cc1objplus - devkit/cpp.subproj/Cpp-precomp.mm
+
+   Of these four the cc1objplus test case is not very good.  
+   Unfortunately there are few .mm files of any significant size.
+   If a better one can be found it probably should be used.
+   
+2. Capture the command lines needed to build the chosen files.
+
+   For the selected projects built with with PB set PB's building
+   preferences for detailed build logs.  That way you can the
+   full command lines you need.  In non-PB projects like gcc the
+   command lines are of course echoed on the terminal. 
+
+3. Run selected command lines with -### to get the cc1* lines.
+
+   From the full command lines you need the cc1* lines generated
+   by the driver.  The easiest way to get these is to add -###
+   to the full command lines captured in step 2.
+   
+4. Prepare to generate the order files
+
+   If you don't already have it you should build a set of cc1*
+   compilers with -O2 with symbols.  The easiest way to do this
+   is FSF-style but using buildit with build_gcc probably will
+   also work.
+   
+   In the gcc objects directory you will of course have the cc1*
+   compilers.  You need to substitute these in each of the cc1*
+   command lines captured in step 3.  You also need to run these
+   with ~perf/bin/pcsample to generate the order files.  Thus,
+   for each cc1* command line it should have the following in
+   the beginning in place of the original cc1* of the step 3
+   lines.
+   
+   sudo ~perf/bin/pcsample -O -E $gcc3-obj/cc1* ... rest of line...
+   
+   Where $gcc3-obj represents whatever the path is to the gcc3
+   objects directory and cc1* is one of the cc1* compilers (is
+   -B necessary here?).
+   
+   Note, you need sudo because pcsample will only run as root.
+   Also, if you have a dual processor you need to reboot as in
+   single  processor mode.  If you don't pcscample will tell you
+   to do that by executing the command,
+   
+   nvram boot-args="cpus=1"
+   
+5. Generate the order files   
+   
+   Run the lines created in step 4.  The order files (cc1*.order)
+   will be left in /tmp in the direectory indicated by the summary
+   that pcsample displays when you hit cmd-c to stop the pcsample
+   execution.  Be sure to run pcsample long enough to compile the
+   entire program.
+   
+   At this point you now have the order files created.  You place
+   them in the order-files directory at the top level of the gcc
+   source directory.  
+   
+   You can also use them to measure he effects of these order files
+   on compiler page-ins.  If you do this go to the next step (6).
+   Otherwisw you are ready to go.
+   
+6. Creating the compilers with ther order files
+
+   You will need two versions of the cc1* files; the ones from above
+   and a set linked with their respective order file.
+   
+   From a gcc compiler build extract the command lines that link
+   the cc1* files.  Change the -o file to something else, for
+   example, cc1 to cc1X.  Then add the following options to the
+   link line.  Note, if you build using buildit and build_gcc the
+   lines will already be there referencing the order-files
+   directory.  Otherwise you need to add,
+   
+   -sectorder __TEXT __text $order/cc1*.order -e start
+   
+   where $order is the directory containing the order file being
+   used and cc1* is of course a reference to a specific order
+   file.
+   
+ 7. Measuring the performance improvement
+ 
+    You need to have two terminbal windows open; T1 and T2.  The
+    execute the follwoing commands on the indicated terminals:
+ 
+    T2: sudo fs_usage -w > /tmp/fs.out1  (do NOT execute yet)
+    T1: ~perf/bin/flushmem  (note this can take a while)
+    T2: fs_usage
+    T1: use cc1* line originally used to build order file
+    T2: ctl-c when cc1* compilation done
+    
+    T2: sudo fs_usage -w > /tmp/fs.out2  (do not execute yet)
+    T1: ~perf/bin/flushmem
+    T2: fs_usage
+    T1: use cc1*X line originally used to build order file
+    T2: ctl-c when cc1XXX compilation done
+    
+    In the first group of commands you use the original cc1* line
+    with the command line used to build the order file.  You also
+    run fs_usage at the same time to measure the paging behavior.
+    
+    The second group of commands is similar but you use the cc1*
+    linked with its order file (with the -sectorder stuff mentioned
+    in step 6).  For this discussion call this compiler cc1*X.
+    
+    In both cases you need to run ~perf/bin/flushmem to make sure
+    compiler is flused from the cached.  That way you are
+    measuring the initial page-in bechavor as thee compiler is
+    loaded.  Warning, the flushmem's sometimes take quite a
+    while.
+ 
+    At this point you should have /tmp/fs.out1 and /tmp/fs.out2.
+    You need to extract the page-ins times for the compilers in
+    order to sum them up.  The easies way to do this is make the
+    data tab delimited for importing into Excel.
+    
+    T1: fgrep cc1*  /tmp/fs.out1 | fgrep PAGE_IN | \
+        tr -s "[:blank:]" '\t' >/tmp/cc1*.pageins-1
+    T1: fgrep cc1*X /tmp/fs.out2 | fgrep PAGE_IN | \
+        tr -s "[:blank:]" '\t' >/tmp/cc1*.pageins-2
+    
+    Load each of these into Excel and sum the pagin times to 
+    determine the percent change.
+    
+    Remember that the '*'s in the above illustrations are not really
+    a '*'.  It is just a short way to show the general command lines
+    where in reality you explicitly specify cc1, cc1plus, cc1obj, or
+    cc1objplus.
+    
+    Also remember that you are measuring the page-in performance
+    improvement on the test cases used to generate the order files
+    in the first place.  Thus you should expect that these would
+    probably show the greatest improvement.  That is why it is
+    important to try to choose representative test cases in the
+    first place.  You can try the measurements on other tests.
+    But that requires you again extracting the cc1* lines using
+    -###.  The above procedure only uses the orignal files since
+    the command lines are already handy.
+
+   
+   
+   
+