<HR WIDTH = "100%">
<A NAME="edu10"></A><B><FONT SIZE=+1>CS 614 ADVANCED COMPILERS

<P> SIC 304, Slot 2: Monday 9.30-10.25, Tuesday 10.35-11.30, Thursday 11.35-12.30
</FONT></B>
<P> TA: Ambika Agarwal (ambika@cse).
<HR WIDTH = "100%">

<P>

<B> Course coverage:</B>


<P>
	This course will cover the following topics: <P>

&nbsp &nbsp	1. Code optimization <BR>
&nbsp   &nbsp      2. Data flow analysis <BR>
&nbsp	 &nbsp 3. Register assignment <BR>
&nbsp &nbsp 4. Vectorization and parallelization of code
&nbsp	 &nbsp 5. Generation of high quality target code <BR>
&nbsp	 &nbsp 6. Selected current topics, e.g., use of execution-time information in optimization and in debugging of optimized code.<BR>
<P>
     For each topic, we cover the classical techniques and a few
research papers.

<P>
	The course will contain homework assignments (with very little 
weightage), group projects, individual paper
reading & presentation assignments, and exams. 

<P>
	The primary sources for the FIRST FEW lectures would be:

<P>
&nbsp &nbsp 1. The Aho-Lam-Sethi-Ullman Compilers book.
<P>
&nbsp &nbsp 2. Slides on  <A HREF="allopt.ps"> code optimization and data flow analysis</A>
(<A HREF="allopt.pdf"> pdf version.</A>)
<P>
&nbsp &nbsp 3. Slides on <A HREF="new_PRE_slides.ppt"> partial redundancy elimination</A>&nbsp;
and a brief paper on <A HREF="corrected_sigplan_paper.pdf"> E-path based PRE
</A>&nbsp;
<P>
&nbsp &nbsp 4. <I> Advanced Compiler Design &
Implementation,</I> Steven S Muchnick, Harcourt Asia/Morgan Kaufmann, 1997.

<BR>

<P>

<B> Honesty policy </B>

<P>

Unless otherwise specified, students are both permitted and encouraged to
discuss assignment problems but each person must write his/her own assignment
completely independently.

<P>

Discussion of project ideas is also permitted and encouraged, but looking
over other persons' code is forbidden. 


<P>
<B>  Log of lectures (compiled by Jojumon Kavalan)</B>

<P>Lecture 2 & 3 - 5/01/10 & 7/01/10 <BR>
Introduction to code optimization - kinds of optimization, levels of optimization <BR>
Optimizing transformations - compile time evaluation, CSE, constant propagation, variable propagation, code movement optimization,<BR> loop optimization<BR>
Safety of code movement

<P>Lecture 4 - 11/01/10 <BR>
Strength reduction, loop test replacement, Dead code elimination <BR>
DAG based local optimization

<P>Lecture 5 - 12/01/10 <BR>
Abstract syntax tree, triples, quadruples.<BR> 
Global optimization: Control flow analysis---dominators and post dominators. Data flow analysis---available expressions

<P>Lecture 6 - 14/01/10<BR>
Data flow property, Data flow equations, distributivity of DF problem.

<P>Lecture 7 - 18/01/10 <BR>
Separability of DF solution, lattice theory, meet over path, fixed point and maximum fixed point.<BR>
Reaching definitions

<P>Lecture 8 - 19/01/10 <BR>
Round-robin iterative data flow analysis, monotonicity and convergence of iterative DFA, complexity of data flow analysis. <BR> 
Worklist based iterative technique

<P>Lecture 9 - 21/01/10 <BR>
Complexity of worklist approach.<BR> 
Reaching definitions, live variables, busy expressions, very busy expressions 
Introduction to register assignment.

<P>Lecture 10 - 25/01/10 <BR>
Live range, Chow-Hennessy register allocation. 

<P>Lecture 11 - 28/01/10  <BR>
Interference graph, Live range splitting in Chow-Hennessy approach.<BR>
Static single assignment(SSA), def-use chain.

<P>Lecture 12 - 01/02/10 <BR>
Sparse simple constant propagation, conditional constant propagation, sparse conditional constant propagation.<BR>

<P>Lecture 13 - 02/02/10<BR>
Sparse Conditional Constant.<BR> 
Conversion to the SSA form:
Dominance frontier, placement of phi function, renaming.<BR>
Control dependence. 

<P>Lecture 14 - 04/02/10<BR>
Tutorial on data flow analysis for CSE.<BR>
Register allocation paper by Briggs.

<P>Lecture 15 - 08/02/10<BR>
Register Allocation paper by Briggs (Contd.)<BR>
Introduction to partial redundancy elimination (PRE).

<P>Lecture 16 - 09/02/10<BR>
Epath-PRE.

<P>Lecture 17 - 22/02/10<BR>
Advanced compiler optimization for supercomputers (Padua and Wolfe paper).<BR>
Advanced architectures - vector instructions, multi-core systems
Data dependence -  Flow/true, anti, and output dependences.<BR>
Loop independent and loop carried dependences.  Control dependence.

<P>Lecture 18 - 23/02/10<BR>
Tutorial on topics covered in lecture 17. 
Effect of dependences in nested loops. 

<P>Lecture 19 - 25/02/10<BR>
Allen and Kennedy paper on Vector code generation.
Code improving transformations - variable renaming, loop interchanging

<P>Lecture 20 - 02/03/10<BR>
Code improving transformations contd - thresholding.<BR>
Parallelization of programs (based on chapter 11 in ALSU book)
Types of parallelization, issues with synchronization and communication, blocking to improve cache performance. SPMD.<BR> 
Basic affine transformations - fission, fusion, re-indexing, scaling.

<P>Lecture 21 - 04/03/10<BR>
Basic affine transformations contd- reversal, permutation, skewing. <BR>
Matrix formulation of inequalities. Self re-use, group-reuse, affine partitions.

<P>Lecture 22 - 08/03/10<BR>
Affine expressions.  Self spatial use, more on group re-use.  Basis vector of null space. <BR>
Example of affine space partition

<P>Lecture 23 - 09/03/10 <BR>
Example of affine space partition revisited and worked out

<P>Lecture 24 - 11/03/10 <BR>
Code generation - issues in code generation, Sethi-Ullman algorithm, introduction to Aho-Johnson algorithm

<P>Lecture 25 - 15/03/10 <BR>
Aho-Johnson algorithm contd - machine model, register re-arrangement theorem, contiguous programs

<P>Lecture 26 - 18/03/10 <BR>
Aho-Johnson algorithm contd - Cover algorithm, mark algorithm, code generation
Introduction to RISC architecture, pipelined machines and speculative load instruction machines

<P>Lecture 27 - 22/03/10 <BR>
Code scheduling constraints
Pipelined machines, VLIW machines, special instructions (prefetch, predicated instructions, etc)
Code scheduling (local)

<P>Lecture 28 - 23/03/10 <BR>
Code scheduling (local)contd. 
Global code scheduling - hoisting of instructions, sinking of instructions, global scheduling algorithm, control equivalence, dominated successor
Aho-Johnson algorithm revisited

<P>Lecture 29 - 25/03/10 <BR>
Software pipelining - machine model, inter iteration interval, bound on inter iteration interval due to resource constraints and data dependence

<P>Lecture 30 - 30/3/10 <BR>
Software pipelining contd. - Scheduling algorithm for acyclic graph, example

<P>Lecture 31 - 05/04/10 <BR>
Software pipelining example contd., scalar expansion

<P>Lecture 32 - 06/04/10 <BR>
Alias analysis - may alias, must alias, flow sensitive alias, flow insensitive alias

<P>Lecture 33 - 08/04/10 <BR>
Discussion on Aho-Johnson algorithm