Publications
Copyright Notice. The material below is presented to ensure
timely dissemination of scholarly and technical work. Copyright and all
rights therein are retained by authors or by other copyright holders.
All
persons copying this information are expected to adhere to the terms
and
constraints invoked by each author's or organization's copyright. In
most
cases, these works may not be reposted without the explicit permission
of the copyright holder.
JOURNAL PUBLICATIONS
- S. Carr and P. Sweany. "An Experimental Evaluation of Scalar
Replacement on
Scientific Benchmarks", Software - Practice & Experience
33(15), December 2003.
- S. Carr, J. Mayo and C.-K. Shene. "ThreadMentor: A Pedagogical
Tool for Multithreaded Programming", ACM Journal of Educational
Resources in
Computing 3(1), March 2003.
- S. Carr, J. Mayo and C.-K. Shene. Race
Conditions:
A Case Study, The Journal of Computing in Small Colleges
17(1),
September 2001.
-
P. Sweany, S. Carr and B.L. Huber, ``Global Instruction Scheduling
Without
Copies'',Digital Technical Journal 10(1), December 1998.
-
C.-K. Shene and S. Carr. ``The Design of a
Multithreaded
Programming Course and Its Accompanying Software Tools'', The
Journal
of Computing in Small Colleges 14(1), November 1998.
-
S. Carr and R.B. Lehoucq, ``Compiler
Blockability
of Dense Matrix Factorizations'',
ACM Transactions on Mathematical
Software 23(3), September 1997.
-
K. McKinley, S. Carr and C.-W. Tseng, ``Improving
Data Locality with Loop Transformations'', ACM Transactions on
Programming
Languages and Systems 18(4), July 1996.
-
S. Carr and K. Kennedy, ``Improving the
Ratio of Memory Operations to Floating-Point Operations in Loops'',
ACM Transactions on Programming Languages and Systems 16(6),
November
1994.
-
S. Carr and K. Kennedy, ``Scalar Replacement
in the Presence of Conditional Control Flow'', Software -
Practice
& Experience 24(1), January 1994.
CONFERENCE AND WORKSHOP PUBLICATIONS
- Y. Ma and S. Carr. "Register Pressure Guided Unroll-and-Jam", In The 2008 Open64 Workshop, April 6, 2008.
- C. Fang, S. Carr, S. Onder and Z. Wang. "Feedback-directed Memory Disambiguation Through Store Distance Analysis", In
Proceedings
of the 20th ACM International Conference on Supercomputing,
Queensland, Australia, June 2006.
- C. Fang, S. Carr, S. Onder and Z. Wang. "Path-based Reuse Distance Analysis", In
Proceedings
of the 15th International Conference on Compiler Construction,
Vienna, Austria, March 2006.
- C. Fang, S. Carr, S. Onder and Z. Wang. "Instruction Based Memory
Distance Analysis and Its Application to Optimization", In
Proceedings
of the Fourteenth ACM/IEEE International Conference on Parallel Architectures
and Compilation Techniques,
St. Louis, MO, September 2005.
- P. Zhou, S. Onder and S. Carr. "Fast Branch Misprediction Recovery in
Out-of-order Superscalar Processors", In
Proceedings
of the 2005 ACM International Conference on Supercomputing,
Boston, MA, June 2005.
- S. Carr and S. Onder. "A Case for a Working-set-based Memory
Hierarchy", In Proceedings
of the 2005 ACM International Conference on Computing Frontiers,
Ischia, Italy, May 2005.
- S. Carr and P. Sweany. "Automatic
Data Partitioning for the Agere Payload Plus Network Processor", In
Proceedings of the ACM/IEEE
2004 International Conference on Compilers, Architecture and Synthesis
for Embedded Systems, Washington, D.C., September 2004.
- Y. Ma, S. Carr and R. Ge. "Low-cost
Register-pressure Prediction for
Scalar Replacement using Pseudo-schedules", In Proceedings of
the 2004 Interational Conference on Parallel Processing, Montreal,
Canada,
August 15-18, 2004.
- C. Fang, S. Carr, S. Onder and Z. Wang. "Reuse-distance-based Miss-rate Prediction on
a Per Instruction Basis", In Proceedings of the 2004 ACM
Workshop on Memory System Performance, Washington, D.C., June 2004.
- D. Callahan, S. Carr and K. Kennedy. "
Retrospective: Improving Register Allocation for Subscripted Variables",
In 20 Years of the ACM SIGPLAN Conference on Programming Language
Design
and Implementation (1979 - 1999): A Selection, Kathryn S. McKinley,
Editor, ACM SIGPLAN Notices, Volume 39, Number 4, April 2004.
- S. Carr, C. Fang, T. Jozwowski, J. Mayo and C.-K. Shene. "ConcurrentMentor:
A Visualization System for Distributed Programming Education", In
Proceedings of the 2003 International Conference on Parallel and
Distributed
Processing Techniques and Applications, Las Vegas, NV, June 23-26,
2003.
- P. Sweany and S. Carr. "Building a C Compiler Retargetable for
DSP
Processors", In The 1st Workshop on Optimizations for
DSP and Embedded Systems (ODES), San Francisco, California, March
2003.
- Y. Qian, S. Carr and P. Sweany. "Optimizing
Loop Performance for Clustered VLIW Architectures", In Proceedings
of the Eleventh IEEE International Conference on Parallel Architectures
and Compiler Techniques (PACT-2002), Charlottesville, Virginia,
September
22-25, 2002.
-
Y. Qian, S. Carr and P. Sweany. "Loop
Fusion
for Clustered VLIW Architectures", In Proceedings of the ACM
2002
Joint Conference on Languages, Compilers and Tools for Embedded Systems
and Software and Compilers for Embedded Systems, Berlin, Germany,
June
19-21, 2002.
-
S. Carr, P. Chen, T. Jozwowski, J. Mayo and C.-K. Shene. "Channels,
Visualization and Topology Editor", In Proceedings of the
Seventh
Annual ACM SIGCSE Conference on Innovation and Technology in Computer
Science
Education, Aarhus, Denmark, June 24-26, 2002.
-
D. Sule, S. Carr, and P. Sweany. "Evaluating
Register Partitioning with Genetic Algorithms", In Proceedings
of
the Fourth International Conference on Massively Parallel Computing
Systems,
Ischia, Italy, April 2002.
-
S. Carr. C.Fang, T. Jozwowski, J. Mayo and C.-K. Shene. "A
Communication Library to Support Concurrent Programming Courses",
In
Proceedings of the 33rd ACM SIGCSE Technical
Symposium on
Computer Sceince Education, Northern Kentucky, February 2002.
-
X. Huang, S. Carr and P. Sweany. "Loop
Transformations for Architectures with Partitioned Register Banks",
In Proceedings of the 2001 ACM Workshop on Languages, Compilers and
Tools for Embedded Systems (LCTES '2001), Snowbird, Utah, June
22-23,
2001.
-
M. Bedy, S. Carr, S. Onder and P. Sweany. "Improving
Software Pipelining by Hiding Memory Latency with Combined Loads and
Prefetches",
In Interaction between Compilers and Computer Architectures, G.
Lee and P.-C. Yew ed., Kluwer Academic Publishers, 2001.
-
J. Hiser, S. Carr and P. Sweany. "Global
Register Partitioning", In Proceedings of the 2000 IEEE
International
Conference on Parallel Architectures and Compiler Techniques,
Philadelphia,
PA, October 15-19, 2000.
-
S. Carr and C.-K. Shene. ``A Portable
Class Library for Teaching Multithreaded Programming''. In Proceedings
of the Fifth ACM SIGCSE Annual Conference on Innovation and Technology
in Computer Science Education, Helsinki, Finland, July 11-13, 2000.
-
J. Hiser, S. Carr, P. Sweany, and S.J. Beaty. ``Register
Assignment for Software Pipelining with Partitioned Register Banks''.
In Proceedings of the 2000 IEEE International Parallel and
Distributed
Processing Symposium, Cancun, Mexico, May 1-4, 2000.
-
M.J. Bedy, S. Carr, X. Huang and C.-K. Shene. ``A
Visualization System for Multithreaded Programming'', In Proceedings
of the 31st ACM SIGCSE Technical Symposium on Computer Science Education,
Austin, TX, March 8-12, 2000.
-
M.J. Bedy, S. Carr, X. Huang and C.-K. Shene. ``The
Design and Construction of a User-Level Kernel for Teaching
Multithreaded
Programming'', In Proceedings of the 1999 ASEE/IEEE Frontiers
in
Education, San Juan, Puerto Rico, November 10-13, 1999.
-
S. Jang, S. Carr, P. Sweany, and D. Kuras, ``A
Code Generation Framework for VLIW Architectures with Partitioned
Register
Files''. In Proceedings of the Third International Conference
on
Massively Parallel Computing Systems, Colorado Springs, Colorado,
April
1998.
-
D. Kuras, S. Carr and P. Sweany. ``Value
Cloning for Architectures with Partitioned Register Banks'', In The
1998 Workshop on Compiler Support for Embedded Systems (CASES98),
Washington
D.C., December 1998.
-
S. Carr and P. Sweany. ``Improving
Software
Pipelining with Hardware Support for Self-Spatial Loads'', In Proceedings
of the Third Workshop on Interaction between Compilers and Computer
Architecture
(INTERACT-3), San Jose, CA, October 1998.
-
S. Carr and Y. Guan. ``Unroll-and-Jam
Using
Uniformly Generated Sets'', In Proceedings of the 30th IEEE
International
Symposium on Microarchitecture (MICRO-30), Research Triangle Park
NC,
December 1997.
-
C. Ding, S. Carr, and P. Sweany. ``Modulo
Scheduling
with Cache-Reuse Information'',
Lecture Notes in Computer Science
1300, Springer-Verlag, Proceedings of Europar 97, Passau, Germany,
August 1997.
-
S. Carr. ``Combining Optimization for Cache
and Instruction-Level Parallelism'', In Proceedings of the 1996
IEEE International Conference on Parallel Architectures and Compiler
Techniques
(PACT 96), Boston MA, October 1996.
-
S. Carr, C. Ding and P. Sweany, ``Improving
Software Pipelining with Unroll-and-Jam'', In Proceedings of
the
Twenty-Ninth Annual Hawaii International Conference on System Sciences,
Maui HI, January 1996, pp. 183-192.
-
T. Brasier, P. Sweany, S. Beaty and S. Carr, ``CRAIG:
A Practical Framework for Combining Instruction Scheduling and Register
Assignment'', In Proceedings of the 1995 IEEE International
Conference
on Parallel Architectures and Compilation Techniques (PACT 95),
Cyprus,
June 1995.
-
S. Carr and R.B. Lehoucq, ``A Compiler
Blockable
Algorithm for QR Decomposition'', In Proceedings of the 7th
SIAM
Conference on Parallel Processing for Scientific Computing, San
Francisco
CA, February 1995.
-
S. Carr, K.S. McKinley and C-W. Tseng, ``Compiler
Optimizations for Improving Data Locality'', In Proceedings of
the
Sixth ACM International Conference on Architectural Support for
Programming
Languages and Compilers (ASPLOS-VI), San Jose CA, October 1994.
-
S. Carr and K. Kennedy, ``Compiler
Blockability
of Numerical Algorithms'', In Proceedings of Supercomputing '92',
Minneapolis MN, November 1992.
-
S. Carr and K. Kennedy, ``Compiling
Scientific
Code for Complex Memory Hierarchies'', In Proceedings of the
Twenty-Fourth
Annual Hawaii International Conference on System Sciences, Kauai
HI,
January 1991.
-
S. Carr, D. Callahan and K. Kennedy, ``Improving
Register Allocation for Subscripted Variables'', In Proceedings
of the ACM SIGPLAN 1990 Conference on Programming Language Design and
Implementation
(PLDI 90), White Plains NY, June 1990.
-
S. Carr and K. Kennedy, ``Blocking Linear
Algebra Codes for Memory Hierarchies'', In Proceedings of the
Fourth
SIAM Conference on Parallel Processing for Scientific Computing,
Chicago
IL, December 1989.
Ph.D. Thesis
-
S. Carr. Memory Hierarchy Management, PhD Thesis, Rice University, September 1992.