Memory system design
Many of these articles are copyrighted by the ACM, IEEE, Springer, and the
U.S. Government. Copyright notices are included in each document, where
applicable. ACM grants gratis permission for individual digital or hard
copies made without fee for use in academic classrooms and for use by
individuals in personal research and study. No further reproduction or
distribution is allowed without explicit permission. The IEEE specifically
states "copies must be made in compliance with Section 108 of the Copyright
Act of the U.S. and with guidelines developed by the National Commission
on New Technological Uses of Copyrighted Works (CONTU Guidelines), the
text of which is available as part of U.S. Copyright Office Circular 21".
Journals
- Data Cache Techniques to Save Power and Deliver High Performance in Embedded Systems
M. Bhadauria, S.A. McKee, K. Singh, G.S. Tyson
Transactions on High Performance Embedded Architectures and Compilers (HiPEAC), vol. 2, no. 1, 2007, pages 62-81.
gzipped postscript
or pdf
- Specializing Cache Structures for High Performance and Energy Conservation in Embedded Systems
M.J. Geiger, S.A. McKee, G.S. Tyson
Transactions on High Performance Embedded Architectures and Compilers (HiPEAC), vol. 1, no. 1, 2007, pages 50-90.
gzipped postscript
or pdf
- The Impulse Memory Controller
L. Zhang, Z. Fang, M. Parker, B.K. Mathew, L. Schaelicke, J.B. Carter, W.C. Hsieh, S.A. McKee
IEEE Transactions on Computers, vol. 50, no. 11, November 2001, pp. 1117-1132.
gzipped postscript
or pdf
- Dynamic Access Ordering for Streamed Computations
S.A. McKee, Wm.A. Wulf, J.H. Aylor, R.H. Klenke, M.H. Salinas, S.I. Hong, D.A.B. Weikle
IEEE Transactions on Computers, vol. 49, no. 11, November 2000, pp. 1255-1271.
gzipped postscript
or pdf
- Impulse: Memory System Support for Scientific Applications
J.B. Carter, W.C. Hsieh, L.B. Stoller, M.R. Swanson, L. Zhang, S.A. McKee
Wiley Journal of Scientific Programming, vol. 7, no. 3-4, fall 1999, pp. 195-209.
gzipped postscript
or pdf
- Smarter Memory: Improving Bandwidth for Streamed References
S.A. McKee, R.H. Klenke, K.L. Wright, Wm.A. Wulf, M.H. Salinas, J.H. Aylor, A.P. Batson
IEEE Computer, July 1998, pp. 54-63.
gzipped postscript
or pdf
Refereed Conferences
- Leveraging High Performance Data Cache Techniques to Save Power in Mobile Embedded Systems
M. Bhadauria, S.A. McKee, K. Singh, G. Tyson
Proc. International Conference on High Performance Embedded Architectures and Compilers, Ghent, BE, February 2007.
gzipped postscript
or pdf
- A Precisely Tunable Drowsy Cache Management Mechanism
M. Bhadauria, S.A. McKee, K. Singh, G. Tyson
Proc. Watson Conference on Interaction between Architecture, Circuits, and Compilers (P=AC^2), Yorktown Heights, NY, October 2006.
gzipped postscript
or pdf
- Beyond Basic Region Caching: Specializing Cache Structures for High Performance and Energy Conservation
M.J. Geiger, S.A. McKee, G.S. Tyson
Proc. International Conference on High Performance Embedded Architectures and Compilers (HiPEAC'05), Barcelona, ES, November 2005.
gzipped postscript
or pdf
- Drowsy Region-Based Caches: Minimizing Both Dynamic and Static Power Dissipation
M.J. Geiger, S.A. McKee, G.S. Tyson
Proc. Computing Frontiers (CF'05), Ischia, IT, April 2005.
gzipped postscript
or pdf
- Reflections on the Memory Wall (invited paper)
S.A. McKee
Proc. Computing Frontiers (CF'04), Ischia, IT, April 2004.
gzipped postscript
or pdf
- Reevaluating Online Superpage Promotion with Hardware Support
Z. Fang, L. Zhang, J.B. Carter, W.C. Hsieh, S.A. McKee
Proc. of the Seventh International Symposium on High Performance Computer Architecture (HPCA-7), Monterrey, MX, January 2001.
gzipped postscript
or pdf
- Caches as Filters: A Framework for the Analysis of Caching Systems
D.A.B. Weikle, S.A. McKee, K. Skadron, Wm.A. Wulf
Proc. 3rd Grace Hopper Celebration of Women in Computing (GHC'00), Hyannis, MA, September 2000.
gzipped postscript
or pdf
- Algorithmic Foundations for a Parallel Vector Access Memory System
B.K. Mathew, S.A. McKee, J.B. Carter, A. Davis
Proc. Symposium on Parallel Algorithms and Architectures (SPAA'00), Bar Harbor, ME, July 2000.
gzipped postscript
or pdf
- Hardware-Only Stream Prefetching and Dynamic Access Ordering
C. Zhang, S.A. McKee
Proc. 14th International Conference on Supercomputing (ICS'00), Santa Fe, NM, May 2000.
gzipped postscript
or pdf
- Design of a Parallel Vector Access Unit for SDRAM Memory Systems
B.K. Mathew, S.A. McKee, J.B. Carter, A. Davis
Proc. Sixth International Symposium on High Performance Computer Architecture (HPCA-6), Toulouse, FR, January 2000.
gzipped postscript
or pdf
- Memory System Support for Image Processing
L. Zhang, J.B. Carter, W.C. Hsieh, S.A. McKee
Proc. 1999 International Conference on Parallel Architectures and Compilation Techniques (PACT'99), Newport Beach, CA, October 1999.
gzipped postscript
or pdf
- Access Order and Effective Bandwidth for Streams on a Direct Rambus Memory
S.I. Hong, S.A. McKee, M.H. Salinas, R.H. Klenke, J.H. Aylor, Wm.A. Wulf
Proc. Fifth International Symposium on High Performance Computer Architecture (HPCA-5), Orlando, FL, January 1999.
gzipped postscript
or pdf
- Caches As Filters: A New Approach to Cache Analysis
D.A.B. Weikle, S.A. McKee, Wm.A. Wulf
Proc. Sixth International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS'98), Montreal, Quebec, July 1998. (best paper award)
gzipped postscript
or pdf
- Design and Evaluation of Dynamic Access Ordering Hardware
S.A. McKee, A. Aluwihare, B.H. Clark, R.H. Klenke, T.C. Landon, C.W. Oliver, M.H. Salinas, A.E. Szymkowiak, K.L. Wright, Wm.A. Wulf, J.H. Aylor
Proc. 10th ACM International Conference on Supercomputing (ICS'96), Philadelphia, PA, May 1996.
gzipped postscript
or pdf
- A Memory Controller for Improved Performance of Streamed Computations on Symmetric Multiprocessors
S.A. McKee, Wm.A. Wulf
Proc. 10th International Parallel Processing Symposium (IPPS'96), Honolulu, HI, April 1996.
gzipped postscript
or pdf
- Bounds on Memory Bandwidth in Streamed Computations
S.A. McKee, Wm.A. Wulf, T.C. Landon
Proc. Europar'95, Stockholm, Sweden, August 1995. Also Lecture Notes in Computer Science 966, Springer-Verlag, Berlin, 1995.
gzipped postscript
or pdf
- Access Ordering and Memory-Conscious Cache Utilization
S.A. McKee, Wm.A. Wulf
Proc. First IEEE Symposium on High Performance Computer Architecture (HPCA), Raleigh, NC, January 1995.
gzipped postscript
or pdf
- Increasing Memory Bandwidth for Vector Computations
S.A. McKee, S.A. Moyer, Wm.A. Wulf, C.Y. Hitchcock
Proc. Programming Languages and Systems Architectures (PLSA), Zurich, Switzerland, March 1994. Also Lecture Notes in Computer Science 782, Springer-Verlag, Berlin.
- Experimental Implementation of Dynamic Access Ordering
S.A. McKee, R.H. Klenke, A.J. Schwab, Wm.A. Wulf, S.A. Moyer, C. Hitchcock, J.H. Aylor
Proc. IEEE 27th Hawaii International Conference on Systems Sciences (HICSS-27), Maui, HI, January 1994.
gzipped postscript
or pdf
Refereed Workshops
Other
- Evaluation of Dynamic Access Ordering Hardware
S.A. McKee, D.A.B. Weikle, K.L. Wright, C.W. Oliver, A.P. Voss, M.H. Salinas, R.H. Klenke, T.C. Landon, Wm.A. Wulf, J.H. Aylor
UVa Technical Report CS-95-50, October 1995.
(This report describes version 1 of the SMC ASIC. See the ICS'96 paper, above, for the version 2 results.)
gzipped postscript (24 pages, 241KB compressed)
or pdf
- Maximizing Memory Bandwidth for Streamed Computations
S.A. McKee
Ph.D. dissertation, School of Engineering and Applied Science, University of Virginia, May 1995.
gzipped postscript or pdf
- Hitting the Memory Wall: Implications of the Obvious
Wm.A. Wulf, S.A. McKee
Computer Architecture News, 23(1):20-24, March 1995. Also UVa Technical Report CS-94-48, December 1994. This is the article that introduced the term "Memory Wall". It makes several simplifying assumptions, but the point that memory is the bottleneck for many applications finally hit home for many readers. Workshops have been named after this note (and the first met on my birthday --- how's that for coincidence?).
gzipped postscript or pdf
- Hardware Support for Access Ordering: Performance of Some Design Options
S.A. McKee
UVa Technical Report CS-93-08, August 1993. (This is a huge TR that looks at many, many ordering algorithms for a uniprocessor SMC and DRAM technology that was current at the time. Results are summarized in my dissertation. This work predates the media processor "access scheduling" work of Rixner et al., but targets a different application arena.)
gzipped postscript
or pdf