SC18 Proceedings


Overview | By Event Type | By Tag | Author Index

A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | R | S | T | U | V | W | X | Y | Z

A
Abramson, David · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Agarwal, Deborah · moreDac-Man: Data Change Management for Scientific Datasets on HPC Systems · pdf
Aiken, Alex · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Alam, Sadaf R. · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
Almgren, Ann S. · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf
Aluru, Srinivas · moreOptimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting · pdf
Alvarez, Lluc · moreRuntime-Assisted Cache Coherence Deactivation in Task Parallel Programs · pdf
Amer, Abdelhalim · moreLessons Learned from Analyzing Dynamic Promotion for User-Level Threading · pdf
Amvrosiadis, George · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Andreadis, Georgios · moreA Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments · pdf
Anwar, Ali · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Aoyama, Toshikazu · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Appelhans, David · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Arndt, Bill · moreExtreme Scale De Novo Metagenome Assembly · pdf
Arnemann, James · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Atchley, Scott · moreGPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan · pdf
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Avancha, Sasikanth · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf

B
Baden, Scott · moreDoomsday: Predicting Which Node Will Fail When on Supercomputers · pdf
Balaji, Pavan · moreCharacterization of MPI Usage on a Production Supercomputer · pdf
Lessons Learned from Analyzing Dynamic Promotion for User-Level Threading · pdf
Balmana, Marc Gamell · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Banerjee, Kunal · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Bard, Deborah · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Baseman, Elisabeth · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Bauer, Gregory H. · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Bauer, Michael · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Bayatpour, M. · moreCooperative Rendezvous Protocols for Improved Performance and Overlap · pdf
Belviranli, Mehmet E. · moreDRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access · pdf
Berkowitz, Evan · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Bertsch, Adam · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Bhatele, Abhinav · moreEvaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters · pdf
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Bianco, Mauro · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
Biros, George · moreDistributed-Memory Hierarchical Compression of Dense SPD Matrices · pdf
Blackmore, Robert · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Bland, Arthur S. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Blumrich, Matthias A. · moreExploiting Idle Resources in a High-Radix Switch for Supplemental Storage · pdf
Bode, Brett · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Bollhöfer, Matthias · moreDistributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures · pdf
Boushehrinejadmoradi, Nader · moreA Parallelism Profiler with What-If Analyses for OpenMP Programs · pdf
Buluc, Aydin · moreExtreme Scale De Novo Metagenome Assembly · pdf
Byna, Suren · moreA Year in the Life of a Parallel File System · pdf

C
Caheny, Paul · moreRuntime-Assisted Cache Coherence Deactivation in Task Parallel Programs · pdf
Carns, Philip · moreA Year in the Life of a Parallel File System · pdf
Casas, Marc · moreRuntime-Assisted Cache Coherence Deactivation in Task Parallel Programs · pdf
Casses, Ben · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Chakaravarthy, Venkatesan · moreHigh-Performance Dense Tucker Decomposition on GPU Clusters · pdf
Chakraborty, S. · moreCooperative Rendezvous Protocols for Improved Performance and Overlap · pdf
Chambreau, Chris · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Chang, Chia Cheng · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Chang, Chun-Kai · moreEvaluating and Accelerating High-Fidelity Error Injection for HPC · pdf
Chen, Bingwei · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Chen, Dexun · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Chen, Jieyang · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Chen, Wenguang · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Chen, Xiaofei · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Chen, Zizhong · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Cheng, Yue · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Cheshmi, Kazem · moreParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism · pdf
Chochia, George · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Choi, Jee · moreHigh-Performance Dense Tucker Decomposition on GPU Clusters · pdf
Choromanski, Krzysztof · moreAdaptive Anonymization of Data with b-Edge Covers · pdf
Chow, Edmond · moreAccelerating Quantum Chemistry with Vectorized and Batched Integrals · pdf
Chunduri, Sudheer · moreCharacterization of MPI Usage on a Production Supercomputer · pdf
Clark, M.A. · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Climer, Sharlee · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Cranor, Charles D. · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Cromey, Clara E. · moreMitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf

D
Das, Anwesha · moreDoomsday: Predicting Which Node Will Fail When on Supercomputers · pdf
Davis, Philip · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Davison, Gene · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
de Hoop, Maarten V. · moreComputing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver · pdf
de Supinski, Bronis R. · moreEnergy Efficiency Modeling of Parallel Applications · pdf
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
DeBardeleben, Nathan · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Demirci, Gökalp · moreA Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints · pdf
Dennison, Larry · moreLight-Weight Protocols for Wire-Speed Ordering · pdf
Dennison, Larry R. · moreExploiting Idle Resources in a High-Radix Switch for Supplemental Storage · pdf
DeRose, Luiz · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Deslippe, Jack · moreExascale Deep Learning for Climate Analytics · pdf
Dinh, Minh Ngoc · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Domke, Jens · moreMitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Dongarra, Jack · moreHarnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers · pdf
Douglis, Fred · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Du Bois, Kristof · moreMany-Core Graph Workload Analysis · pdf
Duan, Shaohua · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Duan, Xiaohui · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Durnov, Dmitry · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf

E
Eberle, Hans · moreLight-Weight Protocols for Wire-Speed Ordering · pdf
Eftekhari, Aryan · moreDistributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures · pdf
Egan, Rob · moreExtreme Scale De Novo Metagenome Assembly · pdf
Endrei, Mark · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Enos, Jeremy · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Erez, Mattan · moreEvaluating and Accelerating High-Fidelity Error Injection for HPC · pdf
Eyerman, Stijn · moreMany-Core Graph Workload Analysis · pdf
Ezell, Matthew A. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf

F
Fagnan, Kjiersten · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Farooqi, Muhammad Nufail · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf
Fatica, Massimiliano · moreExascale Deep Learning for Climate Analytics · pdf
Ferdous, S M · moreAdaptive Anonymization of Data with b-Edge Covers · pdf
Ferreira, Kurt B. · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Fryman, Joshua B. · moreMany-Core Graph Workload Analysis · pdf
Fu, Haohuan · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Fujita, Kohei · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf

G
Gambhir, Arjun · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Gan, Lin · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Ganger, Gregory R. · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Gao, Ping · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Garland, Michael · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Garzaran, Maria · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Geist, Al · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Georganas, Evangelos · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Extreme Scale De Novo Metagenome Assembly · pdf
Ghoshal, Devarshi · moreDac-Man: Data Change Management for Scientific Datasets on HPC Systems · pdf
Gibson, Garth A. · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Gila, Miguel · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
Goldstone, Robin · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Goltsman, Eugene · moreExtreme Scale De Novo Metagenome Assembly · pdf
Gonsiorowski, Elsa · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Gooding, Tom · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Grider, Gary · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Grinberg, Leopold · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Gu, Yizi · moreDynamic Data Race Detection for OpenMP Programs · pdf
Guan, Hui · moreExploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines · pdf
Guan, Qiang · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Guo, Danhao · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Guo, Fan · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Guo, Luanzheng · moreFlipTracker: Understanding Natural Error Resilience in HPC Applications · pdf
Guok, Chin · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf

H
Haidar, Azzam · moreHarnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers · pdf
Halappanavar, Mahantesh · moreAdaptive Anonymization of Data with b-Edge Covers · pdf
Han, Jingoo · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Hanson, Bill · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Hargrove, Paul · moreDoomsday: Predicting Which Node Will Fail When on Supercomputers · pdf
Hari, Siva Kumar Sastry · moreOptimizing Software-Directed Instruction Replication for GPU Error Detection · pdf
Harms, Kevin · moreCharacterization of MPI Usage on a Production Supercomputer · pdf
Hartner, Bill · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Hashmi, J. · moreCooperative Rendezvous Protocols for Improved Performance and Overlap · pdf
He, Conghui · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
He, Siyu · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Heinecke, Alexander · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Heirman, Wim · moreMany-Core Graph Workload Analysis · pdf
Henry, Greg · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Herbein, Stephen · moreEvaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters · pdf
Higham, Nicholas · moreHarnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers · pdf
Hittinger, Jeffrey · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Ho, Shirley · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Hoefler, Torsten · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Hoffmann, Henry · moreA Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints · pdf
Hofmeyr, Steven · moreExtreme Scale De Novo Metagenome Assembly · pdf
Hori, Muneo · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Houston, Michael · moreExascale Deep Learning for Climate Analytics · pdf
Hu, Yang · moreTriCore: Parallel Triangle Counting on GPUs · pdf
Huang, H. Howie · moreiSpan: Parallel Identification of Strongly Connected Components with Spanning Trees · pdf
TriCore: Parallel Triangle Counting on GPUs · pdf
Huang, Hai · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Huang, Hua · moreAccelerating Quantum Chemistry with Vectorized and Batched Integrals · pdf
Huang, Renfei · moreSP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition · pdf
Hur, Ibrahim · moreMany-Core Graph Workload Analysis · pdf
Hussain, Zaeem · morePartial Redundancy in HPC Systems with Non-Uniform Node Reliabilities · pdf

I
Ichimura, Tsuyoshi · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Iosup, Alexandru · moreA Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments · pdf
Isobe, Yoko · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Iwasaki, Shintaro · moreLessons Learned from Analyzing Dynamic Promotion for User-Level Threading · pdf

J
Jacobson, Daniel · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Jain, Nikhil · moreEvaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters · pdf
Mitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Jain, Surabhi · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Ji, Yuede · moreiSpan: Parallel Identification of Strongly Connected Components with Spanning Trees · pdf
Jiang, Nan · moreExploiting Idle Resources in a High-Radix Switch for Supplemental Storage · pdf
Jin, Chao · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Johnston, J. Travis · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Joubert, Wayne · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Attacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Joó, Bálint · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Justice, Amy · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf

K
Kaeli, David · morePRISM: Predicting Resilience of GPU Applications Using Statistical Methods · pdf
Kahle, Jim · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Kainer, David · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Kalamkar, Dhiraj · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Kaleem, Rashid · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Kalinin, Sergei V. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Kalra, Charu · morePRISM: Predicting Resilience of GPU Applications Using Statistical Methods · pdf
Kamil, Shoaib · moreParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism · pdf
Karlin, Ian · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Karna, Tuomas · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Karnowski, Thomas P. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Keahey, Kate · moreDynamically Negotiating Capacity Between On-Demand and Batch Clusters · pdf
Keckler, Stephen W. · moreOptimizing Software-Directed Instruction Replication for GPU Error Detection · pdf
Kelly, Nicholas · moreEvaluating and Accelerating High-Fidelity Error Injection for HPC · pdf
Khan, Arif · moreAdaptive Anonymization of Data with b-Edge Covers · pdf
Klasky, Scott · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Knight, Christopher · moreTopology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations · pdf
Kobayashi, Hiroaki · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Kolla, Hemanth · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Komatsu, Kazuhiko · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Kramer, William T. · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Kremer-Herman, Nathaniel · moreA Lightweight Model for Right-Sizing Master-Worker Applications · pdf
Kumar, Nalini · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Kumaran, Kalyan · moreCharacterization of MPI Usage on a Production Supercomputer · pdf
Kurth, Thorsten · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Exascale Deep Learning for Climate Analytics · pdf

L
Laguna, Ignacio · moreFlipTracker: Understanding Natural Error Resilience in HPC Applications · pdf
Lam, Michael O. · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Langer, Akhil · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Lathrop, Scott · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Le, Franck · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Lee, Dongyoon · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Lee, Seyong · moreDRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access · pdf
Lee, Victor · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Lee, Wonchan · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Leininger, Matthew L. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Letaief, Khaled Ben · moreSP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition · pdf
Leverman, Dustin · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Levy, Scott · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Li, Dong · moreRuntime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs · pdf
FlipTracker: Understanding Natural Error Resilience in HPC Applications · pdf
Li, Hongbo · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Li, Jiajia · moreHiCOO: Hierarchical Storage of Sparse Tensors · pdf
Li, Liandeng · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Li, Ruipeng · moreComputing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver · pdf
Li, Sihuan · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Li, Xiangyu · morePRISM: Predicting Resilience of GPU Applications Using Statistical Methods · pdf
Li, Yuxuan · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Liang, Xin · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Lim, Seung-Hwan · moreExploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines · pdf
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Lin, Heng · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Liu, Feng · moreDynamically Negotiating Capacity Between On-Demand and Batch Clusters · pdf
Liu, Hang · moreiSpan: Parallel Identification of Strongly Connected Components with Spanning Trees · pdf
TriCore: Parallel Triangle Counting on GPUs · pdf
Liu, Weiguo · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Liu, Xin · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Liu, Xing · moreHigh-Performance Dense Tucker Decomposition on GPU Clusters · pdf
Liu, Y. Jace · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Liu, Yuanlai · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Lloyd, Scott · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Lockwood, Glenn K. · moreA Year in the Life of a Parallel File System · pdf
Lowenthal, David K. · moreMitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Luehr, Nathan · moreExascale Deep Learning for Climate Analytics · pdf
Lym, Sangkug · moreEvaluating and Accelerating High-Fidelity Error Injection for HPC · pdf

M
Ma, Xiaosong · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
MacAuley, John · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Maddegedara, Lalith · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Mahesh, Ankur · moreExascale Deep Learning for Climate Analytics · pdf
Mahmoud, Abdulrahman · moreOptimizing Software-Directed Instruction Replication for GPU Error Detection · pdf
Malakar, Preeti · moreTopology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations · pdf
March, Don D. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Marincic, Ivana · moreA Divide and Conquer Algorithm for DAG Scheduling Under Power Constraints · pdf
Markthub, Pak · moreDRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access · pdf
Marroquin, Chris · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Martinasso, Maxime · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Maschhoff, Kristyn · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Mastenbroek, Fabian · moreA Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments · pdf
Matheson, Michael · moreExascale Deep Learning for Climate Analytics · pdf
Mathuriya, Amrita · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Matsuoka, Satoshi · moreDRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access · pdf
Maxwell, Don · moreGPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan · pdf
Maxwell, Don E. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
McCalpin, John D. · moreHPL and DGEMM Performance Variability on the Xeon Platinum 8160 Processor · pdf
McElvain, Ken · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
McMurtrie, Colin · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
McNally, Stephen · moreGPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan · pdf
Meadows, Lawrence · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Mehri Dehnavi, Maryam · moreParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism · pdf
Melhem, Rami · morePartial Redundancy in HPC Systems with Non-Uniform Node Reliabilities · pdf
Mellor-Crummey, John · moreDynamic Data Race Detection for OpenMP Programs · pdf
Mendes, Celso L. · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Mendygral, Pete · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Meng, Xiangxu · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Menon, Harshitha · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Mills Strout, Michelle · moreParSy: Inspection and Transformation of Sparse Matrix Computations for Parallelism · pdf
Misra, Sanchit · moreOptimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting · pdf
Mlakar, Daniel · morefaimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU · pdf
Mohror, Kathryn · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Moise, Diana · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Momose, Shintaro · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Moody, Adam · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Moretó, Miquel · moreRuntime-Assisted Cache Coherence Deactivation in Task Parallel Programs · pdf
Mudigonda, Mayur · moreExascale Deep Learning for Climate Analytics · pdf
Mueller, Frank · moreDoomsday: Predicting Which Node Will Fail When on Supercomputers · pdf
Munson, Todd · moreTopology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations · pdf
Musa, Akihiro · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf

N
Nagarakatte, Santosh · moreA Parallelism Profiler with What-If Analyses for OpenMP Programs · pdf
Nakajima, Kengo · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Naruse, Akira · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Newman, Harvey · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Nguyen, Tan · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf
Nicholson, Amy · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf

O
Ohmacht, Martin · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Oliker, Leonid · moreExtreme Scale De Novo Metagenome Assembly · pdf
Oral, Sarp H. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Orginos, Kostas · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Osei-Kuffuor, Daniel · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Ouyang, Kaiming · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf

P
Pabst, Hans · moreAnatomy of High-Performance Deep Learning Convolutions on SIMD Architectures · pdf
Pan, Tony C. · moreOptimizing High Performance Distributed Memory Parallel Hash Tables for DNA k-mer Counting · pdf
Panda, D. K. · moreCooperative Rendezvous Protocols for Improved Performance and Overlap · pdf
Pankajakshan, Ramesh · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Papka, Michael E. · moreTopology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations · pdf
Parashar, Manish · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Parker, Scott · moreCharacterization of MPI Usage on a Production Supercomputer · pdf
Patton, Robert M. · moreExploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines · pdf
167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Pearce, Roger · morePruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution · pdf
Peng, Ivy B. · moreSiena: Exploring the Design Space of Heterogeneous Memory Systems · pdf
Pennycook, Simon J. · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Phillips, Everett · moreExascale Deep Learning for Climate Analytics · pdf
Pittman, Randall · moreExploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines · pdf
Pizzano, Fernando · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Pollard, Samuel D. · moreEvaluation of an Interference-Free Node Allocation Policy on Fat-Tree Clusters · pdf
Pothen, Alex · moreAdaptive Anonymization of Data with b-Edge Covers · pdf
Potok, Thomas E. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Pouchet, Louis-Noel · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Poxon, Heidi · moreEnergy Efficiency Modeling of Parallel Applications · pdf
Prabhat, Mr · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Exascale Deep Learning for Climate Analytics · pdf
Previlon, Fritz · morePRISM: Predicting Resilience of GPU Applications Using Statistical Methods · pdf

R
R. Butt, Ali · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Ramakrishnan, Lavanya · moreDac-Man: Data Change Management for Scientific Datasets on HPC Systems · pdf
Rastello, Fabrice · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Rawat, Prashant Singh · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Reiz, Severin · moreDistributed-Memory Hierarchical Compression of Dense SPD Matrices · pdf
Ren, Jie · moreRuntime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs · pdf
Reza, Tahsin · morePruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution · pdf
Rinaldi, Enrico · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Ringenburg, Michael F. · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Ripeanu, Matei · morePruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution · pdf
Riteau, Pierre · moreDynamically Negotiating Capacity Between On-Demand and Batch Clusters · pdf
Rogers, James H. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Roman, Eric · moreDoomsday: Predicting Which Node Will Fail When on Supercomputers · pdf
Romero, Joshua · moreExascale Deep Learning for Climate Analytics · pdf
Rose, Derek C. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Rosenburg, Bryan · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Rountev, Atanas · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Rubin, Norman · morePRISM: Predicting Resilience of GPU Applications Using Statistical Methods · pdf

S
Saad, Yousef · moreComputing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver · pdf
Sadayappan, P. · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Sanders, Geoffrey · morePruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution · pdf
Sannikov, Alexander · moreFramework for Scalable Intra-Node Collective Operations Using Shared Memory · pdf
Sarkar, Vivek · moreDetecting MPI Usage Anomalies via Partial Program Symbolic Execution · pdf
Sato, Masayuki · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Schenk, Olaf · moreDistributed Memory Sparse Inverse Covariance Matrix Estimation on High-Performance Computing Architectures · pdf
Schmidt, Drew · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Schordan, Markus · moreADAPT: Algorithmic Differentiation Applied to Floating-Point Precision Tuning · pdf
Schulthess, Thomas C. · moreRM-Replay: A High-Fidelity Tuning, Optimization and Exploration Tool for Resource Management · pdf
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Schulz, Martin · moreFlipTracker: Understanding Natural Error Resilience in HPC Applications · pdf
Schuman, Catherine D. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Seidel, Hans-Peter · morefaimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU · pdf
Settlemyer, Bradley W. · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Sewall, Jason · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Sexton, James · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Shalf, John · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf
Shankar, Mallikarjun · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Shao, Lei · moreCosmoFlow: Using Deep Learning to Learn the Universe at Scale · pdf
Shen, Xipeng · moreExploring Flexible Communications for Streamlining DNN Ensemble Training Pipelines · pdf
Shi, Jia · moreComputing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver · pdf
Siddiqua, Taniya · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Sim, Hyogi · moreBESPOKV: Application Tailored Scale-Out Key-Value Stores · pdf
Sisneros, Roberto R. · moreBest Practices and Lessons from Deploying and Operating a Sustained-Petascale System: The Blue Waters Experience · pdf
Slaughter, Elliott · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Smith, Staci A. · moreMitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Snyder, Shane · moreA Year in the Life of a Parallel File System · pdf
Sridharan, Vilas · moreLessons Learned from Memory Errors Observed Over the Lifetime of Cielo · pdf
Steinberger, Markus · morefaimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU · pdf
Straatsma, Tjerk P. · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Subedi, Pradeep · moreStacker: An Autonomic Data Movement Engine for Extreme-Scale Data Staging-Based In Situ Workflows · pdf
Subramoni, H. · moreCooperative Rendezvous Protocols for Improved Performance and Overlap · pdf
Sukumaran-Rajam, Aravind · moreAssociative Instruction Reordering to Alleviate Register Pressure · pdf
Sullivan, Michael B. · moreEvaluating and Accelerating High-Fidelity Error Injection for HPC · pdf
Optimizing Software-Directed Instruction Replication for GPU Error Detection · pdf
Sun, Jimeng · moreHiCOO: Hierarchical Storage of Sparse Tensors · pdf

T
Tan, Li · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Tang, Xiongchao · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Tao, Dingwen · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Taura, Kenjiro · moreLessons Learned from Analyzing Dynamic Promotion for User-Level Threading · pdf
Thain, Douglas · moreA Lightweight Model for Right-Sizing Master-Worker Applications · pdf
Thiagarajan, Jayaraman J. · moreMitigating Inter-Job Interference Using Adaptive Flow-Aware Routing · pdf
Thomson, John · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Tomov, Stan · moreHarnessing GPU's Tensor Cores Fast FP16 Arithmetic to Speedup Mixed-Precision Iterative Refinement Solvers · pdf
Tovar, Benjamin · moreA Lightweight Model for Right-Sizing Master-Worker Applications · pdf
Treichler, Sean · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Exascale Deep Learning for Climate Analytics · pdf
Tripoul, Nicolas · morePruneJuice: Pruning Trillion-Edge Graphs to a Precise Pattern-Matching Solution · pdf
Tritt, Andrew · moreExtreme Scale De Novo Metagenome Assembly · pdf
Tsai, Timothy · moreOptimizing Software-Directed Instruction Replication for GPU Error Detection · pdf
Tumeo, Antonino · moreAdaptive Anonymization of Data with b-Edge Covers · pdf

U
Unat, Didem · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf

V
Valero, Mateo · moreRuntime-Assisted Cache Coherence Deactivation in Task Parallel Programs · pdf
Vazhkudai, Sudharshan S. · moreGPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan · pdf
The Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Vergara Larrea, Veronica G. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Versluis, Laurens · moreA Reference Architecture for Datacenter Scheduling: Design, Validation, and Experiments · pdf
Vetter, Jeffrey S. · moreSiena: Exploring the Design Space of Heterogeneous Memory Systems · pdf
DRAGON: Breaking GPU Memory Capacity Limits with Direct NVM Access · pdf
Vishwanath, Venkatram · moreTopology-Aware Space-Shared Co-Analysis of Large-Scale Molecular Dynamics Simulations · pdf
Vranas, Pavlos · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Vuduc, Richard · moreHiCOO: Hierarchical Storage of Sparse Tensors · pdf

W
Walker-Loud, André · moreSimulating the Weak Death of the Neutron in a Femtoscale Universe with Near-Exascale Computing · pdf
Walkup, Bob · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Wan, Wubin · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Wang, Chenyu · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Wang, Feiyi · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Wang, Teng · moreA Year in the Life of a Parallel File System · pdf
Wang, Wei · moreSP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition · pdf
Wang, X. Tony · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Warszawski, Todd · moreDynamic Tracing: Memoization of Task Graphs for Dynamic Task-Based Runtimes · pdf
Watanabe, Osamu · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Watson, Py · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Weems, Lance D. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Wei, Yanwen · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Weighill, Deborah · moreAttacking the Opioid Epidemic: Determining the Epistatic and Pleiotropic Genetic Architectures for Chronic Pain and Opioid Addiction · pdf
Weissman, Jon · moreDynamically Negotiating Capacity Between On-Demand and Batch Clusters · pdf
Wells, Jack C. · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Winter, Martin · morefaimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU · pdf
Wright, Nicholas J. · moreA Year in the Life of a Parallel File System · pdf
Wu, Kai · moreRuntime Data Management on Non-Volatile Memory-Based Heterogeneous Memory for Task-Parallel Programs · pdf
Wu, Panruo · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf

X
Xi, Yuanzhe · moreComputing Planetary Interior Normal Modes with a Highly Parallel Polynomial Filtering Eigensolver · pdf
Xiang, Qiao · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Xu, Jingfang · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Xue, Wei · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
ShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf

Y
Yamaguchi, Takuma · moreA Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Yang, Guangwen · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Large-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Redesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Yang, Y. Richard · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Ye, Fangke · moreDetecting MPI Usage Anomalies via Partial Program Symbolic Execution · pdf
Yelick, Katherine · moreExtreme Scale De Novo Metagenome Assembly · pdf
Yin, Junqi · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
Yoga, Adarsh · moreA Parallelism Profiler with What-If Analyses for OpenMP Programs · pdf
Yokokawa, Mitsuo · morePerformance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA · pdf
Young, Steven R. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Yu, Bowen · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Yu, Chenhan D. · moreDistributed-Memory Hierarchical Compression of Dense SPD Matrices · pdf
Yu, Teng · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Yu, Yinghao · moreSP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition · pdf

Z
Zayer, Rhaleb · morefaimGraph: High Performance Management of Fully-Dynamic Graphs Under Tight Memory Constraints on the GPU · pdf
Zhang, J. Jensen · moreFine-Grained, Multi-Domain Network Resource Abstraction as a Fundamental Primitive to Enable High-Performance, Collaborative Data Sciences · pdf
Zhang, Jun · moreSP-Cache: Load-Balanced, Redundancy-Free Cluster Caching with Selective Partition · pdf
Zhang, Lufei · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Zhang, Meng · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Zhang, Tingjian · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Zhang, Wei · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Simulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Zhang, Weiqun · morePhase Asynchronous AMR Execution for Productive and Performant Astrophysical Flows · pdf
Zhang, Wenqiang · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Zhang, Wusheng · moreRedesigning LAMMPS for Petascale and Hundred-Billion-Atom Simulation on Sunway TaihuLight · pdf
Zhang, Zhenguo · moreSimulating the Wenchuan Earthquake with Accurate Surface Topography on Sunway TaihuLight · pdf
Zhao, Jisheng · moreDetecting MPI Usage Anomalies via Partial Program Symbolic Execution · pdf
Zhao, Kai · moreFault Tolerant One-Sided Matrix Decompositions on Heterogeneous Systems with GPUs · pdf
Zhao, Wenlai · moreLarge-Scale Hierarchical K-Means for Heterogeneous Many-Core Supercomputers · pdf
Zheng, Qing · moreScaling Embedded In Situ Indexing with DeltaFS · pdf
Zheng, Weimin · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Zhu, Xiaowei · moreShenTu: Processing Multi-Trillion Edge Graphs on Millions of Cores in Seconds · pdf
Ziatdinov, Maxim A. · more167-PFlops Deep Learning for Electron Microscopy: From Learning Physics to Atomic Manipulation · pdf
Zimmer, Christopher · moreGPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan · pdf
Zimmer, Christopher J. · moreThe Design, Deployment, and Evaluation of the CORAL Pre-Exascale Systems · pdf
A Fast Scalable Implicit Solver for Nonlinear Time-Evolution Earthquake City Problem on Low-Ordered Unstructured Finite Elements with Artificial Intelligence and Transprecision Computing · pdf
Znati, Taieb · morePartial Redundancy in HPC Systems with Non-Uniform Node Reliabilities · pdf

Created 2018-10-17 20:24