IEEE HPEC Preliminary Agenda

Tuesday, September 13

OMG VSIPL (1pm-5pm)[Alcot]

Organizers: Prof. Tony Skjellum (Director - Auburn University Cyber Research Center)

Tutorial: Mathematics of Big Data (1pm-5pm)[Levermore]

Instructors: Dr. Jeremy kepner (MIT) & Mr. Hayden Jananthan (Vanderbilt)

Tutorial: Securing Your Embedded Systems for Cyberspace (1pm-5pm)[Cambridge]

Instructors: Dr. Michael Vai, Dr. Roger Khazan & Mr. Benjamin Nahill (MIT)

Tutorial: Parallel Programming with OpenMP (1pm-5pm)[Garfield]

Instructor: Dr. Tim Mattson (Principal Engineer - Intel)

Wednesday, September 14 Morning

Plenary Session (9:00-10:00)[Eden Vale B]

Chair: Jeremy Kepner / MIT

Keynote Speaker:

HPEC: The Past, Present and Future Outlook

Mr. David Martinez (HPEC Founder; IEEE Fellow; MIT Lincoln Laboratory Associate Head Cyber Security & Information Sciences Division)

Advanced ASIC & FPGA Technologies 1 (10:20-12:00)[Eden Vale A1]

Chair: Paul Monticciolo / MIT

[Best Paper Finalist] On-chip Memory Efficient Data Layout for 2D FFT on 3D Memory Integrated FPGA

Shreyas Singapura, Viktor Prasanna, Rajgopal Kannan (University of Southern California)

[Best Student Paper Finalist] Real-Time, Low-Latency Image Processing with High Throughput on a Multi-Core SoC

Barath Ramesh, Alan D. George, Herman Lam (University of Florida)

A Hardware Prototype for In-Brain Neural Spike-Sorting

Martin Herbordt, Yinan Liu, Jiayi Sheng (Boston University)

Optimizing Simulation Speed of FPGA Model Based Synthesis

Jeff Caldwell, Bo Marr, David Bloom, Dan Thompson (Raytheon)

Invited Talk

Prof. Martin Herbordt (Boston University)

Graphs & Sparse Data 1 (10:20-12:00)[Eden Vale C1]

Chair: Michael Wolf / Sandia

Invited Talk: Finding the Important Part of a Matrix or Graph

Prof. Gilbert Strang (MIT Mathematics Department)

A Sparse Multi-Dimensional Fast Fourier Transform with Stability to Noise in the Context of Image Processing and Change Detection

Pierre-David Letourneau, M. Harper Langston, Richard Lethin (Reservoir Labs)

Efficient implementation of scatter-gather operations for large scale graph analytics

Manoj Kumar, Mauricio Serrano, Jose Moreira, Pratap Pattnaik, W P Horn, Joefon Jann, Gabriel Tanase (IBM)

Landmark Routing for Large Graphs in Fixed-Memory Environments

Newton Campbell Jr. (BBN), Michael J. Laszlo, Sumitra Mukherjee (Nova Southeastern University)

Mathematical Foundations of the GraphBLAS

Jeremy Kepner (MIT), Peter Aaltonen (Indiana University), David Bader (Georgia Institute of Technology), Aydın Buluc ̧ (Lawrence Berkeley National Laboratory), Franz Franchetti (Carnegie Mellon University), John Gilbert (University of California, Santa Barbara), Dylan Hutchison (University of Washington), Manoj Kumar (IBM), Andrew Lumsdaine (Indiana University), Henning Meyerhenke (Karlsruhe Institute of Technology), Scott McMillan (CMU Software Engineering Institute), Jose Moreira (IBM), John D. Owens (University of California, Davis), Carl Yang (University of California, Davis), Marcin Zalewski (Indiana University), Timothy Mattson (Intel)

High Performance & Cloud Computing 1 (10:20-12:00)[Eden Vale C3]

Chair: Patrick Dreher / NC State

The Open Community Runtime: A Runtime System for Extreme Scale Computing

Timothy G. Mattson, Romain Cledat, Vincent Cave (Intel), Vivek Sarkar (Rice University), Zoran Budimlic, Sanjay Chatterjee (Rice University), Josh Fryman, Ivan Ganev, Robin Knauerhase, Min Lee (Intel), Benoıt Meister (Reservoir Labs), Brian Nickerson, Nick Pepperling, Bala Seshasayee (Intel), Sagnak Tasirlar (Two Sigma), Justin Teller (Facebook), Nick Vrvilo (Rice University)

High-Performance Algorithms and Data Structures to Catch Elephant Flows

Jordi Ros-Giralt, Alan Commike, Richard Lethin (Reservoir Labs), Sourav Maji, Malathi Veeraraghavan (University of Virginia)

Scheduler Technologies in Support of High Performance Data Analysis

Albert Reuther, Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Matthew Hubbell, Michael Jones, Peter Michaleas, Andrew Prout, Antonio Rosa, Jeremy Kepner (MIT)

Optimizing Communication for a 2D-Partitioned Scalable BFS

Jeffrey Young (Georgia Institute of Technology), Julian Romera, Matthias Hauck, Holger Froning (Ruprecht-Karls University of Heidelberg)

Node Level Power Measurements on a Petaflop System

David Brayford, Christoph Bernau, Carla Guillen, Carmen Navarrete (Leibniz Supercomputing Centre)

Lunch; View Posters & Demos 1 (12:00-1:00)[Emerson]

3D DRAM Based Application Specific Hardware Accelerator for SpMV

Fazle Sadi, Larry Pileggi, Franz Franchetti (Carnegie Mellon University)

Rapid Prototyping with Symbolic Computation: Fast Development of Quantum Annealing Solutions

Mark Hodson, Duncan Fletcher, Dan Padilha, Tristan Cook (QxBranch)

CUDA Implementation of an Optimal Online Gaussian-Signal-in-Gaussian-Noise Detector

Nir Nossenson (Northeastern University), Ariel J. Jaffe (Weizmann Institute)

Designing a New High Performance Computing Education Strategy for Professional Scientists and Engineers

Julia S. Mullen, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaeleas, Lauren Milechin, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner, Albert Reuther (MIT)

Which Program Is Slower? Hypothesis Testing for Performance Regressions

Jiahao Chen, Jarrett Revels (MIT)

GPU Processing of Streaming Data: a CUDA Implementation of the FireHose Benchmark

Mauro Bisson (Nvidia), Massimo Bernaschi (Italian Research Council), Massimiliano Fatica (Nvidia)

Leveraging Trilinos’s Next Generation Computing Framework for an Exa-Scale Poro-Elastic Network Simulator Implementation

Ashesh Chattopadhyay, Vinod Kumar, VMK Kotteda (University of Texas El Paso), William Spotz (Sandia National Laboratory)

Performance Characterization and Parallelization of Tesseract Optical Character Recognition on Multicore Architectures

Sunghwan Bae, Jialing Zhang, Seung Woo Son (University of Massachusetts Lowell)

Using Natural Language Processing Models for Understanding Network Anomalies

Ketul Barot, Jialing Zhang, Seung Woo Son (University of Massachusetts Lowell)

Hardware Accelerators for High Performance Computing

Virginia W. Ross, Kevin L. Schoen (Air Force Research Laboratory)

Architecting Processing System Applications Past Moore’s Law

Jeremy W. Horner, John Holland, Eliot Glaser, Gary Petrosky (Northrop Grumman)

Wednesday, September 14 Afternoon

Advanced ASIC & FPGA Technologies 2 (1:00-2:40)[Eden Vale A1]

Chair: Karen Gettings / MIT

[Best Student Paper Finalist] Novo-G#: Large-Scale Reconfigurable Computing with Direct and Programmable Interconnects

Alan D. George (University of Florida), Martin C. Herbordt (Boston University), Herman Lam, Abhijeet G. Lawande (University of Florida), Jiayi Sheng, Chen Yang (Boston University)

In-Storage Embedded Accelerator for Sparse Pattern Processing

Sang-Woo Jun, Huy T. Nguyen, Vijay Gadepally, Arvind (MIT)

Computational and Memory Analysis of Tegra SoCs

Andrew Milluzzi, Alan George, Herman Lam (University of Florida)

PERFECT Case Studies Demonstrating Order of Magnitude Reduction in Power Consumption

David K. Wittenberg (BAE Systems), Edin Kadric, Andre DeHon (University of Pennsylvania), Jonathan Edwards, Jeffrey Smith, Silviu Chiricescu (BAE Systems)

Invited Talk

Prof. Viktor Prasanna (University of Southern California)

Graphs & Sparse Data 2 (1:00-2:40) [Eden Vale C1]

Chair: Scott McMillan / CMU SEI

Kokkos/Qthreads Task-Parallel Approach to Linear Algebra Based Graph Analytics

Michael M. Wolf, H. Carter Edwards, Stephen Olivier (Sandia National Laboratory)

Generating massive complex networks with hyperbolic geometry faster in practice

Moritz von Looz (Karlsruhe Institute of Technology), Mustafa Safa Ozdayi (Istanbul Technical University), Soren Laue (Friedrich Schiller University), Henning Meyerhenke (Karlsruhe Institute of Technology)

Invited Talk: The Future of Scalable Analytics and Machine Learning

Dr. Jennifer Roberts (DARPA I20 - Program Manager)

Accelerated Low-rank Updates to Tensor Decompositions

Muthu Baskaran, M. Harper Langston, Tahina Ramananandro, David Bruns-Smith, Tom Henretty, James Ezick, Richard Lethin (Reservoir Labs)

cuSTINGER : Supporting Dynamic Graph Algorithms for GPUS

Oded Green, David A. Bader (Georgia Institute of Technology)

High Performance & Cloud Computing (1:00-2:40)[Eden Vale C3]

Chair: Franz Franchetti / CMU

Invited Talk: The Massachusetts Open Cloud: Vision and Early Experiences

Prof. Orran Krieger (Boston University)

Hypervisor Performance Analysis for Real-Time Workloads

Geoffrey Phi C. Tran, Yu-An Chen, Dong-In Kang, John Paul Walters, Stephen P. Crago (University of Southern California)

On SDN-Based Extreme-Scale Networks

Haitham Ghalwash, Chun-Hsi Huang (University of Connecticut)

Cross-Institutional Research Cyberinfrastructure for Data Intensive Science

W.Christopher Lenhardt, Mike Conway, Erik Scott, Brian Blanton, Ashok Krishnamorthy (Renaissance Computing Institute)

Scalability of VM Provisioning Systems

Mike Jones, Bill Arcand, Bill Bergeron, David Bestor, Chansup Byun, Lauren Milechin, Vijay Gadepally, Matt Hubbell, Jeremy Kepner, Pete Michaleas, Julie Mullen, Andy Prout, Tony Rosa, Siddharth Samsi, Charles Yee, Albert Reuther (MIT)

Break (2:40-3:00)

Graphs & Sparse Data 3 (3:00-4:40)[Eden Vale C1]

Chair: John Gilbert / UCSB

Invited Talk: Linear-Algebraic Methods in Algorithmic Graph Theory

Prof. Aleksander Madry (MIT CSAIL)

Novel Graph Processor Architecture, Prototype System, and Results

William S. Song, Vitaliy Gleyzer, Alexei Lomakin, Jeremy Kepner (MIT)

Advantages to Modeling Relational Data using Hypergraphs versus Graphs

Michael M. Wolf, Alicia M. Klinvex, Daniel M. Dunlavy (Sandia National Laboratory)

KNN in the Jaccard Space

Ming Ouyang (University of Massachusetts Boston)

A Scale-Free Structure for Power-Law Graph

Richard M. Veras, Tze Meng Low, Franz Franchetti (Carnegie Mellon University)

High Performance & Cloud Computing 3 (3:00-4:40)[Eden Vale C3]

Chair: Vijay Gadepally / MIT

Enhancing the Performance and Robustness of the FEAST Eigensolver

Brendan Gavin, Eric Polizzi (University of Massachusetts Amherst)

LLMapReduce: Multi-Level Map-Reduce for High Performance Data Analysis

Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Matthew Hubbell, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Albert Reuther (MIT)

A framework to integrate MFiX with Trilinos for high fidelity fluidized bed computations

V M Krushnarao Kotteda, Ashesh Chattopadhyay, Vinod Kumar (University of Texas El Paso), William Spotz (Sandia National Laboratory)

Havens: Explicit Reliable Memory Regions for HPC Applications

Saurabh Hukerikar, Christian Engelmann (Oak Ridge National Laboratory)

Invited Talk: High Throughput Connectomics: The Building of a Brain-Scope

Prof. Nir Shavit (MIT Computer Science & AI Laboratory)

Best Student Paper Award Presentation (4:40) [Eden Vale B]

Chair: Brian Sroka / MITRE

Best Paper Award Presentation (4:50) [Eden Vale B]

Chair: Jeremy Kepner / MIT

Reception; View Posters & Demos; Attend BoFs (5:00-8:00)[Emerson & Foyer & Eden Vale]

RISC-V (6:00-7:00) [Eden Vale A1]

Chair: Kurt Keville / MIT

GraphBLAS BoF (6:00-7:00) [Eden Vale C1]

Co-Chairs: John Gilbert / USCB; Scott McMillan / CMU

Vendor Demos

TBD
TBD
TBD
TBD

Thursday September 15 Morning

Plenary Session (9:00-10:00) [Eden Vale B]

Chair: Dr. Albert Reuther / MIT

Keynote Speaker:

Machine Learning, Data Analytics, and Non-Conventional Computer Architecture

Mr. Trung Tran (DARPA MTO - Program Manager)

Break (10:00-10:20)

GPU & Manycore 1 (10:20-12:00)[Eden Vale A1]

Chair: James Lebak / Mathworks

[Best Paper Finalist] How naive is naive SpMV on the GPU?

Markus Steinberger (Max Planck Institute), Andreas Derler (TU Graz), Rhaleb Zayer, Hans-Peter Seidel (Max Planck Institute)

[Best Student Paper Finalist] Towards Parallel Implementation of Associative Inference for Cogent Confabulation

Zhe Li, Qinru Qiu (Syracuse University), Mangesh Tamhankar (Intel)

Silicon Photonic Memory Interconnect for Many-Core Architectures

Ke Wen, Hang Guan, David M. Calhoun (Columbia University), David Donofrio, John Shalf (Lawrence Berkeley Lab), Keren Bergman (Columbia University)

GPU-Accelerated Charge Mapping

Ahmed Sanaullah, Kathleen Lewis, Martin C. Herbordt (Boston University)

Unified and Lightweight Tasks and Conduits: A High Level Parallel Programming Framework

Chao Liu, Miriam Leeser (Northeastern University)

Big Data 1 (10:20-12:00) [Eden Vale A3]

Chair: Tim Mattson / Intel

[Best Paper Finalist] Julia Implementation of the Dynamic Distributed Dimensional Data Model

Alexander Chen, Alan Edelman, Jeremy Kepner, Vijay Gadepally (MIT), Dylan Hutchison (University of Washington)

[Best Student Paper Finalist] From NoSQL Accumulo to NewSQL Graphulo: Design and Utility of Graph Algorithms inside a BigTable Database

Dylan Hutchison (University of Washington), Jeremy Kepner, Vijay Gadepally (MIT), Bill Howe (University of Washington)

BigDAWG Polystore Query Optimization Through Semantic Equivalences

Zuohao She, Surabhi Ravishankar, Jennie Duggan (Northwestern University)

Benchmarking the Graphulo Processing Framework

Timothy Weale (Department of Defense), Vijay Gadepally (MIT), Dylan Hutchison (University of Washington), Jeremy Kepner (MIT)

The BigDAWG Polystore System and Architecture

Vijay Gadepally, Peinan Chen (MIT), Jennie Duggan (Northwestern University), Aaron Elmore (University of Chicago), Brandon Haynes (University of Washington), Jeremy Kepner, Samuel Madden (MIT), Tim Mattson (Intel), Michael Stonebraker (MIT)

Resilient & IoT Computing 1 (10:20-12:00) [Eden Vale A3]

Chair: David Cousins / BBN

Invited Talk: Future DoD Computing and the Emergence of Autonomous Systems

Mr. Robert Bond (MIT Lincoln Laboratory Associate Head ISR Systems & Technology Division)

Invited Talk: Cyber/Physical Resilience

Dr. Igor Linkov (U.S. Army Corps of Engineers)

Enhancing HPC Security with a User-Based Firewall

Andrew Prout, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Michael Houle, Michael Jones, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Albert Reuther, Jeremy Kepner (MIT)

Adding Scalability to Internet of Things Gateways using Parallel Computation of Edge Device Data

Janice Canedo, Anthony Skjellum (Auburn University)

I-Vector Speaker and Language Recognition System on Android

Christian Vazquez-Machado, Pedro Colon-Hernandez (University of Puerto Rico), Pedro A. Torres-Carrasquillo (MIT)

Lunch; View Posters & Demos (12:00-1:00)[Emerson]

Optimization of RAID Erasure Coding Algorithms for Intel Xeon Phi

Aleksei Marov, Andrey Fedorov (Raidix)

Modeling the Performance of Optimizations of 3D Stencil Code on GPUs

Guangwei Zhang, Yinliang Zhao (Xi’an Jiaotong University)

Accelerating Clustering Algorithms Using GPUs

Mahmoud Al-Ayyoub, Qussai Yaseen, Moahmmed Shehab, Yaser Jararweh, Firas Albalas (Jordan University of Science and Technology)

Accelerating FCM-Based Text Classification Algorithm Using GPUs

Moahmmed Shehab, Qussai Yaseen, Mahmoud Al-Ayyoub, Firas Albalas, Yaser Jararweh (Jordan University of Science and Technology)

GPU Accelerated Semantic Search Using Latent Semantic Analysis

Alexandru Iacob, Lucian Itu, Lucian Sasu (Siemens), Florin Moldoveanu (Transilvania University), Constantin Suciu (Siemens)

LDPC Performance over the 802.11n Protocol

Octavio Salcedo Parra, Brayan Reyes Daza (Universidad Distrital FJC)

Parallel Gauss-Seidel Iterative Solution of Laplace's Equation in Clustered1D

Hussam Hussein Abu Azab, Adel Omar Dahmane (Université du Québec à Trois- Rivières), Habib Hamam (University de Moncton)

Title

Author (Institution)

Title

Author (Institution)

Thursday September 15 Afternoon

GPU & Manycore 2 (1:00-2:40) [Eden Vale A1]

Chair: Miriam Leeser / NEU

[Best Paper Finalist] A CUDA Implementation of the PageRank Pipeline Benchmark

Mauro Bisson, Everett Phillips and Massimiliano Fatica (Nvidia)

LU, QR, and Cholesky Factorizations: Programming Model, Performance Analysis and Optimization Techniques for the Intel Knights Landing Xeon Phi

Azzam Haidar, Stanimire Tomov (University of Tennessee), Konstantin Arturovm, Murat Guney, Shane Story (Intel), Jack Dongarra (University of Tennessee)

Polyhedral Compilation for Energy Efficiency

Benoit Pradelle, Muthu Baskaran, Tom Henretty, Benoit Meister, Athanasios Konstantinidis, Richard Lethin (Reservoir Labs)

Implementing Hilbert Transform for Digital Signal Processing on Epiphany Many-Core Coprocessor

Kyle L. Labowski, James A. Ross, Patrick W. Jungwirth (Army Research Lab), David A. Richie (Brown Deer Technology)

Distributed and Configurable Architecture for Neuromorphic Applications on Heterogeneous Cluster

Khadeer Ahmed, Qinru Qiu (Syracuse University), Mangesh Tamhankar (Intel)

Resilient & IoT Computing 2 (1:00-2:40)[Eden Vale C1]

Chair: David Cousins / BBN

Invited Talk: End-to-End Security in the Cloud

Dr. Robert Cunningham (Chair IEEE Cybersecurity Initiative; MIT Lincoln Laboratory Group Leader Secure Resilient Systems & Technology)

Invited Talk: OpenVMS: 40 Years of Mission Critical Computing

Mr. Clair Grant (Director of R&D VMS Software, Inc)

Systems Design of Cybersecurity in Embedded Systems

M. Vai, D. Whelihan (MIT), N. Evancich, K.J. Kwak, J. Li (Intelligent Automation), M. Britton, J. Foley, M. Lynch (Alion Science and Technology), D. Schafer, J. DeMatteis (Air Force Research Laboratory)

Analyzing Heterogeneous Computing Architectures for ADAS and Mobile Imaging Applications

Rafal Malewski (NXP), Markus Levy (EEMBC)

High-throughput Ingest of Data Provenance Records into Accumulo

Thomas Moyer, Vijay Gadepally (MIT)

Quantum Tools & Information Theory 1 (1:00-2:40)[Eden Vale C3]

Chair: Steve Reinhardt / D-Wave

A Quantum Macro Assembler

Scott Pakin (Los Alamos National Laboratory)

Realistic Simulation of Error in Quantum Computing Circuits

Kevin M. Obenland, Andrew J. Kerman (MIT)

ToQ.jl: A high-level programming language for D-Wave machines based on Julia

Daniel O’Malley, Velimir V. Vesselinov (Los Alamos National Laboratory)

Software Systems for High-performance Quantum Computing

Travis S. Humble, Keith A. Britt (Oak Ridge National Laboratory)

Solving large optimization problems with restricted quantum annealers

Federico Spedalieri, Tameem Albash (University of Southern California)

Break (2:40-3:00)

GPU & Manycore 3 (3:00-4:40)[Eden Vale A1]

Chair: Brian Sroka / MITRE

GPU Accelerated, Robust Method for Voxelization of Solid Objects

Cosmin Nita, Iulian Stroia, Lucian Itu, Constantin Suciu, Viorel Mihalef, Manasi Datar, Saikiran Rapaka, Puneet Sharma (Siemens)

Design Space Exploration of GPU Accelerated Cluster Systems for Optimal Data Transfer Using PCIe Bus

Janki Bhimani, Miriam Leeser, Ningfang Mi (Northeastern University)

Performance Analysis and Acceleration of Explicit Integration for Large Kinetic Networks using Batched GPU Computations

Azzam Haidar, Benjamin Brock, Stanimire Tomov, Michael Guidry, Jay Jay Billings, Daniel Shyles, Jack Dongarra (University of Tennessee)

Parallel Motion Estimation and GPU-based Fast Coding Unit Mode Decision for HEVC

Yih-Chuan Lin, Shang-Che Wu (National Formosa University)

Big Data 2 (3:00-4:40) [Eden Vale A3]

Chair: Vijay Gadepally / MIT

[Best Paper Finalist] Benchmarking SciDB Data Import on HPC Systems

Siddharth Samsi, Laura Brattain, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Matthew Hubbell, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner and Albert Reuther (MIT)

[Best Student Paper Finalist] Cross-Engine Query Execution in Federated Database Systems

Ankush M. Gupta, Vijay Gadepally, Michael Stonebraker (MIT)

Integrating Real-Time and Batch Processing in a Polystore

John Meehan, Stan Zdonik Shaobo Tian, Yulong Tian (Brown University), Nesime Tatbul (Intel), Adam Dziedzic, Aaron Elmore (University of Chicago)

Data Transformation and Migration in Polystores

Adam Dziedzic, Aaron J. Elmore (University of Chicago), Michael Stonebraker (MIT)

The BigDawg Monitoring Framework

Peinan Chen, Vijay Gadepally, Michael Stonebraker (MIT)

Quantum Tools & Information Theory 2 (3:00-4:40)[Eden Vale C3]

Chair: Steve Reinhardt / D-Wave

Parameter Setting for Quantum Annealers

Kristen L. Pudenz (Lockheed Martin)

Abstractions Considered Helpful: A Tools Architecture for Adiabatic Quantum Computers

Michael Booth, Edward Dahl, Mark Furtney, Steven P. Reinhardt (D-Wave Systems)

An approach to big data inspired by statistical mechanics

John A. Cortese (MIT)

Associative Array Model of SQL, NoSQL, and NewSQL Databases

Jeremy Kepner, Vijay Gadepally (MIT), Dylan Hutchison (University of Washington), Hayden Jananthan (MIT), Timothy Mattson (Intel), Siddharth Samsi, Albert Reuther (MIT)