Difference between revisions of "Rome APE group publications"

From APEWiki
Jump to navigationJump to search
Line 1: Line 1:
 
==  Papers on Journals or Conference Proceedings ==
 
==  Papers on Journals or Conference Proceedings ==
 +
 +
===2015===
 +
 +
* TWEPP
 +
** A. Lonardo, F. Ameli, R. Ammendola, A. Biagioni, A. Cotta Ramusino, M. Fiorini, O. Frezza, G. Lamanna, F. Lo Cicero, M. Martinelli, I. Neri, P.S. Paolucci, E. Pastorelli, L. Pontisso, D. Rossetti, F. Simeone, F. Simula, M. Sozzi, L. Tosoratto and P. Vicini, '''"NaNet: a configurable NIC bridging the gap between HPC and real-time HEP GPU computing"''', 2015, JINST 10 C04011, doi:10.1088/1748-0221/10/04/C04011 [[http://iopscience.iop.org/1748-0221/10/04/C04011]]
 +
 +
 
===2014===
 
===2014===
  

Revision as of 10:38, 15 April 2015

Papers on Journals or Conference Proceedings

2015

  • TWEPP
    • A. Lonardo, F. Ameli, R. Ammendola, A. Biagioni, A. Cotta Ramusino, M. Fiorini, O. Frezza, G. Lamanna, F. Lo Cicero, M. Martinelli, I. Neri, P.S. Paolucci, E. Pastorelli, L. Pontisso, D. Rossetti, F. Simeone, F. Simula, M. Sozzi, L. Tosoratto and P. Vicini, "NaNet: a configurable NIC bridging the gap between HPC and real-time HEP GPU computing", 2015, JINST 10 C04011, doi:10.1088/1748-0221/10/04/C04011 [[1]]


2014

  • RT
    • A. Lonardo, F. Ameli, R. Ammendola, A. Biagioni, O. Frezza, G. Lamanna, F. Lo Cicero, M. Martinelli, P. S. Paolucci, E. Pastorelli, L. Pontisso, D. Rossetti, F. Simeone, F. Simula, M. Sozzi, L. Tosoratto, P. Vicini, "NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features", preprint arXiv:1406.3568 [physics.ins-det].
  • TWEPP
    • R. Ammendola, A. Biagioni, O. Frezza, G. Lamanna, A. Lonardo, F. Lo Cicero, P. S. Paolucci, F. Pantaleo, D. Rossetti, F. Simula, M. Sozzi, L. Tosoratto, P.Vicini “NaNet: a flexible and configurable low-latency NIC for real-time trigger systems based on GPUs“, in JINST, Journal of Instrumentation, Proceedings of Topical Workshop on Electronics for Particle Physics (TWEPP) 2013, IOP Publishing, 2014 doi:10.1088/1748-0221/9/02/C02023 [[2]]
  • ACAT
    • R. Ammendola, A. Biagioni, O. Frezza, F. Lo Cicero, A. Lonardo, P. S. Paolucci, D. Rossetti, F. Simula, L. Tosoratto and P. Vicini, “Analysis of performance improvements for host and GPU interface of the APENet+ 3D Torus network“, Journal of Physics: Conference Series, Workshop on Advanced Computing & Analysis Techniques in Physics Research (ACAT) 2013, 2014 J. Phys.: Conf. Ser. 523 012013 doi:10.1088/1742-6596/523/1/012013
  • ACAT
    • R. Ammendola, A. Biagioni, L. Deri, M. Fiorini, O. Frezza, G. Lamanna, F. Lo Cicero, A. Lonardo, A. Messina, M. Sozzi, F. Pantaleo, P.S. Paolucci, D. Rossetti, F. Simula, L. Tosoratto and P. Vicini, “GPU for Real Time processing in HEP trigger systems“, Journal of Physics, Conference Series, Workshop on Advanced Computing & Analysis Techniques in Physics Research (ACAT) 2013, 2014 J. Phys.: Conf. Ser. 523 012007 doi: 10.1088/1742-6596/523/1/012007

2013

  • IPDPS
    • R. Ammendola, M. Bernaschi, A. Biagioni, M. Bisson, M. Fatica, O. Frezza, F. Lo Cicero, A. Lonardo, E. Mastrostefano, P. S. Paolucci, D. Rossetti, F. Simula, L. Tosoratto, and P. Vicini, “GPU Techniques Applied to a Cluster Interconnect,” in Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), 2013 IEEE 27th International, pp. 806–815, 2013, DOI: 10.1109/IPDPSW.2013.128 [[3]]
  • TWEPP
    • R. Ammendola, A. Biagioni, O. Frezza, A. Lonardo, F. Lo Cicero, P.S. Paolucci, D. Rossetti, F. Simula, L. Tosoratto and P. Vicini, “APEnet+ 34 Gbps data transmission system and custom transmission logic,” in JINST, Journal of Instrumentation, Proceedings of Topical Workshop on Electronics for Particle Physics (TWEPP) 2013, IOP Publishing, 2013 doi:10.1088/1748-0221/8/12/C12022. [[4]]
  • FPT
    • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini, “Virtual-to-Physical Address Translation for an FPGA-based Interconnect with Host and GPU Remote DMA Capabilities.,” in Field-Programmable Technology (FPT), 2013 International Conference on, 2013, DOI: 10.1109/FPT.2013.6718331. [[5]]
  • CHEP
    • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Pier Stanislao Paolucci, Alessandro Lonardo, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini, “Architectural improvements and 28 nm FPGA implementation of the APEnet+ 3D Torus network for hybrid HPC systems“, Journal of Physics: Conference Series, International Conference on Computing in High Energy and Nuclear Physics (CHEP) 2013, doi:10.1088/1742-6596/513/5/052002 arXiv:1311.1741.
  • CHEP
    • Roberto Ammendola, Andrea Biagioni, Riccardo Fantechi, Ottorino Frezza, Gianluca Lamanna, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Felice Pantaleo, Roberto Piandani, Luca Pontisso, Davide Rossetti, Francesco Simula, Marco Sozzi, Laura Tosoratto, Piero Vicini “NaNet: a low-latency NIC enabling GPU-based, real-time low level trigger systems“ , Journal of Physics: Conference Series, International Conference on Computing in High Energy and Nuclear Physics (CHEP) 2013, doi:10.1088/1742-6596/513/1/012018 arXiv:1311.1010
  • CHEP
    • G Lamanna, R Ammendola, M Bauce, A Biagioni, R Fantechi, M Fiorini, S Giagu, E Graverini, G Lamanna, A Lonardo, A Messina, F Pantaleo, P S Paolucci, R Piandani, M Rescigno, F Simula, M Sozzi, and P Vicini “GPUs for real-time processing in HEP trigger systems“, Journal of Physics: Conference Series, International Conference on Computing in High Energy and Nuclear Physics (CHEP) 2013, doi:10.1088/1742-6596/513/1/012017
  • CHEP
    • S Amerio, D Bastieri, M Corvo, A Gianelle, W Ketchum, T Liu, A Lonardo, D Lucchesi, S Poprocki, R Rivera, L Tosoratto, P Vicini and P Wittich "Many-core applications to online track reconstruction in HEP experiments", Journal of Physics: Conference Series, International Conference on Computing in High Energy and Nuclear Physics (CHEP) 2013, doi:10.1088/1742-6596/513/1/012002
  • EPS/HEP
    • M.Bauce, A.Biagioni, R.Fantechi, M.Fiorini, S.Giagu, E.Graverini, G.Lamanna , A.Lonardo, A. Messina, F.Pantaleo, R.Piandani , M.Rescigno, F.Simula, M.Sozzi and P.Vicini, “GPU for Real Time processing in HEP trigger systems“, Proceedings of Science, European Physical Society Conference on High Energy Physics (EPS-HEP) 2013, PoS EPS-HEP2013 (2013) 503.
  • NSS/MIC
    • R. Ammendola, M. Bauce, A. Biagioni, R. Fantechi, M. Fiorini, S. Giagu, E. Graverini, G. Lamanna, A. Lonardo, A. Messina, F. Pantaleo, R. Piandani, M. Rescigno, F. Simula, M. Sozzi, P. Vicini “ The GAP Project - GPU for Realtime Applications in High Energy Physics and Medical Imaging“, IEEE Xplore, Nuclear Science Symposium and Medical Imaging Conference workshop (NSS/MIC) 2013, DOI: 10.1109/NSSMIC.2013.6829757.
  • NSS/MIC
    • Gianelle, A., Amerio, S. ; Bastieri, D. ; Corvo, M. ; Ketchum, W. ; Liu, T. ; Lonardo, A. ; Lucchesi, D. ; Poprocki, S. ; Rivera, R. ; Tosoratto, L. ; Vicini, P. ; Wittich, P., "Applications of many-core technologies to on-line event reconstruction in High Energy Physics experiments", Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC), 2013 IEEE, DOI: 10.1109/NSSMIC.2013.6829552.
  • ReConfig
    • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto and Piero Vicini, “Design and implementation of a modular, low latency, fault-aware, FPGA-based Network Interface" IEEE Xplore, track on International Conference on Reconfigurable Computing and FPGAs (ReConFig) 2013, doi:10.1109/ReConFig.2013.6732275.

2012

  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini. APEnet+: a 3-D Torus network optimized for GPU-based HPC Systems. New York, NY. Proceedings on 2012 J. Phys.: Conf. Ser. 396 042059 doi:10.1088/1742-6596/396/4/042059 [[6]]. (CHEP 2012).
  • R. Ammendola, A. Biagioni, O. Frezza, A. Lonardo, F. Lo Cicero, P. S. Paolucci, D. Rossetti, A. Salamon, F. Simula, L. Tosoratto, P. Vicini. A 34 Gbps Data Transmission System with FPGAs Embedded Transceivers and QSFP plus Modules. 2012 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE RECORD (NSS/MIC). Book Series: IEEE Nuclear Science Symposium Conference Record. Pages: 872-876. Published: 2012.
  • S. Amerio, R. Ammendola, A. Biagioni, D. Bastieri, D. Benjamin, O. Frezza, S. Gelain, W. Ketchum, Y. K. Kim, F. Lo Cicero, A. Lonardo, T. Liu, D. Lucchesi, P. S. Paolucci, S. Poprocki, D. Rossetti, F. Simula, L. Tosoratto, G. Urso, P. Vicini, and P. Wittich. Applications of GPUs to online track reconstruction in HEP experiments. - NSS-MIC 2012 Proceedings doi:10.1109/NSSMIC.2012.6551422
  • R Ammendola, A Biagioni, O Frezza, F Lo Cicero, A Lonardo, PS Paolucci, D Rossetti, F Simula, L Tosoratto, P Vicini - APEnet+: a 3D Torus network optimized for GPU-based HPC Systems - Journal of Physics: Conference Series - 396 042059 doi:10.1088/1742-6596/396/4/042059 http://iopscience.iop.org/1742-6596/396/4/042059
  • M Bernaschi, M Bisson, D Rossetti - Benchmarking of communication techniques for GPUs, Journal of Parallel and Distributed Computing, 2012/9 http://www.sciencedirect.com/science/article/pii/S0743731512002213.

2011

  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini, QUonG: A GPU-based HPC System Dedicated to LQCD Computing, Application Accelerators in High-Performance Computing, Symposium on, pp. 113-122, 2011 Symposium on Application Accelerators in High-Performance Computing, 2011, http://doi.ieeecomputersociety.org/10.1109/SAAHPC.2011.15
  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti,Francesco Simula, Laura Tosoratto and Piero Vicini. APEnet+ project status. - Proceedings of XXIX International Symposium on Lattice Field Theory (Lattice 2011). July 10-16, 2011. Squaw Valley, Lake Tahoe, CA http://pos.sissa.it/archive/conferences/139/045/Lattice%202011_045.pdf

2010

  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Paolucci, Roberto Petronzio, Davide Rossetti, Andrea Salamon, Gaetano Salina, Francesco Simula, Nazario Tantalo, Laura Tosoratto, Piero Vicini APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters, proceedings of The XXVIII International Symposium on Lattice Field Theory PoS(Lattice 2010)022 and arXiv:1012.0253v1
  • R. Ammendola, A. Biagioni, O. Frezza, F. Lo Cicero, A. Lonardo, P.S. Paolucci, D. Rossetti, A. Salamon, G. Salina, F. Simula, L. Tosoratto, P. Vicini - apeNET+: High Bandwidth 3D Torus Direct Network for PetaFLOPS Scale Commodity Clusters, International Conference on Computing in High Energy and Nuclear Physics (CHEP), October 2010, Taipei, Taiwan. (http://arxiv.org/abs/1102.3796)
  • R. Ammendola, A. Biagioni, O. Frezza, F. Lo Cicero, A. Lonardo, P.S. Paolucci, D. Rossetti, A. Salamon, G. Salina, F. Simula, L. Tosoratto, P. Vicini - Mastering multi-GPU computing on a torus network - GPU Technology Conference 2010 (GTC) - poster
  • R. Ammendola, A. Biagioni, G. Chiodi, O. Frezza, A.Lonardo, F. Lo Cicero, R. Lunadei, D. Rossetti, A. Salamon, G. Salina, F. Simula, L. Tosoratto, P. Vicini - High speed data transfer with FPGAs and QSFP+ modules - Topical Workshop on Electronics for Particle Physics, Aachen, Germany / September 20-24, 2010 (http://iopscience.iop.org/1748-0221/5/12/C12019) and (http://arxiv.org/abs/1103.0128)
  • R. Ammendola et al., High speed data transfer with FPGAs and QSFP+ modules,Nuclear Science Symposium Conference Record (NSS/MIC) 2010 IEEE, Publication Year: 2010, Page(s): 1323 1325, November 2010, Knoxville, Tennesse. DOI: 10.1109/NSSMIC.2010.5873983

Talks and other publications

  • Pier Stanislao Paolucci, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Elena Pastorelli, Francesco Simula, Laura Tosoratto and Piero Vicini, Distributed simulation of polychronous and plastic spiking neural networks: strong and weak scaling of a representative mini-application benchmark executed on a small-scale commodity cluster arXiv:1310.8478
  • Alessandro Lonardo, NaNet: a low-latency NIC enabling GPU-based, real-time low level trigger systems - Talk at Conference on Computing in High Energy and Nuclear Physics (CHEP 2013) in Amsterdam, 15 October 2013. PDF available here.
  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto and Piero Vicini, ’Mutual Watch-dog Networking’: Distributed Awareness of Faults and Critical Events in Petascale/Exascale systems, arXiv:1307.0433, July 2013.
  • Andrea Biagioni, The EURETILE hardware experimental platform - Talk at Computing Architectures Software tools and nano-Technologies. For Numerical Embedded and Scalable Systems (CASTNESS 2013) [7]
  • Laura Tosoratto, Fault and Critical Event Awareness: a no-single-point-of-failure approach for distributed systems - Talk at Computing Architectures Software tools and nano-Technologies. For Numerical Embedded and Scalable Systems (CASTNESS 2013) [8]
  • Davide Rossetti, GPU peer-to-peer techniques applied to a cluster interconnect - talk at CASS 2013 workshop in Boston (Communication Architecture for Scalable Systems), 20 May 2013. PDF available here
  • Piero Vicini, Analysis of performance improvements for host and gpu interface of the APENet+ 3D Torus network. - talk at ACAT 2013 (15th International Workshop on advanced computing and analysis techniques in physics) in Beijing, 18 May 2013. Presentation available here
  • Piero Vicini, GPU for Real Time processing in HEP trigger systems - talk at ACAT 2013 (15th International Workshop on advanced computing and analysis techniques in physics) in Beijing, 17 May 2013. Presentation available here
  • Alessandro Lonardo, Building a Low-latency, Real-time, GPU-based Stream Processing System - talk at GTC 2013 conference in San Jose, 20 March 2013. Available in Video [9] and PDF [10]
  • Davide Rossetti, Breadth First Search on APEnet+ - talk at IA^3 Workshop on Irregular Applications at SC12 conference, 10 Nov 2012. Presentation available here
  • Roberto Ammendola, APEnet+: a 12x34 Gbps data transmission system with FPGAs embedded transceivers and QSFP+ modules. - Poster at NSS-12 Nuclear Science Symposium. Anaheim, California, October 29 – November 3, 2012. PDF available here
  • Davide Rossetti, Multi GPU simulations: status and perspectives - talk given at New Frontiers in Lattice Gauge Theory workshop in Florence, 26 Sep 2012.
  • Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Werner Geurts, Gert Goossens, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini A heterogeneous many-core platform for experiments on scalable custom interconnects and management of fault and critical events, applied to many-process applications: Vol. II, technical report - arXiv preprint arXiv:1307.1270, 2012
  • Davide Rossetti, Leveraging NVIDIA GPUDirect on APEnet+ 3D Torus Cluster Interconnect - talk given at GTC 2012 conference. Video and pdf available here
  • I. Bacivarov, I. Belaid, A. Biagioni, A. El Antably, N. Fournel, O. Frezza, W. Geurts, G. Goossens, J. Jovic, R. Leupers, F. Lo Cicero, A. Lonardo, L. Murillo, P. S. Paolucci, D. Rai, D. Rossetti, F. Rousseau, L. Schor, C. Schumacher, F. Simula, L. Thiele, L. Tosoratto, P. Vicini and H. Yang. EURETILE: Unified Networking Infrastructure for Embedded and HPC many-tile platforms. Poster at HIPEAC12. Jan 2012 Paris, France [[11]]
  • P. S. Paolucci. Brain Simulation Benchmark: Inspiring and benchmarking the scalability and fault-tolerance of future many-tile systems. Poster at HIPEAC12. Jan 2012 Paris, France, [[12]]
  • Davide Rossetti, Remote Direct Memory Access Between NVIDIA GPUs with the APEnet 3D Torus Interconnect - talk given at NVidia booth at Supercomputing 2011 conference. Video available here. PPTX of presentation is here
  • Piero Vicini, QUonG: a GPU-based parallel processor system for scientific computing - talk given at SM&FT 2011 conference, Bari september 2011. Presentation available here
  • Piero Vicini - talk given at SAAHPC Symposium on Application Accelerators in High-Performance Computing 2011, July 19-20, 2011. University of Tennessee Conference Center, Knoxville, Tennessee.
  • Davide Rossetti, Status of the APEnet+ project - talk given at Lattice 2011 Conference, July 10-16 2011, Squaw Valley CA. Presentation avaialble here
  • Davide Rossetti, Many-core platforms and HEP experiments computing - talk given at SuperB Computing R&D workshop, 4-7 July 2011, Ferrara, Italy.
  • Davide Rossetti, Many-core platforms and HEP experiments computing - talk given at XVII SuperB Workshop and Kick Off meeting, May 28 - June 2 2011, Isola d'Elba, Italy
  • Davide Rossetti, Mastering Multi-GPU Computing on a Torus Network - poster at GTC 2010 conference, Sept 2010, San Jose CA.
  • Roberto Ammendola APENet+: a 3D Toroidal Network Enabling PetaFlops Scale Lattice QCD Simulations on Commodity Clusters - talk given at Lattice 2010, Villasimius 18 June 2010 [13].
  • Roberto Ammendola, Review on the GPU-related activities in INFN - talk given at INFN CCR & GRID workshop, 17-21 May 2010, Acireale, Italy

M.S. Theses

2014

  • Luca Pontisso, Caratterizzazione della scheda di comunicazione NaNet e suo utilizzo nel Trigger di Livello 0 basato su GPU dell’esperimento NA62, Master Thesis in Physics, Sapienza - Università di Roma. PDF