Max Planck Computing and Data Facility

Are you a researcher?

Create a profile to get free access to personal recommendations for colleagues and new articles.
Max Planck Computing and Data Facility
Short name
MPCDF
Country, city
Germany, Garching bei München
Publications
108
Citations
4 892
h-index
29
Top-3 journals
Top-3 organizations
Top-3 foreign organizations
Duke University
Duke University (6 publications)
Imperial College London
Imperial College London (4 publications)

Most cited in 5 years

Buchfink B., Reuter K., Drost H.
Nature Methods scimago Q1 wos Q1
2021-04-07 citations by CoLab: 2404 Abstract  
We are at the beginning of a genomic revolution in which all known species are planned to be sequenced. Accessing such data for comparative analyses is crucial in this new age of data-driven biology. Here, we introduce an improved version of DIAMOND that greatly exceeds previous search performances and harnesses supercomputing to perform tree-of-life scale protein alignments in hours, while matching the sensitivity of the gold standard BLASTP. An updated version of DIAMOND uses improved algorithmic procedures and a customized high-performance computing framework to make seemingly prohibitive large-scale protein sequence alignments feasible.
Tancogne-Dejean N., Oliveira M.J., Andrade X., Appel H., Borca C.H., Le Breton G., Buchholz F., Castro A., Corni S., Correa A.A., De Giovannini U., Delgado A., Eich F.G., Flick J., Gil G., et. al.
Journal of Chemical Physics scimago Q1 wos Q1
2020-03-31 citations by CoLab: 273 PDF Abstract  
Over the last few years, extraordinary advances in experimental and theoretical tools have allowed us to monitor and control matter at short time and atomic scales with a high degree of precision. An appealing and challenging route toward engineering materials with tailored properties is to find ways to design or selectively manipulate materials, especially at the quantum level. To this end, having a state-of-the-art ab initio computer simulation tool that enables a reliable and accurate simulation of light-induced changes in the physical and chemical properties of complex systems is of utmost importance. The first principles real-space-based Octopus project was born with that idea in mind, i.e., to provide a unique framework that allows us to describe non-equilibrium phenomena in molecular complexes, low dimensional materials, and extended systems by accounting for electronic, ionic, and photon quantum mechanical effects within a generalized time-dependent density functional theory. This article aims to present the new features that have been implemented over the last few years, including technical developments related to performance and massive parallelism. We also describe the major theoretical developments to address ultrafast light-driven processes, such as the new theoretical framework of quantum electrodynamics density-functional formalism for the description of novel light–matter hybrid states. Those advances, and others being released soon as part of the Octopus package, will allow the scientific community to simulate and characterize spatial and time-resolved spectroscopies, ultrafast phenomena in molecules and materials, and new emergent states of matter (quantum electrodynamical-materials).
Guther K., Anderson R.J., Blunt N.S., Bogdanov N.A., Cleland D., Dattani N., Dobrautz W., Ghanem K., Jeszenszki P., Liebermann N., Manni G.L., Lozovoi A.Y., Luo H., Ma D., Merz F., et. al.
Journal of Chemical Physics scimago Q1 wos Q1
2020-07-16 citations by CoLab: 74 PDF Abstract  
We present NECI, a state-of-the-art implementation of the Full Configuration Interaction Quantum Monte Carlo (FCIQMC) algorithm, a method based on a stochastic application of the Hamiltonian matrix on a sparse sampling of the wave function. The program utilizes a very powerful parallelization and scales efficiently to more than 24 000 central processing unit cores. In this paper, we describe the core functionalities of NECI and its recent developments. This includes the capabilities to calculate ground and excited state energies, properties via the one- and two-body reduced density matrices, as well as spectral and Green’s functions for ab initio and model systems. A number of enhancements of the bare FCIQMC algorithm are available within NECI, allowing us to use a partially deterministic formulation of the algorithm, working in a spin-adapted basis or supporting transcorrelated Hamiltonians. NECI supports the FCIDUMP file format for integrals, supplying a convenient interface to numerous quantum chemistry programs, and it is licensed under GPL-3.0.
Rizzuto F.P., Naab T., Spurzem R., Giersz M., Ostriker J.P., Stone N.C., Wang L., Berczik P., Rampp M.
2020-11-21 citations by CoLab: 74 PDF Abstract  
ABSTRACT Young dense massive star clusters are promising environments for the formation of intermediate mass black holes (IMBHs) through collisions. We present a set of 80 simulations carried out with nbody6++gpu of 10 models of compact $\sim 7 \times 10^4 \, \mathrm{M}_{\odot }$ star clusters with half-mass radii Rh ≲ 1 pc, central densities $\rho _\mathrm{core} \gtrsim 10^5 \, \mathrm{M}_\odot \, \mathrm{pc}^{-3}$, and resolved stellar populations with 10 per cent primordial binaries. Very massive stars (VMSs) up to $\sim 400 \, \mathrm{M}_\odot$ grow rapidly by binary exchange and three-body scattering with stars in hard binaries. Assuming that in VMS–stellar black hole (BH) collisions all stellar material is accreted on to the BH, IMBHs with masses up to $M_\mathrm{BH} \sim 350 \, \mathrm{M}_\odot$ can form on time-scales of ≲15 Myr, as qualitatively predicted from Monte Carlo mocca simulations. One model forms an IMBH of 140 $\mathrm{M_{\odot }}$ by three BH mergers with masses of 17:28, 25:45, and 68:70 $\mathrm{M_{\odot }}$ within ∼90 Myr. Despite the stochastic nature of the process, formation efficiencies are higher in more compact clusters. Lower accretion fractions of 0.5 also result in IMBH formation. The process might fail for values as low as 0.1. The IMBHs can merge with stellar mass BHs in intermediate mass ratio inspiral events on a 100 Myr time-scale. With 105 stars, 10 per cent binaries, stellar evolution, all relevant dynamical processes, and 300 Myr simulation time, our large suite of 80 simulations indicate another rapid IMBH formation channel in young and compact massive star clusters.
De Smedt K., Koureas D., Wittenburg P.
Publications scimago Q1 wos Q1 Open Access
2020-04-11 citations by CoLab: 61 PDF Abstract  
Data science is facing the following major challenges: (1) developing scalable cross-disciplinary capabilities, (2) dealing with the increasing data volumes and their inherent complexity, (3) building tools that help to build trust, (4) creating mechanisms to efficiently operate in the domain of scientific assertions, (5) turning data into actionable knowledge units and (6) promoting data interoperability. As a way to overcome these challenges, we further develop the proposals by early Internet pioneers for Digital Objects as encapsulations of data and metadata made accessible by persistent identifiers. In the past decade, this concept was revisited by various groups within the Research Data Alliance and put in the context of the FAIR Guiding Principles for findable, accessible, interoperable and reusable data. The basic components of a FAIR Digital Object (FDO) as a self-contained, typed, machine-actionable data package are explained. A survey of use cases has indicated the growing interest of research communities in FDO solutions. We conclude that the FDO concept has the potential to act as the interoperable federative core of a hyperinfrastructure initiative such as the European Open Science Cloud (EOSC).
Chagas da Silva M., Lorke M., Aradi B., Farzalipour Tabriz M., Frauenheim T., Rubio A., Rocca D., Deák P.
Physical Review Letters scimago Q1 wos Q1 Open Access
2021-02-19 citations by CoLab: 59 Abstract  
Supercell models are often used to calculate the electronic structure of local perturbations from the ideal periodicity in the bulk or on the surface of a crystal or in wires. When the defect or adsorbent is charged, a jellium counter charge is applied to maintain overall neutrality, but the interaction of the artificially repeated charges has to be corrected, both in the total energy and in the one-electron eigenvalues and eigenstates. This becomes paramount in slab or wire calculations, where the jellium counter charge may induce spurious states in the vacuum. We present here a self-consistent potential correction scheme and provide successful tests of it for bulk and slab calculations.
Pakmor R., Balbus S.A., Röpke F.K., Podsiadlowski P., Ohlmann S.T., Schneider F.R.
2020-01-01 citations by CoLab: 53 PDF Abstract  
ABSTRACT About 10 per cent of stars more massive than ${\approx}1.5\, {\mathrm{M}}_{\odot }$ have strong, large-scale surface magnetic fields and are being discussed as progenitors of highly magnetic white dwarfs and magnetars. The origin of these fields remains uncertain. Recent three-dimensional (3D) magnetohydrodynamical simulations have shown that strong magnetic fields can be generated in the merger of two massive stars. Here, we follow the long-term evolution of such a 3D merger product in a 1D stellar evolution code. During a thermal relaxation phase after the coalescence, the merger product reaches critical surface rotation, sheds mass and then spins down primarily because of internal mass readjustments. The spin of the merger product after thermal relaxation is mainly set by the co-evolution of the star–torus structure left after coalescence. This evolution is still uncertain, so we also consider magnetic braking and other angular momentum-gain and -loss mechanisms that may influence the final spin of the merged star. Because of core compression and mixing of carbon and nitrogen in the merger, enhanced nuclear burning drives a transient convective core that greatly contributes to the rejuvenation of the star. Once the merger product relaxed back to the main sequence, it continues its evolution similar to that of a genuine single star of comparable mass. It is a slow rotator that matches the magnetic blue straggler τ Sco. Our results show that merging is a promising mechanism to explain some magnetic massive stars and it may also be key to understand the origin of the strong magnetic fields of highly magnetic white dwarfs and magnetars.
Sand C., Ohlmann S.T., Schneider F.R., Pakmor R., Röpke F.K.
Astronomy and Astrophysics scimago Q1 wos Q1
2020-12-01 citations by CoLab: 49 Abstract  
Common-envelope phases are decisive for the evolution of many binary systems. Cases with asymptotic giant branch (AGB) primary stars are of particular interest because they are thought to be progenitors of various astrophysical transients. In three-dimensional hydrodynamic simulations with the moving-mesh code AREPO, we study the common-envelope evolution of a 1.0 M⊙ early-AGB star with companions of different masses. Although the stellar envelope of an AGB star is less tightly bound than that of a red giant, we find that the release of orbital energy of the core binary is insufficient to eject more than about twenty percent of the envelope mass. Ionization energy that is released in the expanding envelope, however, can lead to complete envelope ejection. Because recombination proceeds largely at high optical depths in our simulations, it is likely that this effect indeed plays a significant role in the considered systems. The efficiency of mass loss and the final orbital separation of the core binary system depend on the mass ratio between the companion and the primary star. Our results suggest a linear relation between the ratio of final to initial orbital separation and this parameter.
Kirsebom O. ., Jones S., Strömberg D. ., Martínez-Pinedo G., Langanke K., Röpke F. ., Brown B. ., Eronen T., Fynbo H. ., Hukkanen M., Idini A., Jokinen A., Kankainen A., Kostensalo J., Moore I., et. al.
Physical Review Letters scimago Q1 wos Q1 Open Access
2019-12-24 citations by CoLab: 43 Abstract  
A significant fraction of stars between 7-11 solar masses are thought to become supernovae, but the explosion mechanism is unclear. The answer depends critically on the rate of electron capture on $^{20}$Ne in the degenerate oxygen-neon stellar core. However, due to the unknown strength of the transition between the ground states of $^{20}$Ne and $^{20}$F, it has not previously been possible to fully constrain the rate. By measuring the transition, we have established that its strength is exceptionally large and enhances the capture rate by several orders of magnitude. This has a decisive impact on the evolution of the core, increasing the likelihood that the star is (partially) disrupted by a thermonuclear explosion rather than collapsing to form a neutron star. Importantly, our measurement resolves the last remaining nuclear physics uncertainty in the final evolution of degenerate oxygen-neon stellar cores, allowing future studies to address the critical role of convection, which at present is poorly understood.
Gapsys V., Hahn D.F., Tresadern G., Mobley D.L., Rampp M., de Groot B.L.
2022-02-22 citations by CoLab: 36 Abstract  
Nowadays, drug design projects benefit from highly accurate protein-ligand binding free energy predictions based on molecular dynamics simulations. While such calculations have been computationally expensive in the past, we now demonstrate that workflows built on open source software packages can efficiently leverage pre-exascale computing resources to screen hundreds of compounds in a matter of days. We report our results of free energy calculations on a large set of pharmaceutically relevant targets assembled to reflect industrial drug discovery projects.
Kutzner C., Miletić V., Palacio Rodríguez K., Rampp M., Hummer G., de Groot B.L., Grubmüller H.
2025-02-13 citations by CoLab: 0 Abstract  
ABSTRACTWe benchmarked the performance of the GROMACS 2024 molecular dynamics (MD) code on a modern high‐performance computing (HPC) cluster with AMD CPUs on up to 65,536 CPU cores. We used five different MD systems, ranging in size from about 82,000 to 204 million atoms, and evaluated their performance using two different Message Passing Interface (MPI) libraries, Intel‐MPI and Open‐MPI. The largest system showed near‐perfect strong scaling up to 512 nodes or 65,536 cores, maintaining a parallel efficiency above 0.9 even at the highest level of parallelization. Energy efficiency for a given number of nodes was generally equal to or slightly better than parallel efficiency. We achieved peak performances of 687 ns/d for the 82k atom system, 116 ns/d for the 53M atom system, and about 35 ns/d for the largest 204M atom system. These results demonstrate that highly optimized software running on a state‐of‐the‐art HPC cluster provides sufficient computing power to simulate biomolecular systems at the mesoscale of viruses and organelles, and potentially small cells in the near future.
Bonafé F.P., Albar E.I., Ohlmann S.T., Kosheleva V.P., Bustamante C.M., Troisi F., Rubio A., Appel H.
Physical Review B scimago Q1 wos Q2
2025-02-05 citations by CoLab: 1 Abstract  
We report an , nonrelativistic QED method that couples light and matter self-consistently beyond the electric dipole approximation and without multipolar truncations. This method is based on an extension of the Maxwell-Pauli-Kohn-Sham approach to a full minimal coupling Hamiltonian, where the space- and time-dependent vector potential is coupled to the matter system, and its back reaction to the radiated fields is generated by the full current density. The implementation in the open-source code is designed for massively parallel multiscale simulations considering different grid spacings for the Maxwell and matter subsystems. Here, we show applications of this framework to simulate renormalized Cherenkov radiation of an electronic wave packet, magneto-optical effects with nonchiral light in nonchiral molecular systems, and renormalized plasmonic modes in a nanoplasmonic dimer. We show that in some cases, the beyond-dipole effects cannot be captured by a multipolar expansion Hamiltonian in the length gauge. Finally, we discuss further opportunities enabled by the framework in the field of twisted light and orbital angular momentum, inelastic light scattering, and strong-field physics. Published by the American Physical Society 2025
Li Y., Colnaghi T., Gong Y., Zhang H., Yu Y., Wei Y., Gan B., Song M., Marek A., Rampp M., Zhang S., Pei Z., Wuttig M., Ghosh S., Körmann F., et. al.
Advanced Materials scimago Q1 wos Q1
2024-11-01 citations by CoLab: 0
Dang K.M., Zhang Y.J., Zhang T., Wang C., Sinner A., Coronica P., Poon J.K.
Journal of Neuroscience Methods scimago Q2 wos Q3
2024-11-01 citations by CoLab: 1 Abstract  
The segmentation of cells and neurites in microscopy images of neuronal networks provides valuable quantitative information about neuron growth and neuronal differentiation, including the number of cells, neurites, neurite length and neurite orientation. This information is essential for assessing the development of neuronal networks in response to extracellular stimuli, which is useful for studying neuronal structures, for example, the study of neurodegenerative diseases and pharmaceuticals.
Yu J., Wang Z., Saksena A., Wei S., Wei Y., Colnaghi T., Marek A., Rampp M., Song M., Gault B., Li Y.
Acta Materialia scimago Q1 wos Q1
2024-10-01 citations by CoLab: 1 Abstract  
Quantitative analysis of microstructural features on the nanoscale, including precipitates, local chemical orderings (LCOs) or structural defects (e.g. stacking faults) plays a pivotal role in understanding the mechanical and physical responses of engineering materials. Atom probe tomography (APT), known for its exceptional combination of chemical sensitivity and sub-nanometer resolution, primarily identifies microstructures through compositional segregations. However, this fails when there is no significant segregation, as can be the case for LCOs and stacking faults. Here, we introduce a 3D deep learning approach, AtomNet, designed to process APT point cloud data at the single-atom level for nanoscale microstructure extraction, simultaneously considering compositional and structural information. AtomNet is showcased in segmenting L12-type nanoprecipitates from the matrix in an AlLiMg alloy, irrespective of crystallographic orientations, which outperforms previous methods. AtomNet also allows for 3D imaging of L10-type LCOs in an AuCu alloy, a challenging task for conventional analysis due to their small size and subtle compositional differences. Finally, we demonstrate the use of AtomNet for revealing 2D stacking faults in a Co-based superalloy, without any stacking-faults-relevant samples in the training dataset, expanding the capabilities for automated exploration of hidden microstructures in APT data. AtomNet can thus recognize challenging microstructures, including nanoprecipitates with diameters above 2 nm, LCOs with diameters of about 1–2 nm without obvious compositional segregation, and even unforeseen planar defects by analyzing atom-atom environments. AtomNet pushes the boundaries of APT analysis, and holds promise in establishing precise quantitative microstructure-property relationships across a diverse range of metallic materials.
Ju Y., Huber D., Perez A., Ulbl P., Markidis S., Schlatter P., Schulz M., Schreiber M., Laure E.
2024-09-25 citations by CoLab: 0 Abstract  
The computational power of High-Performance Computing (HPC) systems increases continuously and rapidly. Data-intensive applications are designed to leverage the high computational capacity of HPC resources and typically generate a large amount of data for traditional post-processing data analytics. However, the HPC systems’ in-/output (IO) subsystem develops relatively slowly, and the storage capacity is limited. This could lead to limited actual performance and scientific discovery. In-situ techniques are a partial remedy to these problems by reducing or avoiding the data flow through the IO subsystem to/from the storage. However, in current practice, asynchronous in-situ techniques with static resource management often allocate separate computing resources for executing in-situ task(s), which remain idle if no in-situ work is at hand. In the present work, we target improving the efficiency of computing resource usage by launching and releasing necessary additional computing resources for in-situ task(s). Our approach is based on extensions for MPI Sessions that enable the required dynamic resource management. In this paper, we propose a basic and an advanced in-situ techniques with dynamic resource management enabled by MPI Sessions, their implementations on two real-world use cases, and a critical analysis of the experimental results.
Li Y., Colnaghi T., Gong Y., Zhang H., Yu Y., Wei Y., Gan B., Song M., Marek A., Rampp M., Zhang S., Pei Z., Wuttig M., Ghosh S., Körmann F., et. al.
Advanced Materials scimago Q1 wos Q1
2024-08-12 citations by CoLab: 5 Abstract  
AbstractIn solids, chemical short‐range order (CSRO) refers to the self‐organization of atoms of certain species occupying specific crystal sites. CSRO is increasingly being envisaged as a lever to tailor the mechanical and functional properties of materials. Yet quantitative relationships between properties and the morphology, number density, and atomic configurations of CSRO domains remain elusive. Herein, it is showcased how machine learning‐enhanced atom probe tomography (APT) can mine the near‐atomically resolved APT data and jointly exploit the technique's high elemental sensitivity to provide a 3D quantitative analysis of CSRO in a CoCrNi medium‐entropy alloy. Multiple CSRO configurations are revealed, with their formation supported by state‐of‐the‐art Monte‐Carlo simulations. Quantitative analysis of these CSROs allows establishing relationships between processing parameters and physical properties. The unambiguous characterization of CSRO will help refine strategies for designing advanced materials by manipulating atomic‐scale architectures.
Krushevska V., Shugarov S., Ochner P., Kuznyetsova Y., Petrov M., Kroll P.
2024-07-22 citations by CoLab: 0 Abstract  
Abstract In this study, we present an investigation of the newly discovered dwarf nova ASASSN-19oc during its superoutburst on 2019 June 2. We carried out detailed UBVR c I c -photometric observations and also obtained a spectrum on day 7 of the outburst, which shows the presence of hydrogen absorption lines commonly found in dwarf nova outbursts. Analysis of photometric data reveals the occurrence of early superhumps in the initial days of observations, followed by ordinary and late superhumps. We have accurately calculated the period of the ordinary superhumps as P ord = 0.05681(10) days and determined the periods at different stages, as well as the rate of change of the superhump period (P dot = P ̇ /P = 8.1 × 10−5). Additionally, we have derived the mass ratio of the components (q = 0.09), and estimated the color temperature during the outburst as ∼11,000 K, the distance to the system (d = 560 pc) and absolute magnitude of the system in outburst (M V = 5.3). We have shown that outbursts of this star are very rare: based on brightness measurements on 600 archival photographic plates, we found only one outburst that occurred in 1984. This fact, as well as the properties listed above, convincingly shows that the variable ASASSN-19oc is a dwarf nova of WZ Sge type.
Kokott S., Merz F., Yao Y., Carbogno C., Rossi M., Havu V., Rampp M., Scheffler M., Blum V.
Journal of Chemical Physics scimago Q1 wos Q1
2024-07-11 citations by CoLab: 10 Abstract  
Hybrid density functional approximations (DFAs) offer compelling accuracy for ab initio electronic-structure simulations of molecules, nanosystems, and bulk materials, addressing some deficiencies of computationally cheaper, frequently used semilocal DFAs. However, the computational bottleneck of hybrid DFAs is the evaluation of the non-local exact exchange contribution, which is the limiting factor for the application of the method for large-scale simulations. In this work, we present a drastically optimized resolution-of-identity-based real-space implementation of the exact exchange evaluation for both non-periodic and periodic boundary conditions in the all-electron code FHI-aims, targeting high-performance central processing unit (CPU) compute clusters. The introduction of several new refined message passing interface (MPI) parallelization layers and shared memory arrays according to the MPI-3 standard were the key components of the optimization. We demonstrate significant improvements of memory and performance efficiency, scalability, and workload distribution, extending the reach of hybrid DFAs to simulation sizes beyond ten thousand atoms. In addition, we also compare the runtime performance of the PBE, HSE06, and PBE0 functionals. As a necessary byproduct of this work, other code parts in FHI-aims have been optimized as well, e.g., the computation of the Hartree potential and the evaluation of the force and stress components. We benchmark the performance and scaling of the hybrid DFA-based simulations for a broad range of chemical systems, including hybrid organic–inorganic perovskites, organic crystals, and ice crystals with up to 30 576 atoms (101 920 electrons described by 244 608 basis functions).
Bauer S., Benner P., Bereau T., Blum V., Boley M., Carbogno C., Catlow R., Dehm G., Eibl S., Ernstorfer R., Fekete Á., Foppa L., Fratzl P., Freysoldt C., Gault B., et. al.
2024-07-03 citations by CoLab: 8 Abstract  
Abstract Science is and always has been based on data, but the terms ‘data-centric’ and the ‘4th paradigm of’ materials research indicate a radical change in how information is retrieved, handled and research is performed. It signifies a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) and its subset Machine Learning (ML), has become pivotal in addressing all these challenges. This Roadmap on Data-Centric Materials Science explores fundamental concepts and methodologies, illustrating diverse applications in electronic-structure theory, soft matter theory, microstructure research and experimental techniques like photoemission, and electron microscopy.
While the roadmap delves into specific areas within the broad interdisciplinary field of materials science, the provided examples elucidate key concepts applicable to a wider range of topics. The discussed instances offer insights into addressing the multifaceted challenges encountered in contemporary materials research.
Dharmawardena T.E., Bailer-Jones C.A., Fouesneau M., Foreman-Mackey D., Coronica P., Colnaghi T., Müller T., Wilson A.G.
2024-06-19 citations by CoLab: 2 PDF Abstract  
Abstract Three-dimensional dust density maps are crucial for understanding the structure of the interstellar medium of the Milky Way and the processes that shape it. However, constructing these maps requires large datasets and the methods used to analyse them are computationally expensive and difficult to scale up. As a result it is has only recently become possible to map kiloparsec-scale regions of our Galaxy at parsec-scale grid sampling. We present all-sky three-dimensional dust density and extinction maps of the Milky Way out to 2.8 kpc in distance from the Sun using the fast and scalable Gaussian Process algorithm Dustribution. The sampling of the three-dimensional map is l, b, d = 1○ × 1○ × 1.7 pc. The input extinction and distance catalogue contains 120 million stars with photometry and astrometry from Gaia DR2, 2MASS and AllWISE. This combines the strengths of optical and infrared data to probe deeper into the dusty regions of the Milky Way. We compare our maps with other published 3D dust maps. All maps quantitatively agree at the 0.001 mag pc−1 scale with many qualitatively similar features, although each map also has its own features. We recover Galactic features previously identified in the literature. Moreover, we also see a large under-density that may correspond to an inter-arm or -spur gap towards the Galactic Centre.
Salvadore F., Rossi G., Sathyanarayana S., Bernardini M.
Journal of Supercomputing scimago Q2 wos Q2
2024-06-06 citations by CoLab: 5 Abstract  
Nearly 20 years after the birth of general-purpose GPU computing, the HPC landscape is now dominated by GPUs. After years of undisputed dominance by NVIDIA, new players have entered the arena in a convincing manner, namely AMD and more recently Intel, whose devices currently power the first two clusters in the Top500 ranking. Unfortunately, code porting is still a major problem, even more due to the presence of different vendors, but at the same time the emergence of simplified standard paradigms suggests an encouraging prospect for developers. In this work, we provide a detailed OpenMP porting strategy of STREAmS, a community code for the compressible fluid dynamics. The proposed porting technique is based on the offload functionality of the OpenMP 5.x paradigm and in particular on a hybrid directives/APIs approach that fits seamlessly into the multi-backend software ecosystem of STREAmS. We further carry out a comprehensive performance analysis on the Intel® Data Center GPU Max 1550 (formerly called Ponte Vecchio or PVC). In addition, we analyze the performance of the code on two benchmark clusters powered by PVC, including the exascale Aurora cluster. The performance is evaluated at different levels of parallelism involved, i.e., the intrinsic parallelism of the PVC tile, the inter-tile parallelism within the GPU configuration, between the GPUs within the node and between the nodes within the cluster. The analysis shows that although the implementation complexity of the OpenMP porting is limited, it is necessary to follow some important guidelines to achieve satisfactory performance. The PVC GPU shows about 40% higher performance than the NVIDIA A100 or AMD MI250X GPUs, which, however, were released about 3 years earlier. Both intra-node and internode scalability show good results. Overall, the introduction of PVC into the GPU computing HPC landscape represents a positive step forward for the diversification and competitiveness of the sector.
Cruz-León S., Majtner T., Hoffmann P.C., Kreysing J.P., Kehl S., Tuijtel M.W., Schaefer S.L., Geißler K., Beck M., Turoňová B., Hummer G.
Nature Communications scimago Q1 wos Q1 Open Access
2024-05-11 citations by CoLab: 29 PDF Abstract  
AbstractVisual proteomics attempts to build atlases of the molecular content of cells but the automated annotation of cryo electron tomograms remains challenging. Template matching (TM) and methods based on machine learning detect structural signatures of macromolecules. However, their applicability remains limited in terms of both the abundance and size of the molecular targets. Here we show that the performance of TM is greatly improved by using template-specific search parameter optimization and by including higher-resolution information. We establish a TM pipeline with systematically tuned parameters for the automated, objective and comprehensive identification of structures with confidence 10 to 100-fold above the noise level. We demonstrate high-fidelity and high-confidence localizations of nuclear pore complexes, vaults, ribosomes, proteasomes, fatty acid synthases, lipid membranes and microtubules, and individual subunits inside crowded eukaryotic cells. We provide software tools for the generic implementation of our method that is broadly applicable towards realizing visual proteomics.
Wlazłowski G., Forbes M.M., Sarkar S.R., Marek A., Szpindler M.
PNAS Nexus wos Q1 Open Access
2024-04-15 citations by CoLab: 5 PDF Abstract  
Abstract Ultracold atoms provide a platform for analog quantum computer capable of simulating the quantum turbulence that underlies puzzling phenomena like pulsar glitches in rapidly spinning neutron stars. Unlike other platforms like liquid helium, ultracold atoms have a viable theoretical framework for dynamics, but simulations push the edge of current classical computers. We present the largest simulations of fermionic quantum turbulence to date and explain the computing technology needed, especially improvements in the Eigenvalue soLvers for Petaflop Applications(ELPA) library that enable us to diagonalize matrices of record size (millions by millions). We quantify how dissipation and thermalization proceed in fermionic quantum turbulence by using the internal structure of vortices as a new probe of the local effective temperature. All simulation data and source codes are made available to facilitate rapid scientific progress in the field of ultracold Fermi gases.
Carretero J., Garcia-Blas J., Brinkmann A., Vef M., Besnard J., Torquati M., Ju Y., Montella R.
2024-04-13 citations by CoLab: 0 Abstract  
ADAPIO symposium was a forum to discuss how to create adaptive I/O systems through the creation of an active I/O stack that dynamically adjusts computation and storage requirements through intelligent coordination, malleability of computation and I/O, and the scheduling of storage resources along all levels of the storage hierarchy. Moreover, ADAPIO shows examples of co-designing applications and I/O.

Since 2001

Total publications
108
Total citations
4892
Citations per publication
45.3
Average publications per year
4.5
Average authors per publication
12.15
h-index
29
Metrics description

Top-30

Fields of science

5
10
15
20
25
30
Space and Planetary Science, 28, 25.93%
Astronomy and Astrophysics, 28, 25.93%
General Physics and Astronomy, 17, 15.74%
Condensed Matter Physics, 13, 12.04%
Computer Science Applications, 10, 9.26%
Hardware and Architecture, 10, 9.26%
Software, 9, 8.33%
General Materials Science, 6, 5.56%
Theoretical Computer Science, 6, 5.56%
Physical and Theoretical Chemistry, 5, 4.63%
Mechanical Engineering, 5, 4.63%
Computer Networks and Communications, 5, 4.63%
Multidisciplinary, 4, 3.7%
General Engineering, 4, 3.7%
Mechanics of Materials, 4, 3.7%
Nuclear and High Energy Physics, 4, 3.7%
General Chemistry, 3, 2.78%
General Biochemistry, Genetics and Molecular Biology, 3, 2.78%
Computer Graphics and Computer-Aided Design, 3, 2.78%
Nuclear Energy and Engineering, 3, 2.78%
Artificial Intelligence, 3, 2.78%
Physics and Astronomy (miscellaneous), 2, 1.85%
Library and Information Sciences, 2, 1.85%
Computational Theory and Mathematics, 2, 1.85%
Civil and Structural Engineering, 2, 1.85%
Applied Mathematics, 2, 1.85%
General Computer Science, 2, 1.85%
Modeling and Simulation, 2, 1.85%
Metals and Alloys, 1, 0.93%
Biochemistry, 1, 0.93%
5
10
15
20
25
30

Journals

2
4
6
8
10
12
14
16
2
4
6
8
10
12
14
16

Publishers

2
4
6
8
10
12
14
16
18
2
4
6
8
10
12
14
16
18

With other organizations

2
4
6
8
10
12
14
16
18
2
4
6
8
10
12
14
16
18

With foreign organizations

1
2
3
4
5
6
1
2
3
4
5
6

With other countries

5
10
15
20
25
30
35
USA, 35, 32.41%
France, 15, 13.89%
United Kingdom, 14, 12.96%
China, 12, 11.11%
Spain, 11, 10.19%
Italy, 9, 8.33%
Netherlands, 7, 6.48%
Australia, 6, 5.56%
Austria, 5, 4.63%
Canada, 5, 4.63%
Poland, 5, 4.63%
Switzerland, 5, 4.63%
Denmark, 4, 3.7%
Finland, 4, 3.7%
Japan, 4, 3.7%
Russia, 3, 2.78%
Ukraine, 3, 2.78%
Belgium, 3, 2.78%
Sweden, 3, 2.78%
Greece, 2, 1.85%
Ireland, 2, 1.85%
Mexico, 2, 1.85%
Turkey, 2, 1.85%
Czech Republic, 2, 1.85%
Portugal, 1, 0.93%
Argentina, 1, 0.93%
Brazil, 1, 0.93%
Hungary, 1, 0.93%
Israel, 1, 0.93%
5
10
15
20
25
30
35
  • We do not take into account publications without a DOI.
  • Statistics recalculated daily.
  • Publications published earlier than 2001 are ignored in the statistics.
  • The horizontal charts show the 30 top positions.
  • Journals quartiles values are relevant at the moment.