作者: 欲望小妹 時間: 2025-3-21 22:40
Grundsatzfragen der Kostenplanung,ent, measure, and analyze applications. To our knowledge, this is the first integrated performance tool framework allowing to analyze TM/SE programs. We demonstrate its usefulness and effectiveness by describing experiments with benchmarks and a real-world application.作者: fulmination 時間: 2025-3-22 04:22
Die Ermittlung des Firmenwertess must be elastically adapted at runtime to match the needs of a dynamic application-workload. In this paper, we introduce the architecture and implementation of c-Eclipse, and describe its key characteristics via a use-case scenario that involves a user creating a description of a 3-tier Cloud appl作者: 凹處 時間: 2025-3-22 08:23
https://doi.org/10.1007/978-3-658-21120-2ay be difficult to generalize. In this article, we show how we crafted a coarse-grain hybrid simulation/emulation of StarPU, a dynamic runtime for hybrid architectures, over SimGrid, a versatile simulator for distributed systems. This approach allows to obtain performance predictions accurate within作者: 招待 時間: 2025-3-22 11:23
Die Ermüdung des Eisenbahnschienenmaterials, extracting those same properties from HPC applications, and for associating bandwidth sensitivity to specific structures in the application source code. We apply our framework to a number of large scale HPC applications, observing that the bandwidth sensitivity model shows an absolute mean error t作者: 交響樂 時間: 2025-3-22 14:38 作者: 交響樂 時間: 2025-3-22 19:06 作者: 大炮 時間: 2025-3-23 00:50
https://doi.org/10.1007/978-3-322-83787-5eover, we present a?case study describing how an advanced simulation tool was used to find new configuration for an actual resource manager deployed in the Czech National Grid, significantly increasing its performance.作者: 旅行路線 時間: 2025-3-23 02:44 作者: 膽大 時間: 2025-3-23 05:55
Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue Gent, measure, and analyze applications. To our knowledge, this is the first integrated performance tool framework allowing to analyze TM/SE programs. We demonstrate its usefulness and effectiveness by describing experiments with benchmarks and a real-world application.作者: 智力高 時間: 2025-3-23 13:07
c-Eclipse: An Open-Source Management Framework for Cloud Applicationss must be elastically adapted at runtime to match the needs of a dynamic application-workload. In this paper, we introduce the architecture and implementation of c-Eclipse, and describe its key characteristics via a use-case scenario that involves a user creating a description of a 3-tier Cloud appl作者: 和音 時間: 2025-3-23 14:22
Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core Architecay be difficult to generalize. In this article, we show how we crafted a coarse-grain hybrid simulation/emulation of StarPU, a dynamic runtime for hybrid architectures, over SimGrid, a versatile simulator for distributed systems. This approach allows to obtain performance predictions accurate within作者: FID 時間: 2025-3-23 18:31 作者: 傻瓜 時間: 2025-3-24 01:36
Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and Rl and a random search at a fraction of the required time (5%) or energy (8%). A comparison to a state-of-the-art multi-objective optimizer (NSGA-II) shows that RS-GDE3 computes solutions of higher quality. Finally, based on the trade-off solutions found by RS-GDE3, we provide a detailed analysis and作者: 金哥占卜者 時間: 2025-3-24 04:13
Characterizing the Performance-Energy Tradeoff of Small ARM Cores in HPC Computation subsystems: single instruction multiple data (SIMD)/floating point and the cache/memory hierarchy; and that static analysis of this kind is sufficient to predict which platform is best for a particular application/input pair. In the context of these findings, we evaluate how some of the key archite作者: 懶洋洋 時間: 2025-3-24 09:02 作者: Rustproof 時間: 2025-3-24 12:37 作者: OASIS 時間: 2025-3-24 16:42
DReAM: Per-Task DRAM Energy Metering in Multicore Systemsw cost, implementation of the ideal model (less than 5% accuracy error when 16 tasks share memory); and (iii) a comparison with standard methods (even distribution and access-count based) proving that . is more accurate than these other methods.作者: 大酒杯 時間: 2025-3-24 19:19
SPAGHETtI: Scheduling/Placement Approach for Task-Graphs on HETerogeneous archItecturean. Moreover, the number of resources to be used for executing the schedule is given by a linear time algorithm. When the resources are bounded we provide a method to reduce the number of necessary resources up to the bound providing a set of compromises between the makespan and the size of the infrastructure.作者: Glutinous 時間: 2025-3-25 01:38 作者: 線 時間: 2025-3-25 04:26 作者: BANAL 時間: 2025-3-25 09:39
,Die Physiologie der Ern?hrung,ormation, exploring how the important blocks vary across thread counts and input sizes, and making modest source code changes (fewer than 10 lines of code) that result in 14-92% savings in parallel program runtime.作者: 錫箔紙 時間: 2025-3-25 15:17
Die Erreger des Fleck- und Felsenfiebersen configuration of cloud virtual machines. We compare the calculated Pareto set with measurements performed in a number of experiments for real–world bags–of–tasks and validate the proposed model and the accuracy of the estimated configurations.作者: 為寵愛 時間: 2025-3-25 16:26
MPI Trace Compression Using Event Flow Graphsw graphs are captured with very low overhead, require orders of magnitude less storage than standard trace files, and can still recover the full sequence of events in the application. We test this new approach with the NERSC-8/Trinity Benchmark suite and achieve compression ratios up to 119x.作者: 咽下 時間: 2025-3-25 21:24
ParaShares: Finding the Important Basic Blocks in Multithreaded Programsormation, exploring how the important blocks vary across thread counts and input sizes, and making modest source code changes (fewer than 10 lines of code) that result in 14-92% savings in parallel program runtime.作者: Offbeat 時間: 2025-3-26 01:13
A Queueing Theory Approach to Pareto Optimal Bags-of-Tasks Scheduling on Cloudsen configuration of cloud virtual machines. We compare the calculated Pareto set with measurements performed in a number of experiments for real–world bags–of–tasks and validate the proposed model and the accuracy of the estimated configurations.作者: trigger 時間: 2025-3-26 05:59
Conference proceedings 2014o, Portugal, in August 2014. The 68 revised full papers presented were carefully reviewed and selected from 267 submissions. The papers are organized in 15 topical sections: support tools environments; performance prediction and evaluation; scheduling and load balancing; high-performance architectur作者: 小教堂 時間: 2025-3-26 10:14
https://doi.org/10.1007/978-3-662-26273-3ide a theoretical performance model that can predict the performance of parallel applications in different virtual machine scheduling policies and evaluate the model in representative hypervisors including KVM, Xen, and VMware. Through this analysis and evaluation, we show that our performance prediction model is accurate and reliable.作者: STENT 時間: 2025-3-26 14:26
Sind Demokratien reformierbar?,d commit with success..Experimental results with the STMBench7 benchmark and the STAMP benchmark suite showed that current coarse-grained, conservative transaction schedulers are not suitable for workloads with long transactions, whereas ProPS is up to 40% faster than all other scheduling alternatives.作者: 兵團 時間: 2025-3-26 17:21
Die Errichtung von Apotheken in Preu?en and deadlines is NP-hard and also that being selfish can cause solutions at most .. far from the optimal, where . is the number of machines and .?>?1 is a constant. Finally, we present efficient heuristics for scenarios with all jobs ready from the beginning.作者: 正常 時間: 2025-3-26 22:53
https://doi.org/10.1007/978-3-662-41414-9hich can thus be adapted for CPUs by removing all the unwanted local-memory arrays together with the obsolete barrier statements. Experiments show that the automated transformation can satisfactorily improve OpenCL kernel performances on Sandy Bridge CPU and Intel’s Many-Integrated-Core coprocessor.作者: 善于騙人 時間: 2025-3-27 04:06 作者: 熱情贊揚 時間: 2025-3-27 07:25
ProPS: A Progressively Pessimistic Scheduler for Software Transactional Memoryd commit with success..Experimental results with the STMBench7 benchmark and the STAMP benchmark suite showed that current coarse-grained, conservative transaction schedulers are not suitable for workloads with long transactions, whereas ProPS is up to 40% faster than all other scheduling alternatives.作者: Hallmark 時間: 2025-3-27 11:06 作者: 業(yè)余愛好者 時間: 2025-3-27 16:33 作者: 極深 時間: 2025-3-27 17:50
ScalaJack: Customized Scalable Tracing with In-situ Data Analysismizable instrumentation and pluggable extension capabilities for problem directed instrumentation and in-situ data analysis. We further eliminate cross cutting concerns by code refactoring for aspect orientation and evaluate these capabilities in case studies within and beyond the scope of tracing.作者: milligram 時間: 2025-3-28 00:02
Energy Efficient Scheduling of MapReduce Jobsduce jobs under a given budget of energy. Using a linear programming relaxation of our problem, we derive a polynomial time constant-factor approximation algorithm. We also propose a convex programming formulation that we combine with standard list scheduling policies, and we evaluate their performance using simulations.作者: nutrients 時間: 2025-3-28 04:55
https://doi.org/10.1007/978-3-642-91288-7w cost, implementation of the ideal model (less than 5% accuracy error when 16 tasks share memory); and (iii) a comparison with standard methods (even distribution and access-count based) proving that . is more accurate than these other methods.作者: Observe 時間: 2025-3-28 07:18 作者: 寵愛 時間: 2025-3-28 12:57
0302-9743 ld in Porto, Portugal, in August 2014. The 68 revised full papers presented were carefully reviewed and selected from 267 submissions. The papers are organized in 15 topical sections: support tools environments; performance prediction and evaluation; scheduling and load balancing; high-performance a作者: 放氣 時間: 2025-3-28 14:52 作者: 修飾語 時間: 2025-3-28 22:12 作者: NIP 時間: 2025-3-29 01:57
MPI Trace Compression Using Event Flow Graphsrmance analysis is becoming increasingly difficult due to the growing complexity of scientific codes and the size of machines. Even though many tools have been developed over the past years to help in this task, current approaches either only offer an overview of the application discarding temporal 作者: doxazosin 時間: 2025-3-29 04:34
ScalaJack: Customized Scalable Tracing with In-situ Data Analysiseasure. We address this problems by combining customized tracing and providing support for in-situ data analysis via ScalaJack, a framework with customizable instrumentation and pluggable extension capabilities for problem directed instrumentation and in-situ data analysis. We further eliminate cros作者: CODA 時間: 2025-3-29 07:55
Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue Gle hardware. This in turn makes it increasingly challenging to achieve correct and efficient thread synchronization. To support the programmer in this task, IBM introduced hardware transactional memory (TM) and speculative execution (SE) in their Blue Gene/Q system with its 16-core processor, which 作者: 斜谷 時間: 2025-3-29 13:06 作者: 盡責 時間: 2025-3-29 18:56
Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core Architecformance of such heterogeneous machines is challenging as it requires to carefully offload computations and manage data movements between the different processing units. The most promising and successful approaches so far rely on task-based runtimes that abstract the machine and rely on opportunisti作者: aqueduct 時間: 2025-3-29 20:57 作者: 宮殿般 時間: 2025-3-30 03:31
ParaShares: Finding the Important Basic Blocks in Multithreaded Programscombing through program source or thread traces for pathologies including communication overheads, data dependencies, and load imbalances. This work takes a new approach: it ignores any underlying pathologies, and focuses instead on pinpointing the exact locations in source code that consume the lar作者: 減弱不好 時間: 2025-3-30 04:45
Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and Rst, auto-tuners have been successfully applied to minimize execution time. However, besides execution time, additional optimization goals have recently arisen, such as energy consumption or computing costs. Therefore, more sophisticated methods capable of exploiting and identifying the trade-offs am作者: 空氣傳播 時間: 2025-3-30 10:14 作者: insecticide 時間: 2025-3-30 13:22
DReAM: Per-Task DRAM Energy Metering in Multicore Systems multicores, which opens new paths to energy/performance optimizations, such as per-task energy-aware task scheduling and energy-aware billing in datacenters. In particular, the contributions of this paper are (i) an ideal per-task energy metering model for DRAM memories; (ii) ., an accurate, yet lo作者: CROW 時間: 2025-3-30 19:18
Characterizing the Performance-Energy Tradeoff of Small ARM Cores in HPC ComputationThe ARM platform that dominates the embedded and mobile computing segments is now being considered as an alternative to high-end x86 processors that largely dominate HPC because peak performance per watt may be substantially improved using off-the-shelf commodity processors..In this work we methodic作者: hazard 時間: 2025-3-30 21:40
On Interactions among Scheduling Policies: Finding Efficient Queue Setup Using High-Resolution Simulthms have been proposed for systems with specific requirements, mainstream resource management systems and schedulers are still only using a limited set of scheduling policies. Production systems need to balance various policies that are set in place to satisfy both the resource providers and users 作者: 小隔間 時間: 2025-3-31 01:20 作者: definition 時間: 2025-3-31 06:58
A Queueing Theory Approach to Pareto Optimal Bags-of-Tasks Scheduling on Clouds this scalability also becomes limited. To investigate the impact of this limitation we focus on bags–of–tasks where task data is stored outside the cloud and has to be transferred across the network before task execution can commence. The existing bags–of–tasks estimation tools are not able to prov作者: NADIR 時間: 2025-3-31 11:16
SPAGHETtI: Scheduling/Placement Approach for Task-Graphs on HETerogeneous archItecture architecture (e.g. CPU or GPU). We show that this algorithm is optimal in complexity .(|.||.|.?+?|.||.|), where |.| is the number of edges, |.| the number of vertices of the scheduled DAG and |.| the number of architectures – usually a small value – and that it is able to compute the optimal makesp作者: Biofeedback 時間: 2025-3-31 15:11
Energy-Aware Multi-Organization Scheduling Problemove machine utilization; however, this can also increase operational costs of less-loaded organizations..We consider energy as a resource, where the objective is to optimize the total energy consumption without increasing the energy spent by a .. We model the problem as a energy-aware variant of the作者: Fulsome 時間: 2025-3-31 19:52