Blättern nach Person
Ebene hoch |
Norouzi, Mohammad ; Morew, Nicolas ; Ilias, Qamar ; Rothenberger, Lukas ; Jannesari, Ali ; Wolf, Felix (2024)
Fast data-dependence profiling through prior static analysis.
In: Parallel Computing, 119
doi: 10.1016/j.parco.2024.103063
Artikel, Bibliographie
Yu, Sixing ; Mazaheri, Arya ; Jannesari, Ali (2022)
Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning.
39th International Conference on Machine Learning. Baltimore, USA (17.-23.07.2022)
Konferenzveröffentlichung, Bibliographie
Mammadli, Rahim ; Jannesari, Ali ; Wolf, Felix (2020)
Static Neural Compiler Optimization via Deep Reinforcement Learning.
International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20). virtual Conference (09.-19.11.2020)
doi: 10.1109/LLVMHPCHiPar51896.2020.00006
Konferenzveröffentlichung, Bibliographie
Morew, Nicolas ; Norouzi, Mohammad ; Jannesari, Ali ; Wolf, Felix (2020)
Skipping Non-essential Instructions Makes Data-Dependence Profiling Faster.
26th International Conference on Parallel and Distributed Computing. Warsaw, Poland (24.-28.08.2020)
doi: 10.1007/978-3-030-57675-2_1
Konferenzveröffentlichung, Bibliographie
Mazaheri, Arya ; Beringer, Tim ; Moskewicz, Matthew ; Wolf, Felix ; Jannesari, Ali (2020)
Accelerating Winograd Convolutions using Symbolic Computation and Meta-programming.
15th European Conference on Computer Systems. Heraklion, Greece (27.-30.04.2020)
doi: 10.1145/3342195.3387549
Konferenzveröffentlichung, Bibliographie
Norouzi, Mohammad ; Ilias, Qamar ; Jannesari, Ali ; Wolf, Felix (2019)
Accelerating Data-Dependence Profiling with Static Hints.
25th International Conference on Parallel and Distributed Computing (Euro-Par 2019). Göttingen, Germany (26.-30.08.2019)
doi: 10.1007/978-3-030-29400-7_2
Konferenzveröffentlichung, Bibliographie
Mazaheri, Arya ; Schulte, Johannes ; Moskewicz, Matthew ; Wolf, Felix ; Jannesari, Ali (2019)
Enhancing the Programmability and Performance Portability of GPU Tensor Operations.
25th International Conference on Parallel and Distributed Computing (Euro-Par 2019). Göttingen, Germany (26.-30.08.2019)
doi: 10.1007/978-3-030-29400-7_16
Konferenzveröffentlichung, Bibliographie
Norouzi, Mohammad ; Wolf, Felix ; Jannesari, Ali (2019)
Automatic Construct Selection and Variable Classification in OpenMP.
33rd International Conference on Supercomputing. Phoenix, USA (26.-28.06.2019)
doi: 10.1145/3330345.3330375
Konferenzveröffentlichung, Bibliographie
Mammadli, Rahim ; Wolf, Felix ; Jannesari, Ali (2019)
The Art of Getting Deep Neural Networks in Shape.
In: ACM Transactions on Architecture and Code Optimization, 15 (4)
doi: 10.1145/3291053
Artikel, Bibliographie
Mazaheri, Arya ; Wolf, Felix ; Jannesari, Ali (2018)
Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics.
47th International Conference on Parallel Processing (ICPP). Eugene, USA (13.08.2018-16.08.2018)
doi: 10.1145/3225058.3225142
Konferenzveröffentlichung, Bibliographie
Atre, Rohit ; Huda, Zia Ul ; Jannesari, Ali ; Wolf, Felix (2018)
Dissecting sequential programs for parallelization - an approach based on computational units.
In: Concurrency and Computation: Practice and Experience, 31 (5)
doi: 10.1002/cpe.4770
Artikel, Bibliographie
Jannesari, Ali ; Huda, Zia Ul ; Atre, Rohit ; Li, Zhen ; Wolf, Felix (2017)
Parallelizing Audio Analysis Applications - A Case Study.
39th International Conference on Software Engineering, Software Engineering Education and Training Track. Buenos Aires, Argentina (20.-28.05.2017)
doi: 10.1109/ICSE-SEET.2017.9
Konferenzveröffentlichung, Bibliographie
Atre, Rohit ; Jannesari, Ali ; Wolf, Felix (2017)
Meeting the challenges of parallelizing sequential programs.
29th ACM Symposium on Parallelism in Algorithms and Architectures. Washington DC., USA (24.-26.07.2017)
doi: 10.1145/3087556.3087592
Konferenzveröffentlichung, Bibliographie
Huda, Zia Ul ; Atre, Rohit ; Jannesari, Ali ; Wolf, Felix (2016)
Automatic Parallel Pattern Detection in the Algorithm Structure Design Space.
30th IEEE International Parallel and Distributed Processing Symposium. Chicago, USA (23.-27.05.2016)
doi: https://doi.og/10.1109/IPDPS.2016.60
Konferenzveröffentlichung, Bibliographie
Li, Zhen ; Atre, Rohit ; Huda, Zia Ul ; Jannesari, Ali ; Wolf, Felix (2016)
Unveiling Parallelization Opportunities in Sequential Programs.
In: Journal of Systems and Software, 117
doi: 10.1016/j.jss.2016.03.045
Artikel, Bibliographie
Jeyakumaran, Thireshan ; Atoofian, Ehsan ; Xiao, Yang ; Li, Zhen ; Jannesari, Ali (2016)
Improving Performance of Transactional Applications through Adaptive Transactional Memory.
24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing. Heraklion, Greece (17.02.2016-19.02.2016)
doi: 10.1109/PDP.2016.85
Konferenzveröffentlichung, Bibliographie
Li, Zhen ; Jannesari, Ali ; Wolf, Felix (2015)
An Efficient Data-Dependence Profiler for Sequential and Parallel Programs.
29th IEEE International Parallel and Distributed Processing Symposium (IPDPS 20215). Hyderabad, India (25.-29.05.2015)
doi: 10.1109/IPDPS.2015.41
Konferenzveröffentlichung, Bibliographie
Jannesari, Ali ; Wolf, Felix (2015)
Automatic Generation of Unit Tests for Correlated Variables in Parallel Programs.
In: International Journal of Parallel Programming (IJPP), 44 (3)
doi: 10.1007/s10766-015-0363-8
Artikel, Bibliographie
Jannesari, Ali (2010)
Dynamic Race Detection in Parallel Programs.
Karlsruhe Institute of Technology (KIT)
Dissertation, Bibliographie