A CPU–GPU hybrid approach for the unsymmetric multifrontal method CD Yu, W Wang, D Pierce Parallel Computing 37 (12), 759-770, 2011 | 58 | 2011 |
Performance Optimization for the K Nearest-Neighbor Kernel on x86 Architectures CD Yu, J Huang, W Austin, B Xiao, G Biros Proceedings of the International Conference for High Performance Computing …, 2015 | 45 | 2015 |
Geometry-oblivious FMM for compressing dense SPD matrices CD Yu, J Levitt, S Reiz, G Biros Proceedings of the International Conference for High Performance Computing …, 2017 | 43 | 2017 |
A Kernel-Independent FMM in General Dimensions WB March, B Xiao, S Tharakan, CD Yu, G Biros Proceedings of the International Conference for High Performance Computing …, 2015 | 34 | 2015 |
ASKIT: an efficient, parallel library for high-dimensional kernel summations WB March, B Xiao, CD Yu, G Biros SIAM Journal on Scientific Computing 38 (5), S720-S749, 2016 | 25 | 2016 |
An algebraic parallel treecode in arbitrary dimensions WB March, B Xiao, CD Yu, G Biros Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE …, 2015 | 25 | 2015 |
Distributed-Memory Hierarchical Compression of Dense SPD Matrices CD Yu, S Reiz, G Biros Proceedings of the International Conference for High Performance Computing …, 0 | 24* | |
Implementing Strassen's algorithm with CUTLASS on NVIDIA Volta GPUs J Huang, CD Yu, RA van de Geijn arXiv preprint arXiv:1808.07984, 2018 | 23 | 2018 |
Strassen’s algorithm reloaded on GPUs J Huang, CD Yu, RA Geijn ACM Transactions on Mathematical Software (TOMS) 46 (1), 1-22, 2020 | 21 | 2020 |
Fast approximation of the Gauss--Newton Hessian matrix for the multilayer perceptron C Chen, S Reiz, CD Yu, HJ Bungartz, G Biros SIAM Journal on Matrix Analysis and Applications 42 (1), 165-184, 2021 | 20 | 2021 |
INV-ASKIT: A Parallel Fast Direct Solver for Kernel Matrices CD Yu, WB March, B Xiao, G Biros Parallel and Distributed Processing Symposium, 2016 IEEE International, 161-171, 2016 | 18* | 2016 |
Robust Treecode Approximation for Kernel Machines WB March, B Xiao, S Tharakan, CD Yu, G Biros Proceedings of the 21th ACM SIGKDD International Conference on Knowledge …, 2015 | 16 | 2015 |
Distributed O (N) linear solver for dense symmetric hierarchical semi-separable matrices DY Chenhan, S Reiz, G Biros 2019 IEEE 13th International Symposium on Embedded Multicore/Many-core …, 2019 | 12 | 2019 |
Gpunet: Searching the deployable convolution neural networks for gpus L Wang, C Yu, S Salian, S Kierat, S Migacz, AF Florea arXiv preprint arXiv:2205.00841, 2022 | 11 | 2022 |
An N log N Parallel Fast Direct Solver for Kernel Matrices CD Yu, WB March, G Biros Parallel and Distributed Processing Symposium (IPDPS), 2017 IEEE …, 2017 | 11* | 2017 |
Performance models and workload distribution algorithms for optimizing a hybrid CPU–GPU multifrontal solver CD Yu, W Wang Computers & Mathematics with Applications 67 (7), 1421-1437, 2014 | 4 | 2014 |
H-matrix approximation of the Gauss-Newton Hessian matrix for the multilayer perceptron C Chen, S Reiz, C Yu, HJ Bungartz, G Biros 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), 2019 | | 2019 |
The Science of High Performance Algorithms for Hierarchical Matrices CD Yu The University of Texas at Austin, 2018 | | 2018 |
使用多子矩陣法結合中央處理器和圖形處理器解決大型稀疏線性系統 CD Yu 臺灣大學數學研究所學位論文, 1-105, 2012 | | 2012 |
MCSoC 2019 DY Chenhan, S Reiz | | |