
000 | 00000cam u2200205 a 4500 | |
001 | 000046152328 | |
005 | 20230706114759 | |
008 | 230626s2023 maua b 001 0 eng d | |
020 | ▼a 9780323912310 (pbk.) | |
020 | ▼a 0323912311 (pbk.) | |
020 | ▼z 9780323984638 (ePub ebook) | |
020 | ▼z 0323984630 (ePub ebook) | |
035 | ▼a (KERIS)BIB000016775110 | |
040 | ▼a 211043 ▼c 211043 ▼d 211009 | |
082 | 0 4 | ▼a 004.35 ▼2 23 |
084 | ▼a 004.35 ▼2 DDCK | |
090 | ▼a 004.35 ▼b K59p4 | |
100 | 1 | ▼a Hwu, Wen-mei. |
245 | 1 0 | ▼a Programming massively parallel processors : ▼b a hands-on approach / ▼c Wen-mei W. Hwu, David B. Kirk, Izzat El Hajj. |
250 | ▼a 4th ed. | |
260 | ▼a Cambridge, MA : ▼b Elsevier : ▼b Morgan Kaufman, ▼c 2023. | |
300 | ▼a xxviii, 551 p. : ▼b ill. (some col.) ; ▼c 24 cm. | |
500 | ▼a Previous edition: published as by David B. Kirk, Wen-mei W. Hwu. Amsterdam: Elsevier, 2017. | |
504 | ▼a Includes bibliographical references and index. | |
650 | 0 | ▼a Multiprocessors. |
650 | 0 | ▼a Parallel processing (Electronic computers). |
650 | 0 | ▼a Parallel programming (Computer science). |
700 | 1 | ▼a Kirk, David, ▼d 1960-. |
700 | 1 | ▼a Hajj, Izzat El. |
945 | ▼a ITMT |
Holdings Information
No. | Location | Call Number | Accession No. | Availability | Due Date | Make a Reservation | Service |
---|---|---|---|---|---|---|---|
No. 1 | Location Science & Engineering Library/Sci-Info(Stacks2)/ | Call Number 004.35 K59p4 | Accession No. 121263433 | Availability In loan | Due Date 2023-12-29 | Make a Reservation Available for Reserve | Service |
Contents information
Table of Contents
1 Introduction Part I Fundamental Concepts 2 Heterogeneous data parallel computing 3 Multidimensional grids and data 4 Compute architecture and scheduling 5 Memory architecture and data locality 6 Performance considerations Part II Parallel Patterns 7 Convolution: An introduction to constant memory and caching 8 Stencil 9 Parallel histogram 10 Reduction And minimizing divergence 11 Prefix sum (scan) 12 Merge: An introduction to dynamic input data identification Part III Advanced patterns and applications 13 Sorting 14 Sparse matrix computation 15 Graph traversal 16 Deep learning 17 Iterative magnetic resonance imaging reconstruction 18 Electrostatic potential map 19 Parallel programming and computational thinking Part IV Advanced Practices 20 Programming a heterogeneous computing cluster: An introduction to CUDA streams 21 CUDA dynamic parallelism 22 Advanced practices and future evolution 23 Conclusion and outlook Appendix A: Numerical considerations