I am a PhD student working on compilers targeting multi- and many-core systems. While I would love to have access to a Xeon Phi, I am limited by the lab's hardware budget and so don't have one. Regular desktop processors are obviously much cheaper to buy. Also, most people who are interested in using my compiler mostly use regular desktops and notebooks.

As for HD graphics, I do think they can be used as nice co-processors along with the Haswell or Ivy Bridge CPU cores. With Haswell GT3e, I think the HD graphics GPU now has sufficient processing power and bandwidth. They are very suitable for the case where both the CPU and the GPU cores work on different parts of the problem, rather than a pure offload model. They benefit from not having to do data transfers in many cases.

However, the majority of the use-cases of my compiler are in scientific computing, where people tend to use fp64 a lot. Thus, it would be great if we could use fp64 on HD graphics through OpenCL. I have a feature request: perhaps Intel can release limited fp64 support through a vendor extension (something like cl_intel_fp64) where a subset of cl_khr_fp64 is supported. AMD used to do something similar through their extension cl_amd_fp64.

T4 is the GPU that uses NVIDIA’s latest Turing architecture. The specification differences of T4 and V100-PCIe GPU are listed in Table 1. The system features Intel Skylake processors, up to 24 DIMMs, and up to 3 double-width V100-PCIe or 4 single-width T4 GPUs in x16 PCIe 3.0 slots.

Being a triple-slot card, the NVIDIA GeForce RTX 3090 Ti draws power from 1x 16-pin power connector, with power draw rated at 450 W maximum. The GPU operates at a frequency of 1560 MHz, which can be boosted up to 1860 MHz; memory runs at 1313 MHz (21 Gbps effective). Display outputs include: 1x HDMI 2.1, 3x DisplayPort 1.4a.
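To make the fp64 request concrete, here is a minimal sketch of how a compiler back end might choose which double-precision extension to enable for a device. It assumes the back end already has the space-separated extension string an OpenCL runtime reports for CL_DEVICE_EXTENSIONS. The helper names `pick_fp64_extension` and `kernel_preamble` are made up for illustration, and cl_intel_fp64 is only the name proposed in the post, not an extension that is known to exist.

```python
# cl_khr_fp64 and cl_amd_fp64 are existing OpenCL extension names.
# cl_intel_fp64 is the hypothetical name proposed in the post (assumption).
FP64_EXTENSIONS = ("cl_khr_fp64", "cl_amd_fp64", "cl_intel_fp64")


def pick_fp64_extension(device_extensions):
    """Return the first fp64 extension the device advertises, or None.

    `device_extensions` is the space-separated string reported for
    CL_DEVICE_EXTENSIONS. Full support (cl_khr_fp64) is preferred over
    the vendor subsets because it is listed first.
    """
    # Split into whole tokens so a longer name cannot match by substring.
    advertised = set(device_extensions.split())
    for name in FP64_EXTENSIONS:
        if name in advertised:
            return name
    return None


def kernel_preamble(device_extensions):
    """Build the pragma line a code generator could prepend to a kernel."""
    name = pick_fp64_extension(device_extensions)
    if name is None:
        return None  # no fp64 path; the caller has to fall back to fp32
    return "#pragma OPENCL EXTENSION {} : enable".format(name)
```

For a device that only reports a vendor subset, for example `kernel_preamble("cl_khr_icd cl_amd_fp64")`, this returns the pragma for cl_amd_fp64. A device with no fp64 extension gives `None`, which is the situation described above for HD graphics.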