nvidia | 易学教程

OpenCL command queue (CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE) not working (MacOS)

阅读更多关于 OpenCL command queue (CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE) not working (MacOS)

问题 Working through the examples and source code from Fixstars. Specifically, I'm trying the last bit of code in chapter 5 (two moving averages - aka Golden Cross): http://www.fixstars.com/en/opencl/book/OpenCLProgrammingBook/opencl-programming-practice/ The code is available here: http://www.fixstars.com/en/opencl/book/sample/ I'll post the specific example below. But the short of it is that by setting up the command queue as follows: command_queue = clCreateCommandQueue(context, device_id, CL

Why won't CG shaders work with GL 3.2?

阅读更多关于 Why won't CG shaders work with GL 3.2?

问题 I've tried everything to get OpenGL 3.2 to render with CG shaders in my game engine but I have had no luck. So I decided to make a bare minimal project but still shaders won't work. In theory my test project should just render a red triangle but it is white because the shader is not doing anything. I'll post the code here: #include <stdio.h> #include <stdlib.h> #include <vector> #include <string> #include <GL/glew.h> #include <Cg/cg.h> #include <Cg/cgGL.h> #include <SDL2/SDL.h> int main() {

Many OpenCL SDK's. Which of them i should choose?

阅读更多关于 Many OpenCL SDK's. Which of them i should choose?

问题 In my computer with Windows 7 OS I have three versions of OpenCL SDKS's from this vendors: Intel NVIDIA AMD. I build my application with each of them. As the output I have three different binaries. For example: my_app_intel_x86, my_app_amd_x86, my_app_nvidia_x86 This binaries are different on this: They use different SDK's in likange process They try to find different OpenCL platform name in runtime Can I use only one SDK and check platform on running time? 回答1: SDK's give debuggings tools, a

How to remove CL_INVALID_PLATFORM error in opencl code?

阅读更多关于 How to remove CL_INVALID_PLATFORM error in opencl code?

问题 Doing simple matrix multiplication using OpenCL: // Multiply two matrices A * B = C #include <stdlib.h> #include <stdio.h> #include <math.h> #include <oclUtils.h> #define WA 3 #define HA 3 #define WB 3 #define HB 3 #define WC 3 #define HC 3 // Allocates a matrix with random float entries. void randomInit(float* data, int size) { for (int i = 0; i < size; ++i) data[i] = rand() / (float)RAND_MAX; } ///////////////////////////////////////////////////////// // Program main ///////////////////////

How to write LOP3 based instructions for Maxwell and up NVIDIA Architecture?

阅读更多关于 How to write LOP3 based instructions for Maxwell and up NVIDIA Architecture?

问题 Maxwell Architecture has introduced a new instruction in PTX assembly called LOP3 which according to the NVIDIA blog: "Can save instructions when performing complex logic operations on multiple inputs." At GTC 2016, some CUDA developers managed to accelerated the atan2f function for Tegra X1 processor (Maxwell) with such instructions. However, the below function defined within a .cu file leads to undefined definitions for __SET_LT and __LOP3_0xe2 . Do I have to define them in .ptx file

Which NVIDIA cuDNN release type for TensorFlow on Ubuntu 16.04 [closed]

阅读更多关于 Which NVIDIA cuDNN release type for TensorFlow on Ubuntu 16.04 [closed]

问题 Closed. This question is off-topic. It is not currently accepting answers. Want to improve this question? Update the question so it's on-topic for Stack Overflow. Closed last year . According to TensorFlow 1.5 installation instructions for Ubuntu 16.04, you need to install cuDNN 7.0 but they don't mention exactly what should be installed: cuDNN v7.0. For details, see NVIDIA's documentation. Ensure that you create the CUDA_HOME environment variable as described in the NVIDIA documentation.

Force windows video driver reload. Is it possible at all?

阅读更多关于 Force windows video driver reload. Is it possible at all?

问题 Some drivers use parameters written in the registry to configure themselves when they get loaded at boot time. I can modify those values and then reboot, but I would like to know if it is possible to force the driver reload, making the changes effective without rebooting. Specifically, I am talking about the video driver (nvidia). I read somewhere, that calling through pINvoke() [User32.ll]::ChangeDisplaySettings() with a 640x480x8bits resolution,(which is so low that it should not be

CUDA and Open MP

阅读更多关于 CUDA and Open MP

问题 I dont have a Fermi at the moment but the targetting platform is tesla/Fermi, the question I want to ask is if Fermi support Open MP like this: #pragma omp parallel for num_threads(N) for (int i=0; i<1000; ++i) { int threadID=omp_get_thread_num(); cudafunctions<<<blocks, threads, 1024, streams[threadID]>>>(input+i*colsizeofinput); }//where there are N streams created. 回答1: Yes, something like that is possible. OpenMP doesn't provide any specific benefit when trying to launch multiple kernels

Linux - run android emulator on Nouveau driver

阅读更多关于 Linux - run android emulator on Nouveau driver

问题 Linux (Debian Sid x64), kernel 4.14, Nvidia GPU. I am unable to run Android emulator on open Nouveau drivers. There is no any error message that I can post, jus segmentation fault. When I choose software rendering, it works but unusable (it runs very slow). Does anybody know any workaround for that, or I am forced to use official Nvidia drivers? 来源： https://stackoverflow.com/questions/47900233/linux-run-android-emulator-on-nouveau-driver

nVidia SLI Tricks [closed]

阅读更多关于 nVidia SLI Tricks [closed]

问题 Closed . This question is opinion-based. It is not currently accepting answers. Want to improve this question? Update the question so it can be answered with facts and citations by editing this post. Closed 4 months ago . I'm optimizing a directx graphics application to take advantage of nVidia's SLI technology. I'm currently investigating some of the techniques mentioned in their 'Best Practices' web page, but wanted to know what advice/experience any of you have had with this? Thanks! 回答1: