Herkese selam. Uzun zamandır GPU hızlandırma uzerine kafa yorduğum bir sorun uzerinde size danışmak istiyorum. Uzun zaman harcadığım zamanda ekran kartımın desteklediği tum hızlandırma turlerini bir şekilde etkinleştirmeyi başardım. Ucbirim'den vainfo, vdpauinfo ve clinfo komutlarından olması gerektiği şekilde cıkış alabiliyorum ancak bu hızlandırmaları programlar icerisinde kullanamıyorum. Openshot video editor, VLC, svp 4, mpv, ffmpeg, Google Chrome, Blender gibi bir şekilde GPU ile cozme/birleştirme veya hesaplama yapan uygulamalar NVIDIA GPU'yu tanımıyor veya ekran kartının desteklediği teknolojileri kullanmama izin vermiyor. Cok ekstrem şeyler denememe rağmen uygulamalar GPU hızlandırmayı kullanamıyor. Bircok Linux forumunda insanların denedikleri cozumleri denedim ancak bu uygulamaların tamamında GPU hızlandırmayı kullanamıyorum. Aklımdaki ve bulabildiğim cozum onerileri ise tukenmiş durumda. Ne yapacağım konusunda hicbir fikrim yok. Sizce ne yapabilirim veya neyi kacırıyorum?
Bilgisayarımın ozellikleri şu şekilde;
- Samsung uretimi bir laptop. NP300E5X-S02TR model kodlu.
- Intel Core i5-3210u işlemci ve icindeki Intel HD Graphics 4000.
- 4 GB DDR3 bellek.
- NVIDIA GT620M ekran kartı.
- Samsung 860 EVO 250 GB SSD.
Yuklu suruculer ve yazılımlar:
Linux Mint 20.3una cinnamon.
NVIDIA tescilli surucusu 390.147(apt'den kuruldu.)
CUDA Tools'un 9.X surumu(en son destekleyen bu idi, bionic tabanından elle kuruldu.)
VA-API'yı VDPAU ile işlemeyi sağlayan cevirici.
Ekran kartımın desteklediği hızlandırma teknolojileri:
-VA-API(cevirici ile sadece video cozme)
-Vdpau(video cozme ve birleştirme)
-Nvdecode(sadece video cozme, CUDA'ya bağlı)
-OpenCL(hesaplama icin, bazı uygulamalar kullanıyor)
-CUDA(hesaplama icin, bazı uygulamaları kullanıyor)
Hızlandırma hakkındaki cıktılar:
[CODE title="vainfo"]libva info: VA-API version 1.7.0
libva info: Trying to open /usr/lib/x86_64-linux-gnu/dri/nvidia_drv_video.so
libva info: Found init function __vaDriverInit_1_0.
libva info: va_openDriver() returns 0
vainfo: VA-API version: 1.7 (libva 2.6.0)
vainfo: Driver version: Splitted-Desktop Systems VDPAU backend for VA-API - 0.7.4
vainfo: Supported profile and entrypoints.
VAProfileMPEG2Simple : VAEntrypointVLD.
VAProfileMPEG2Main : VAEntrypointVLD.
VAProfileMPEG4Simple : VAEntrypointVLD.
VAProfileMPEG4AdvancedSimple : VAEntrypointVLD.
: VAEntrypointVLD.
VAProfileH264Main : VAEntrypointVLD.
VAProfileH264High : VAEntrypointVLD.
VAProfileVC1Simple : VAEntrypointVLD.
VAProfileVC1Main : VAEntrypointVLD.
VAProfileVC1Advanced : VAEntrypointVLD[/CODE]
[CODE title="vdpauinfo"]display: :0 screen: 0
API version: 1
Information string: NVIDIA VDPAU Driver Shared Library 390.147 Mon Dec 13 13:50:31 UTC 2021.
Video surface:
name width height types.
-------------------------------------------
420 4096 4096 NV12 YV12.
422 4096 4096 UYVY YUYV.
Decoder capabilities:
name level macbs width height.
----------------------------------------------------
MPEG1 0 65536 4032 4048.
MPEG2_SIMPLE 3 65536 4032 4048.
MPEG2_MAIN 3 65536 4032 4048.
H264_BASELINE 41 65536 4032 4080.
H264_MAIN 41 65536 4032 4080.
H264_HIGH 41 65536 4032 4080.
VC1_SIMPLE 1 8190 2048 2048.
VC1_MAIN 2 8190 2048 2048.
VC1_ADVANCED 4 8190 2048 2048.
MPEG4_PART2_SP 3 8192 2048 2048.
MPEG4_PART2_ASP 5 8192 2048 2048.
DIVX4_QMOBILE 0 8192 2048 2048.
DIVX4_MOBILE 0 8192 2048 2048.
DIVX4_HOME_THEATER 0 8192 2048 2048.
DIVX4_HD_1080P 0 8192 2048 2048.
DIVX5_QMOBILE 0 8192 2048 2048.
DIVX5_MOBILE 0 8192 2048 2048.
DIVX5_HOME_THEATER 0 8192 2048 2048.
DIVX5_HD_1080P 0 8192 2048 2048.
H264_CONSTRAINED_BASELINE 41 65536 4032 4080.
H264_EXTENDED 41 65536 4032 4080.
H264_PROGRESSIVE_HIGH 41 65536 4032 4080.
H264_CONSTRAINED_HIGH 41 65536 4032 4080.
H264_HIGH_444_PREDICTIVE 41 65536 4032 4080.
HEVC_MAIN --- not supported ---
HEVC_MAIN_10 --- not supported ---
HEVC_MAIN_STILL --- not supported ---
HEVC_MAIN_12 --- not supported ---
HEVC_MAIN_444 --- not supported ---
Output surface:
name width height nat types.
----------------------------------------------------
B8G8R8A8 16384 16384 y Y8U8V8A8 V8U8Y8A8 A4I4 I4A4 A8I8 I8A8.
R10G10B10A2 16384 16384 y Y8U8V8A8 V8U8Y8A8 A4I4 I4A4 A8I8 I8A8.
Bitmap surface:
name width height.
------------------------------
B8G8R8A8 16384 16384.
R8G8B8A8 16384 16384.
R10G10B10A2 16384 16384.
B10G10R10A2 16384 16384.
A8 16384 16384.
Video mixer:
feature name sup.
------------------------------------
DEINTERLACE_TEMPORAL y
DEINTERLACE_TEMPORAL_SPATIAL y
INVERSE_TELECINE y
NOISE_REDUCTION y
SHARPNESS y
LUMA_KEY y
HIGH QUALITY SCALING - L1 y
HIGH QUALITY SCALING - L2 -
HIGH QUALITY SCALING - L3 -
HIGH QUALITY SCALING - L4 -
HIGH QUALITY SCALING - L5 -
HIGH QUALITY SCALING - L6 -
HIGH QUALITY SCALING - L7 -
HIGH QUALITY SCALING - L8 -
HIGH QUALITY SCALING - L9 -
parameter name sup min max.
-----------------------------------------------------
VIDEO_SURFACE_WIDTH y 1 4096.
VIDEO_SURFACE_HEIGHT y 1 4096.
CHROMA_TYPE y
LAYERS y 0 4
attribute name sup min max.
-----------------------------------------------------
BACKGROUND_COLOR y
CSC_MATRIX y
NOISE_REDUCTION_LEVEL y 0.00 1.00
SHARPNESS_LEVEL y -1.00 1.00
LUMA_KEY_MIN_LUMA y
LUMA_KEY_MAX_LUMA y
[/CODE]
[CODE title="clinfo"]Number of platforms 1
Platform Name NVIDIA CUDA.
Platform Vendor NVIDIA Corporation.
Platform Version OpenCL 1.2 CUDA 9.1.84
Platform Profile FULL_PROFILE.
Platform Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer.
Platform Extensions function suffix NV.
Platform Name NVIDIA CUDA.
Number of devices 1
Device Name GeForce GT 620M.
Device Vendor NVIDIA Corporation.
Device Vendor ID 0x10de.
Device Version OpenCL 1.1 CUDA.
Driver Version 390.147
Device OpenCL C Version OpenCL C 1.1
Device Type GPU.
Device Topology (NV) PCI-E, 01:00.0
Device Profile FULL_PROFILE.
Device Available Yes.
Compiler Available Yes.
Max compute units 2
Max clock frequency 1250MHz.
Compute Capability (NV) 2.1
Max work item dimensions 3
Max work item sizes 1024x1024x64.
Max work group size 1024.
Preferred work group size multiple 32.
Warp size (NV) 32.
Preferred / native vector sizes
char 1 / 1
short 1 / 1
int 1 / 1
long 1 / 1
half 0 / 0 (n/a)
float 1 / 1
double 1 / 1 (cl_khr_fp64)
Half-precision Floating-point support (n/a)
Single-precision Floating-point support (core)
Denormals Yes.
Infinity and NANs Yes.
Round to nearest Yes.
Round to zero Yes.
Round to infinity Yes.
IEEE754-2008 fused multiply-add Yes.
Support is emulated in software No.
Correctly-rounded divide and sqrt operations No.
Double-precision Floating-point support (cl_khr_fp64)
Denormals Yes.
Infinity and NANs Yes.
Round to nearest Yes.
Round to zero Yes.
Round to infinity Yes.
IEEE754-2008 fused multiply-add Yes.
Support is emulated in software No.
Address bits 64, Little-Endian
Global memory size 1011417088 (964.6MiB)
Error Correction support No.
Max memory allocation 252854272 (241.1MiB)
Unified memory for Host and Device No.
Integrated memory (NV) No.
Minimum alignment for any data type 128 bytes.
Alignment of base address 4096 bits (512 bytes)
Global Memory cache type Read/Write
Global Memory cache size 32768 (32KiB)
Global Memory cache line size 128 bytes.
Image support Yes.
Max number of samplers per kernel 16.
Max 2D image size 16384x16384 pixels.
Max 3D image size 2048x2048x2048 pixels.
Max number of read image args 128.
Max number of write image args 8
Local memory type Local.
Local memory size 49152 (48KiB)
Registers per block (NV) 32768.
Max number of constant args 9
Max constant buffer size 65536 (64KiB)
Max size of kernel argument 4352 (4.25KiB)
Queue properties
Out-of-order execution Yes.
Profiling Yes.
Profiling timer resolution 1000ns.
Execution capabilities
Run OpenCL kernels Yes.
Run native kernels No.
Kernel execution timeout (NV) Yes.
Concurrent copy and kernel execution (NV) Yes.
Number of async copy engines 1
Device Extensions cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64 cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll cl_nv_copy_opts cl_nv_create_buffer.
NULL platform behavior.
clGetPlatformInfo(NULL, CL_PLATFORM_NAME, ...) NVIDIA CUDA.
clGetDeviceIDs(NULL, CL_DEVICE_TYPE_ALL, ...) Success [NV]
clCreateContext(NULL, ...) [default] Success [NV]
clCreateContextFromType(NULL, CL_DEVICE_TYPE_DEFAULT) No platform.
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CPU) No devices found in platform.
clCreateContextFromType(NULL, CL_DEVICE_TYPE_GPU) No platform.
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ACCELERATOR) No devices found in platform.
clCreateContextFromType(NULL, CL_DEVICE_TYPE_CUSTOM) Invalid device type for platform.
clCreateContextFromType(NULL, CL_DEVICE_TYPE_ALL) No platform.
ICD loader properties.
ICD loader Name OpenCL ICD Loader.
ICD loader Vendor OCL Icd free software.
ICD loader Version 2.2.11
ICD loader Profile OpenCL 2.1
[/CODE]
Eklemek istediğim bazı şeyler(belki işe yarar);
- Bilgisayar muxless tasarıma sahip. dGPU(NVIDIA) ekran kartı goruntuyu işleyip Intel'in IGPU'suna aktarıyor ve Intel IGPU'su uzerinden ekrana yansıtılıyor.
- NVIDIA ekran kartı doğrudan goruntu işleyecek şekilde secili, oyunlarda duzgun calışıyor.
NVIDIA GPU ile hızlandırma problemi
Linux0 Mesaj
●2 Görüntüleme
- ReadBull.net
- Donanım Forumları
- işletim Sistemleri
- Linux
- NVIDIA GPU ile hızlandırma problemi