In a surprise move, NVIDIA is bringing CUDA to RISC-V CPUs
Announced at the RISC-V Summit China, this lets RISC-V processors run the CUDA driver and host-side application logic, with NVIDIA GPUs handling the compute tasks
Enables AI systems that pair an open CPU with a proprietary GPU, a big deal for edge, HPC, and China's chipmakers
A potential shift in global AI infrastructure
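For context, the split described above is the standard CUDA division of labor: the host CPU (after this port, a RISC-V core) runs the driver calls and launch logic, while the kernels execute on the GPU. A minimal, illustrative sketch of that split; nothing in the source itself is RISC-V-specific, it would simply be compiled for a RISC-V host:

```cuda
// saxpy.cu: illustrative only. The host-side calls below are what would
// run on the RISC-V CPU under the announced port; the kernel runs on the
// NVIDIA GPU in either case.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void saxpy(int n, float a, const float *x, float *y) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;   // GPU-side compute
    if (i < n) y[i] = a * x[i] + y[i];
}

int main() {
    const int n = 1 << 20;
    float *x, *y;
    cudaMallocManaged(&x, n * sizeof(float));        // CPU-side driver calls
    cudaMallocManaged(&y, n * sizeof(float));
    for (int i = 0; i < n; ++i) { x[i] = 1.0f; y[i] = 2.0f; }

    saxpy<<<(n + 255) / 256, 256>>>(n, 2.0f, x, y);  // CPU launches, GPU runs
    cudaDeviceSynchronize();

    printf("y[0] = %f\n", y[0]);                     // expect 4.0
    cudaFree(x);
    cudaFree(y);
    return 0;
}
```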
By Anton Shilov for @TomsHardware: #NVIDIA #CUDA is coming to #RISCV. Is this a signal of broader ecosystem support?
#NVIDIA Bringing #CUDA To #RISCV
NVIDIA's drivers and CUDA software stack are predominantly supported on x86_64 and AArch64 systems, though in the past they were also supported on IBM POWER. This week at the RISC-V Summit China event, NVIDIA's Frans Sijstermans announced that CUDA will be coming to RISC-V.
#AMD, for their part, can already build the upstream #opensource #AMDKFD kernel compute driver on RISC-V, and the #ROCm user-space components can be built on RISC-V as well.
https://www.phoronix.com/news/NVIDIA-CUDA-Coming-To-RISC-V
Apple AI framework MLX: future support for Nvidia's CUDA
Although Nvidia GPUs no longer ship in Macs, Apple's MLX is set to gain a CUDA backend, so MLX code will soon run on Nvidia hardware as well. This makes interesting ports possible.
#GPUHammer is the first attack to show #Rowhammer bit flips on #GPU memories, specifically on the GDDR6 memory of an #NVIDIA A6000 GPU. Our attacks induce bit flips across all tested DRAM banks, despite in-DRAM defenses like TRR, using only user-level #CUDA #code. These bit flips allow a malicious GPU user to tamper with another user's data on the GPU in shared, time-sliced environments. In a proof-of-concept, we use these bit flips to tamper with a victim's DNN models, degrading model accuracy from 80% to 0.1% with a single bit flip. Enabling Error Correction Codes (ECC) can mitigate this risk, but ECC can introduce up to a 10% slowdown for #ML #inference workloads on an #A6000 GPU.
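The enabling observation in the paper is that ordinary user-level CUDA code can drive the rapid, repeated row activations Rowhammer requires. Below is a heavily simplified sketch of such an access loop; the buffer offsets and iteration count are placeholders, and a real attack additionally needs the GDDR6 bank/row mapping and TRR-evasion patterns the paper reverse-engineers:

```cuda
// hammer.cu: heavily simplified sketch of a GPUHammer-style access loop.
// Offsets and iteration count are placeholders; inducing real bit flips
// requires the GDDR6 row mapping and TRR-evading patterns from the paper,
// which are deliberately omitted here.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void hammer(volatile unsigned int *rowA,
                       volatile unsigned int *rowB,
                       unsigned long long iters) {
    unsigned int sink = 0;
    for (unsigned long long i = 0; i < iters; ++i) {
        sink += *rowA;    // repeatedly activate aggressor row A
        sink += *rowB;    // repeatedly activate aggressor row B
    }
    if (sink == 0xdeadbeefu)          // dead-code guard; volatile already forces the loads
        printf("%u\n", sink);
}

int main() {
    unsigned int *buf;
    const size_t size = 256u << 20;   // 256 MiB working buffer
    cudaMalloc(&buf, size);
    // Element offset chosen only for illustration; it does NOT map to an
    // adjacent DRAM row without the paper's bank/row reverse engineering.
    hammer<<<1, 1>>>(buf, buf + (1u << 20), 1ull << 24);
    cudaDeviceSynchronize();
    cudaFree(buf);
    return 0;
}
```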
Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems
Demystifying NCCL: An In-depth Analysis of GPU Communication Protocols and Algorithms
New #ZLUDA 5 Preview Released For #CUDA On Non-NVIDIA #GPUs
For now this ability to run unmodified CUDA apps on non-#NVIDIA GPUs is focused on #AMD GPUs of the #Radeon RX 5000 series and newer, i.e. Radeon GPUs supported by #ROCm. Besides CUDA code samples, GeekBench has been one of the early targets for testing.
https://www.phoronix.com/news/ZLUDA-5-preview.43
Accelerated discovery and design of Fe-Co-Zr magnets with tunable magnetic anisotropy through machine learning and parallel computing
#CUDA #Physics #MaterialsScience #CondensedMatter #MachineLearning #ML #Package
ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks
#ZLUDA Making Progress In 2025 On Bringing #CUDA To Non-NVIDIA #GPUs
The ZLUDA #opensource effort started half a decade ago as a drop-in CUDA implementation for #Intel GPUs. For several years it was then funded by #AMD as a CUDA implementation for #Radeon GPUs atop #ROCm; the code was open-sourced but later had to be taken down, and since last year the project has been pushing along a new path. The current take on ZLUDA is a multi-vendor CUDA implementation for non-NVIDIA GPUs targeting #AI workloads and more.
https://www.phoronix.com/news/ZLUDA-Q2-2025-Update
Show HN: I built a tensor library from scratch in C++/CUDA
Link: https://github.com/nirw4nna/dsc
Discussion: https://news.ycombinator.com/item?id=44310678
HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration
Ask HN: How to learn CUDA to professional level
Discussion: https://news.ycombinator.com/item?id=44216123
Java developers are no longer limited by CPU cores!
This #InfoQ article explores how to bring GPU-level acceleration to enterprise Java using CUDA, with a practical JNI-based integration pattern, real-world use case, and performance benchmarks.
If you're tackling high-throughput challenges, see how to make Java truly parallel! A rough sketch of the JNI bridge pattern follows after the link below.
Read now: https://bit.ly/4kRGmD7
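As a rough sketch of what such a JNI bridge can look like (the class name `VecAdd` and the vector-add workload are illustrative assumptions, not taken from the article): the Java side declares a native method and loads the library, and the native side, compiled with nvcc, copies the arrays to the GPU, launches a kernel, and copies the result back:

```cuda
// vecadd_jni.cu: hypothetical sketch of the JNI bridge pattern; class and
// method names are illustrative, not taken from the article.
//
// Java side (default package), declared elsewhere:
//   public class VecAdd {
//       static { System.loadLibrary("vecaddjni"); }
//       public static native void add(float[] a, float[] b, float[] out);
//   }
#include <jni.h>
#include <cuda_runtime.h>

__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

extern "C" JNIEXPORT void JNICALL
Java_VecAdd_add(JNIEnv *env, jclass,
                jfloatArray ja, jfloatArray jb, jfloatArray jout) {
    jsize n = env->GetArrayLength(ja);
    size_t bytes = n * sizeof(float);

    // Pin/copy the Java arrays on the host side.
    jfloat *a = env->GetFloatArrayElements(ja, nullptr);
    jfloat *b = env->GetFloatArrayElements(jb, nullptr);
    jfloat *out = env->GetFloatArrayElements(jout, nullptr);

    // The CUDA part is unchanged, Java or not: allocate, copy, launch, copy back.
    float *da, *db, *dc;
    cudaMalloc(&da, bytes); cudaMalloc(&db, bytes); cudaMalloc(&dc, bytes);
    cudaMemcpy(da, a, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(db, b, bytes, cudaMemcpyHostToDevice);
    vecAdd<<<(n + 255) / 256, 256>>>(da, db, dc, n);
    cudaMemcpy(out, dc, bytes, cudaMemcpyDeviceToHost);
    cudaFree(da); cudaFree(db); cudaFree(dc);

    // Release: JNI_ABORT = discard (inputs), 0 = copy result back to Java.
    env->ReleaseFloatArrayElements(ja, a, JNI_ABORT);
    env->ReleaseFloatArrayElements(jb, b, JNI_ABORT);
    env->ReleaseFloatArrayElements(jout, out, 0);
}
```

A pattern like this only pays off when the kernel does enough work per call to amortize the JNI crossing and the two host/device copies.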
That's it, I'm going against AMD for recommending computers for #AI.
I don't even know how to start running something on their NPU via Linux, or how to check whether it's running at all. Windows fares better, but `llama.cpp` doesn't work there.
So, if you want to run AI on your computer: RTX, Mac, or don't bother at all.
Performance of Confidential Computing GPUs