#multithreading

#FluidX3D #CFD v3.2 is out! I've implemented the much-requested #GPU summation for object force/torque; it's ~20x faster than #CPU #multithreading. 🖖😋
Horizontal sum in #OpenCL was a nice exercise: first a local-memory reduction, then a hardware-supported atomic floating-point add in VRAM, all in a single-stage kernel. Hammering atomics isn't too bad, as each of the ~10-340 workgroups dispatched at a time does only a single atomic add.
Also improved volumetric #raytracing!
github.com/ProjectPhysX/FluidX
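
For readers curious what that looks like in practice, here's a minimal sketch of the same general pattern (not FluidX3D's actual code) using pyopencl: each workgroup reduces its values in local memory, then one work-item per group atomically adds the partial sum to a global accumulator. Core OpenCL C lacks a float atomic add, so this sketch emulates it with a compare-exchange loop; the hardware-supported add mentioned above would replace that loop where the float-atomics extension is available.

import numpy as np
import pyopencl as cl  # assumed available; pip install pyopencl

KERNEL = r"""
// Portable atomic float add: compare-exchange on the bit pattern.
inline void atomic_add_f(volatile __global float* addr, const float val) {
    union { unsigned int u; float f; } old_v, new_v;
    do {
        old_v.f = *addr;
        new_v.f = old_v.f + val;
    } while (atomic_cmpxchg((volatile __global unsigned int*)addr,
                            old_v.u, new_v.u) != old_v.u);
}

__kernel void sum_values(__global const float* x, const unsigned int n,
                         __global float* result, __local float* scratch) {
    const unsigned int gid = get_global_id(0);
    const unsigned int lid = get_local_id(0);
    scratch[lid] = (gid < n) ? x[gid] : 0.0f;
    barrier(CLK_LOCAL_MEM_FENCE);
    // Tree reduction in local memory.
    for (unsigned int s = get_local_size(0) / 2u; s > 0u; s >>= 1u) {
        if (lid < s) scratch[lid] += scratch[lid + s];
        barrier(CLK_LOCAL_MEM_FENCE);
    }
    // Only one atomic add per workgroup, so contention stays low.
    if (lid == 0u) atomic_add_f(result, scratch[0]);
}
"""

ctx = cl.create_some_context()
queue = cl.CommandQueue(ctx)
prog = cl.Program(ctx, KERNEL).build()

n, wg = 1 << 20, 256
x = np.random.rand(n).astype(np.float32)
mf = cl.mem_flags
x_buf = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=x)
out = np.zeros(1, dtype=np.float32)
out_buf = cl.Buffer(ctx, mf.READ_WRITE | mf.COPY_HOST_PTR, hostbuf=out)

gsize = (n + wg - 1) // wg * wg  # round global size up to a multiple of wg
prog.sum_values(queue, (gsize,), (wg,),
                x_buf, np.uint32(n), out_buf, cl.LocalMemory(4 * wg))
cl.enqueue_copy(queue, out, out_buf)
print(out[0], x.sum())  # should agree up to float32 rounding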

Remember when I mentioned we had ported our #fire propagation #cellularAutomaton from #Python to #Julia, gaining performance and the ability to parallelize more easily and efficiently?

A couple of days ago we had to run another big batch of simulations, and while things progressed well at the beginning, we saw the parallel threads apparently hanging one by one until the whole process sat there doing who knows what.

Our initial suspicion was that we had hit some weird #JuliaLang issue with #multithreading, which seemed to be confirmed by some posts we found on the Julia forums. We tried the workarounds suggested there, to no avail. We tried a different number of threads, which only made the hang occur at a different completion percentage. We tried restarting the simulations, skipping the ones already done. It always got stuck at the same place (for the same number of threads).

So, what was the problem?

1/n

Multithreaded CLI developers: let your users configure the number of threads.

Entire classes of use cases are hiding behind that option, ones that will make your life easier as a dev -- and threads=1 is usually not hard to add.

One example: if your multithreaded tool processes a single file significantly faster when I force it to use a single thread and parallelize it externally with parallel --pipepart --block instead, then either:

  1. you might decide to implement sharding of the physical file's I/O yourself, or

  2. you might consciously decide to not develop it, and leave that complexity to parallel (which is fine!)

But if your tool has no threads=N option, I have no workaround.

A configurable thread count lets me optimize in the meantime (or instead) -- a minimal sketch of such a flag follows.
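
Here's what that flag can look like, sketched in Python with argparse and ThreadPoolExecutor (the tool name and the chunk-counting worker are purely illustrative, not from any real tool):

import argparse
import os
import sys
from concurrent.futures import ThreadPoolExecutor

def process_chunk(chunk: bytes) -> int:
    # Stand-in for the tool's real per-chunk work.
    return chunk.count(b"\n")

def main() -> None:
    parser = argparse.ArgumentParser(description="toy multithreaded filter")
    parser.add_argument("--threads", type=int, default=os.cpu_count() or 1,
                        help="worker threads; 1 disables internal parallelism")
    args = parser.parse_args()

    # Read stdin in 1 MiB chunks until EOF.
    chunks = iter(lambda: sys.stdin.buffer.read(1 << 20), b"")
    if args.threads == 1:
        total = sum(process_chunk(c) for c in chunks)  # no pool overhead at all
    else:
        with ThreadPoolExecutor(max_workers=args.threads) as pool:
            total = sum(pool.map(process_chunk, chunks))
    print(total)

if __name__ == "__main__":
    main()

With threads=1 exposed, the external-parallelism route becomes something like parallel --pipepart -a bigfile --block 64M "mytool --threads 1", where mytool is the hypothetical script above.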

Have you ever programmed a human computer? Watching 30 people walk around the room exchanging information between RAM addresses and CPU registers, while human CPUs execute operations on the clock, is a very special experience*.

This week I learned more than in a ~year of self-study, thanks to the 16th Advanced Scientific Programming in #Python aspp.school
We covered version control, packaging, testing, debugging, computer architecture, some #numpy and #pandas fu, programming patterns (aka what goes into a class and what doesn't), big-O to understand how various operations scale and how to find the fastest one for a given data type and size, and an intro to #multithreading and #multiprocessing 🍭
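
In case anyone wants a taste of that big-O point, here's a toy Python comparison (my own example, not course material): the same membership test is O(n) on a list but O(1) on a set, so the right container depends on the data size.

import timeit

n = 1_000_000
data_list = list(range(n))
data_set = set(data_list)

for container in (data_list, data_set):
    # Time 100 lookups of the worst-case element (last in the list).
    t = timeit.timeit(lambda: n - 1 in container, number=100)
    print(f"{type(container).__name__:>4}: {t * 10:.3f} ms per lookup")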

A personal highlight for me was pair programming. I never thought writing code with a buddy would be so much fun, but I learned a lot from my buddies and now I don't want to go back to writing code alone 😅

Very indebted to the teachers and organizers (aspp.school/wiki/faculty); if you ever meet one of these people, please buy them a drink for what they have done for the code-karma state of the universe.

*our human computer didn't manage to execute the simplest sorting algorithm and the CPUs started to sweat; we experienced what happens when the code is ambiguous and imprecise 😱🫨

Leslie Lamport, of LaTeX fame, is a very accomplished mathematician and computer scientist who received a Turing Award for “fundamental contributions to the theory and practice of distributed and concurrent systems”. He just published a draft of his new book:

"A science of concurrent programs"

lamport.azurewebsites.net/tla/

True to his pedagogic approach to everything he does, "The book assumes only that you know the math one learns before entering a university." Even the appendices are fantastic. I can only wish I'll remain this lucid at 82.

At what point does setting more threads for OpenBLAS actually help?

For example, I have an SVD operation in #RStats on largish matrices (6000 rows and 6000 columns, used to compute an inverse), where the default BLAS on Ubuntu takes ~20 min.

OpenBLAS with 1 or 4 threads takes ~2 min (a 10x speedup!). With 4 threads I can see additional cores being used, but the overall time is the same as with 1 thread.

Is there some magic size where using more threads for SVD will actually help?
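
Not an answer, but a way to measure it: here's a minimal sketch in Python/NumPy (swapping languages purely for illustration; the OpenBLAS layer underneath is the same one R calls) that uses the threadpoolctl package to cap the BLAS thread count per run and time the SVD:

import time
import numpy as np
from threadpoolctl import threadpool_limits  # pip install threadpoolctl

rng = np.random.default_rng(0)
a = rng.standard_normal((6000, 6000))  # shrink this for a quick test

for n_threads in (1, 2, 4, 8):
    # Limit only the BLAS pool, leaving other thread pools untouched.
    with threadpool_limits(limits=n_threads, user_api="blas"):
        t0 = time.perf_counter()
        np.linalg.svd(a)
        print(f"{n_threads} threads: {time.perf_counter() - t0:.1f} s")

This assumes NumPy is linked against OpenBLAS (np.show_config() will tell you).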