norden.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Moin! Dies ist die Mastodon-Instanz für Nordlichter, Schnacker und alles dazwischen. Folge dem Leuchtturm.

Administered by:

Server stats:

3.4K
active users

#ReproducibleResearch

0 posts0 participants0 posts today

Last year at the Open Science Retreat (#OSR24NL) I have been introduced to nanopubs by @egonw and created my first nanopub declaring citations for a paper using CiTOs (citation ontologies).

Now, travelling to #OSR25CH, due to issues with the train network foreseen with plenty of time, I used the opportunity and created a new one ( w3id.org/np/RAicq7k9QHX8EG8ho7 ) for my latest publication on teaching #Snakemake on #HPC systems.

The teaching material is — again — in desperate need of additions and overhaul, but that is for another day.

w3id.orgRAicq7k9QH (explore) | nanodash
Continued thread

Reading report 182 by Institute for replication:

« While demand estimates perfectly replicate, the production function estimates of Hong and Luparello (2024) differ slightly from those in Orr (2022) to the second decimal.

Hong and Luparello (2024) argue this due to numerical instability of STATA’s matrix inversion when recovering marginal cost, which I believe is correct.

The documentation for the particular command used for the marginal cost inversion — luinv in STATA — notes that different computers can give different results when a matrix is close to singular. »

Ok, now I would like to inspect the numerical method behind ’luinv’.

Doc reads « This function uses the MKL LAPACK by default. » … but for some cases Netlib LAPACK could also be used. Hum?!

🤔 Which LAPACK version? Compiled using which options? In other words, which computational environment?

i4replication.org
stata.com/manuals/m-5luinv.pdf

i4replication.orgInstitute For Replication

Do you know Stata?

« a general-purpose statistical proprietary software package developed by StataCorp for data manipulation, visualization, statistics, and automated reporting. It is used by researchers in many fields, including biomedicine, economics, epidemiology, and sociology. »

Guess how much does it cost? $USD ~1000 per year! Option government/nonprofit single-user. Bang!

And guess what?

For this price, you have only the “fast“ option, not “twice as fast”, neither “almost four times as fast” or neither even “even faster”. Ahah!

Crazy to read we’re still there in 2025 about #ReproducibleResearch and #OpenScience in some academic fields…

en.wikipedia.org/wiki/Stata
stata.com/order/new/gov/single

en.wikipedia.orgStata - Wikipedia
Continued thread

Ce qui est toujours passionant aux journées du Réseau Français de la Recherche Reproductible, c’est l’inter-disciplinarité.

Hier aprês des ouvertures sur les pratiques éditoriales, des présentations autour de l’analyse d'images IRM et de la recherche pré-clinique.

Puis discussion avec un café et des pralines sur la géomatique avec @NRoelandt.

Ce matin, une présentation sur la Science Ouverte en archéologie pré-historique puis ensuite une sur l’astrophysique.

Une belle stimulation par la pollinisation croisée.

jrfrr-2025.sciencesconf.org/?l
#ReproducibleResearch #OpenScience

jrfrr-2025.sciencesconf.orgjrfrr-2025 : Journées du Réseau National de la Recherche Reproductible - Sciencesconf.org

« Revisiting my PhD dissertation after 4 years
or Why I wanted to make my dissertation reproducible »

Regret 5: Not keeping track of dependencies

« Dependencies are not only about the R packages. Some R packages require certain software to be installed on the OS, which are called system requirements. For example, the ggplot2 package requires clang++ (C++ compiler), which usually comes with an operating system. The situation gets complex when installing a package with multiple package dependencies that require different system tools. For example, installing the kableExtra package requires the svgLite and xml2 packages that require libpng and libxml2 as system requirements, respectively. So, I had to deal with the system requirements of the 240+ packages in addition to installing those packages. The process was time-consuming. »

🤔 Maybe a perfect job for #Guix?

blog.namisunami.com/b38e5ff5

#OpenScience #ReproducibleResearch @nsunami

blog.namisunami.comRevisiting my PhD dissertation after 4 yearsMy regrets and how I would do differently if I were to start over.

I will continue to find it disturbing, if new #HPC cluster users explicitly instruct a program to use one core/cpu only and then complain that the cluster is so slow. Slower than their basement server.

Usually they do not spot their mistake on their own.

But THIS is actually NOT the disturbing part: such users also tend to always use default parameters. This might or might not be the sensible thing to do for their problem. Also, when reading papers, software parameterization frequently is not reported.

We have a long way to go.

☕ break? #Guix #HPC activity report is out! 🎉

Check it out:

hpc.guix.info/blog/2025/02/gui

The seventh report, time flies! Highlight of key achievements:

• More than 57,000 packages, Guix + related channels for scientific packages;
• Performance and portability: more about MPI;
• ROCm/HIP software stack for AMD GPUs upgrades;
• Migration of Guix-Science channel to Codeberg;
• New version of Guix-Jupyter;
• Common Workflow Language #CWL + Guix = ❤️ : ccwl and ravanan;
• Ensuring Source Code Availability: #SoftwareHeritage rescue mode;
• Re-Deploying Software from the Past;
• Supporting Artifact Evaluation at SuperComputing 2024;
• “Package to Container image” conversion pipeline, #DIAMOND and #DIADEM;
• Digital Electronics Design;
• Reproducible Multiphysics Simulation and Workflows;
• Impact of Hardware Variability;
• Toward Guix on French Tier-1 and EuroHPC Tier-0 machines;
• Pangenome Genetics Research
• Supporting RISC-V
• List of articles, talks, tutorials, events, training sessions.
• … and more…

What a year, isn’t it? 😀

hpc.guix.infoGuix-HPC — Guix-HPC Activity Report, 2024