There is now a #gitAnnex package on #PyPi: https://pypi.org/project/git-annex/
This should make it simpler to deploy git-annex in Python virtual environments, also as versioned dependencies for software like #Datalad
Packages are built for Linux, Windows, and Mac via GitHub actions: https://github.com/psychoinformatics-de/git-annex-wheel/
Contributions to cover more platforms are most welcome!
I want a build system that:
- is as powerful and flexible as #SCons
- as readable and concise as #SnakeMake
- has a fricking progress bar+ETA
- is
#datalad /
#gitannex agnostic (knows that files can be fetched from elsewhere
- remembers how long building things takes
- balances that to decide if rebuilding locally instead of fetching gigabytes via slow internet is favorable
- integrates well with
#nix for reproducibility
#rdm #dataAnalysis
In the latest DataLad blog post I try out two changes which were introduced in git-annex within the last year: git-remote-annex Git remote helper (this is the big one!) and a small change to enabling WebDAV special remotes. They work brilliantly, and combined they enable read-only data publishing on Nextcloud instances.
https://blog.datalad.org/posts/annex-nextcloud/
#datalad #gitAnnex #nextcloud

Putting new git-annex features to use with Nextcloud
Git-annex continues to evolve. In this post, I want to look at two changes, one big and one small, introduced within the last year. Together, they make publishing files through Nextcloud much nicer.
Specifically, it is now possible for a read-only shared Nextcloud folder to be a one-stop shop for cloning the dataset and getting file contents. This can be a useful setup for sharing (research) data: having the shared folder be a single point of access is convenient, and restricting write access is necessary to prevent unauthorized changes.
To be distributed...✨ Join the next upcoming Mannheim Open Science Meetup! ✨
🗞️ Topic: Reproducible Research Data Management with @datalad
🗣️ Speaker: @lnnrtwttkhn
📅 Date: Wed, Feb 26, 2025
⏰ Time: 2:00 PM
📍 Location: Online, sign up here: https://uni-mannheim.zoom-x.de/meeting/register/u5wpc-ygqDIpH9Z8JRmpRDnkMg1Si9uXnx7h
Why Attend?
✔️ Learn cutting-edge tools like Git, Docker & DataLad
✔️ Boost transparency & reproducibility in research
#OpenScience #ResearchDataManagement #DataLad #Reproducibility


Welcome! You are invited to join a meeting: Mannheim Open Science Meetup - Dr. Lennart Wittkuhn . After registering, you will receive a confirmation email about joining the meeting.
Reproducible Research Data Management with DataLad
Inadequate reproducibility, a lack of transparency, and inefficient workflows - many scientific disciplines have a pressing need for innovation. Additionally, the research process is becoming increasingly digital, computational, and collaborative. I argue that technical solutions to these challenges already exist and can be found in professional software development. These solutions primarily include tools for interactive version control of code and data, such as Git or DataLad, as well as software containers like Docker. In this presentation, I will primarily focus on effective research data management with DataLad. I will discuss how these tools are transforming scientific workflows and identify the (predominantly non-technical) hurdles that exist regarding their implementation.
ZoomJust set up a new Synology NAS box and installed forgejo-aneksajo (a git web UI with built-in git-annex support) on it: https://effigies.gitlab.io/posts/forgejo-aneksajo-synology/
Just a quick post that highlights what needed to be adapted from this earlier post on the #DataLad blog: https://mas.to/@mih/112880585950408351
#gitannex
Chris Markiewicz - Running a git-annex forge on a Synology NAS
Notes on getting forgejo-aneksajo to work with Synology
Chris MarkiewiczMy
#NixOS always wants to bulid
#datalad from source and the tests take aaaages and neither override{,Python}Attrs doCheck=false nor pytestCheckPhase="" works to prevent it 😩

Reproducible Research II: Practices and tools for managing compu…
In this MOOC, we will show you how to improve your practices and your ability to manage and process larger amounts of data, complex computations, while controlling your software environment.
FUN MOOC
Deploying and managing Forgejo-aneksajo with podman and systemd
Run Forgejo as a containerized service in user-space with SSH-passthrough on the host.
@nobodyinperson The `datalad-fuse` extension allows you to use `datalad fsspec-head` to achieve this. I believe it uses git-annex to find a remote URL and then Python's `fsspec` to do the actual fetch.
#DataLad