𝗬𝗮𝗳𝗮𝗻 𝗛𝘂𝗮𝗻𝗴 [Advisor: Guanpeng Li] will defend his doctoral thesis entitled "Data-efficient and Fault-tolerant Exascale Computing" tomorrow Thursday 4/16 at 3pm.

Deets at https://bit.ly/huang_4_16

#FinalExam #PhDLife #UIowaGrad26 #HPC #FaultTolerance

So apparently, NASA's secret sauce for building a "fault-tolerant" computer involves getting blocked by #Cloudflare while trying to access the article. 🚫✨ Who knew #cybersecurity was just a fancy way of saying you can't read about computers? 🤖🔒
https://cacm.acm.org/news/how-nasa-built-artemis-iis-fault-tolerant-computer/ #NASA #FaultTolerance #ComputerScience #HackerNews #ngated
How NASA Built Artemis II’s Fault-Tolerant Computer

Communications of the ACM
How NASA Built Artemis II’s Fault-Tolerant Computer

Communications of the ACM

For those with more than a passing interest in Information Systems security (OK not everyone🤣) , this might prove to be interesting. Although as commented,

’build the governance first, get the key parties committed, define the trust roots, enforce the rules – is precisely the kind of process that works in Switzerland and struggles almost everywhere else.’

https://www.theregister.com/2026/03/17/switzerland_bgp_alternative/

#IT #Security #FaultTolerance #ETH #Switzerland

Switzerland built a secure alternative to BGP. The rest of the world hasn't noticed yet

Feature: SCION: Proven in banking and healthcare, slow to spread everywhere else

The Register

#Uber redesigned its #MySQL fleet using a consensus-driven architecture based on MySQL Group Replication:
✅ Cluster failover dropped from minutes → seconds
✅ Leader election & failure detection now inside the database layer
✅ Improved availability, simpler orchestration, stronger consistency across thousands of production clusters

Learn more: https://bit.ly/4b8RWYL

#SoftwareArchitecture #DistributedSystems #Clusters #FaultTolerance #RelationalDatabases

A ​Consenso ⁤Probabilístico ⁣Rápido - Fast Probabilistic Consensus (FPC)#Criptomonedas #Blockchain’s #ConsensusAlgorithms #Cryptocurrencies #DistributedSystems #FastProbabilisticConsensus #FaultTolerance #FPC #NetworkConsensus #ProbabilisticAlgorithms #Scalability O Fast Probabilistic Consensus (FPC) é um algoritmo de consenso rápido e eficiente, com base na teoria dos grafos probabilísticos. Esta abordagem oferece uma sol...
https://djltrading.com/fast-probabilistic-consensus-fpc/?fsp_sid=39657
A ​Consenso ⁤Probabilístico ⁣Rápido - Fast Probabilistic Consensus (FPC)

O Fast Probabilistic Consensus (FPC) é um algoritmo de consenso rápido e eficiente, com base na teoria dos grafos probabilísticos. Esta abordagem oferece uma solução escalável e confiável para sistemas distribuídos.

Investimentos e Trading
Goal: This article will demonstrate how to add AI features to a Jakarta EE / MicroProfile application using LangChain4J‑CDI, with simple to implement examples that runs on Payara, WildFly, Open Liberty, Helidon, Quarkus or any CDI 4.x compatible runtime. Note: This is an updated article to the one...
#AIAgent #AIServices #faulttolerance #JakartaEE #langchain4j #langchain4jcdi #LLM #MicroProfile #OpenTelemetry
https://foojay.io/today/bring-ai-into-your-jakarta-ee-apps-with-langchain4j-cdi/
Bring AI into your Jakarta EE apps with LangChain4J-CDI – foojay

Looking for a single fiber media converter for fiber optic service level agreement requirements or a single fiber media converter for fiber optic network fault tolerance?

Versitron provides robust, reliable solutions engineered for uptime, stability, and performance in demanding fiber networks.

Ideal for mission-critical environments and high-availability infrastructure.

#Versitron #FiberOptics #MediaConverter #SingleFiber #NetworkReliability #FaultTolerance

What is Erasure Coding – A Shield Against Data Loss

Erasure Coding (erasure code) is a data protection mechanism that protects against data loss by breaking data items, such as files, into fragments, calculating additional data pieces (parity information), and storing them across a set of independent locations or storage media. For decades, traditional methods like replication have been the go-to solution for protecting against data loss or corruption. In recent years, however, a more efficient and resource-friendly technique has become more […]

https://www.simplyblock.io/blog/what-is-erasure-coding-a-shield-against-data-loss/

🔥 Behold the #PyTorch blog masterpiece: "Fault Tolerant #Llama Training" - because who doesn't love 2000 failures every 15 seconds? 😂💥 Forget checkpoints, because llamas are clearly bred for #chaos on a Crusoe L40S! 🙄✨
https://pytorch.org/blog/fault-tolerant-llama-training-with-2000-synthetic-failures-every-15-seconds-and-no-checkpoints-on-crusoe-l40s/ #Training #FaultTolerance #MachineLearning #HackerNews #ngated
Fault Tolerant Llama: training with 2000 synthetic failures every ~15 seconds and no checkpoints on Crusoe L40S – PyTorch