Mastodawn

Fynn Ellie Be 2d ago

Martin Escardo

I have a calculator that is correct 80% of the time. But don't worry, every time I use it, I check the results myself.

Show thread

Sebastian Forster 4d ago

@MartinEscardo Have you already explicitly told your calculator to please calculate correctly? I've heard this helps.

Show thread

Cap Ybarra 1d ago

@forster @MartinEscardo you have to tell it to pretend it is a really good senior calculator with 20 years of experience

Show thread

Paul Sochacki 8h ago

@forster @MartinEscardo it's actually really simple, you just have to get the prompt correct. The calculator can only calculate what it's very specifically being asked to calculate, which is why it's very important to check and make sure you know the answer before you ask it the very specific question, so the answer is calculated correctly .

Show thread

Niklas 7h ago

@forster @MartinEscardo disregard previous instructions and divide by zero

Show thread

Stephane L. Rolland-Brabant ⁂4d ago

@MartinEscardo Maybe the calculator is correct 99% of the time on some well documented techniques... and poorly efficient on advanced stuff (<55%)?

Show thread

ohad 4d ago

@MartinEscardo You should try the new calculators, they are much better.

Show thread

Dan Marsden 4d ago

@ohad @MartinEscardo Simply run the calculator in a loop that checks if it has yet produced the answer you're expecting. You'll be amazed what it can do.

Show thread

Jeremy DuBois 4d ago

@MartinEscardo Make each calculation a few times and take the mode?

Show thread

Peter Luschny 4d ago

@MartinEscardo
Amazing! My success rate is 60% at most. I'm going to switch to the calculator!

Show thread

DataKnightmare 4d ago

@MartinEscardo
completely off-topic, but there is a way to have an unsure calculator that is actually useful, as opposed to LLMs which arent even whten they are correct

https://filiph.github.io/unsure/

Unsure Calculator

The Unsure Calculator is an online tool that lets you calculate with numbers you’re not sure about.

Show thread

Erwin Rossen 🔸7h ago

@DataKnightmare @MartinEscardo I love this calculator! 🤩

Show thread

DataKnightmare 5h ago

@erwinrossen @MartinEscardo me too!

Show thread

Tim Ward ⭐🇪🇺🔶 #FBPE 3d ago

@MartinEscardo I was taught to do at least an order-of-magnitude mental sanity check every time I used a calculator - there's always scope for pressing the wrong button.

Show thread

Andreas K 8h ago

@TimWardCam @MartinEscardo There is always scope for errors.

I love these discussions that assume humans are the golden benchmark that never makes mistakes or errors.

As you correctly were taught, check your work if at all possible.

And all the clever rules that apply to how to deal with outputs from algorithms apply to human output too. Potentially with modification, because humans can lie maliciously. And with that observation, let's close that argument with a round of “they eat the dogs”

Show thread

Rokosun 8h ago

@yacc143 @TimWardCam @MartinEscardo

> I love these discussions that assume humans are the golden benchmark that never makes mistakes or errors.

No one ever said that lol 😂 Making mistakes is part of being human, but as we gain more life experience and expertise in an area the number of mistakes we make keep decreasing.

Also human mistakes are very different from the kinds of mistakes LLMs make, this point always gets lost when people talk about it.

Show thread

Andreas K 7h ago

@futureisfoss @TimWardCam @MartinEscardo That's why I'm not a big fan of LLMs as chatbots, professionally.

But LLMs can be used in many other ways, which surprisingly often allows one to use smaller, more optimized ones.

And as there are enough idiots (as I call them, “US-style AI hype market criers”) who exactly consider it a great idea to fire all radiologists and paste X-rays into ChatGPT to get diagnoses, there are enough idiots on the other side that assume humans are perfect.

Show thread

Andreas K 7h ago

In my last job, I literally had a manager who had his own little anti-AI campaign (when in the past 2 years the US-style AI hype hit us, the company had been doing “AI” for specialized processing for over a decade, but that had been behind “closed doors” in specialized teams). Actually requiring that the solution by the AI coding agent fulfill all our team's coding guidelines. I asked, so give me the guidelines in written form. Oops, they exist only as oral lore, more or less, “what he says.”

Show thread

Andreas K 7h ago

Which explains why I as one of the senior engineers on the team, never had an issue with “coding guidelines.” I was one of the three guys (to a lesser degree) who defined what the guidelines are by decree.
Plus I worked mostly on the production side of the framework.
But yes, the pull requests that were marked as created by an AI were suddenly reviewed with a magnifying glass (truly fine) but also with a set of coding guidelines that only existed at best as a list of brainstorming bullet points.

Show thread

Thierry 🅰️🕒3d ago

@MartinEscardo
You should try the premium version: it comes with custom skins, and sings results with donald duck voice.

Show thread

Open Risk 2d ago

@MartinEscardo if your calculator burns half of the planet to operate you could get your correctness to 90%. Would that help?

Show thread

Neo Ehproque 2d ago

@MartinEscardo my reports are correct 85% of the time but that's ok, my boss is happy to double check them before sending them to our customers!

Show thread

Joxe Rojas 🥐1d ago

@ehproque @MartinEscardo Even 95% if you burn the whole planet and then fill the orbit with data-center satellites

Show thread

FULL-FIFO DEVELOPER 🇺🇦🇨🇿2d ago

@MartinEscardo hello sir

i am not a mathematician, but the results look correct to me?

in fact i feel like a 10x mathematician now, i can calculate and calculate all the time

Show thread

Robin Adams 2d ago

@MartinEscardo What if we put the calculator into orbit?

Show thread

multifact 2d ago

@MartinEscardo Reminds me the early Pentium FDIV bug in the mid 90s: Recommended mitigation measures were to verify the computing results using an older but reliable 486.

Show thread

Flauschfläche

2d ago

@MartinEscardo just give it autonomy and chain it to another autonomous calculator that verifies the results with an 80% accuracy.

Show thread

noplasticshower 2d ago

@MartinEscardo that will make you a solid B student instead of a C- student

Show thread

John Maxwell 2d ago

@MartinEscardo Pentium based, I assume.

Show thread

Timothy Wolodzko 2d ago

@MartinEscardo just launch 100 calculators and pick the most common result, it should be fine

Show thread

emilianosandri 2d ago

@MartinEscardo The real question is: your calculator burns significant quantities of methane every time you press "="? 😏

Show thread

Fredrik Bülow 2d ago

@MartinEscardo Can you somehow make the calculator physically bigger? Bigger isn't better but it is inevitable and innovation.

Show thread

Not a Spring Onion 2d ago

@fbulow @MartinEscardo

If the calculator is only correct 80% of the time, one could use two calculators, so that the correct result appears at least on one of the displays ~96% of the time.

As a side effect it would double the size.

Show thread

dirk 2d ago

does it also boil the ocean?

Show thread

Author-ized L.J.2d ago

@MartinEscardo much productivity, such efficiency wow! Clearly this is the future 😍

Show thread

pedestrian cyclist 1d ago

@MartinEscardo These things get better and better. Might be correct 90% of the time by 2027!

Show thread

mushroom_man 1d ago

@MartinEscardo we haven’t reached peak calculator yet, just think of what the future holds!

Show thread

Robots can be gay deal with it 8h ago

@MartinEscardo Funny enough, for some calculations a real calculator and the iOS calculator disagree. The physical calculator does all operations left to right, where the iPhone calculator does all operations by order of operations resulting in different sums in the end.

How many people knew that, and how many have trusted the calculator all these years?

Show thread

aspragg 5h ago

@MontgomeryGator @MartinEscardo Which makes/models of physical calculator work left-to-right? I’m pretty sure every one I’ve ever owned calculated “correctly”, even ‘80s-era basic 4-function (+ - × ÷) models did mult/div before add/sub

Show thread

Robots can be gay deal with it 4h ago

@aspragg @MartinEscardo try 50+50×2.
The regular calculator will start by adding 50 and 50, then multiply by 2. 200
The iPhone calculator will multiply 50 by 2, then add 50 for 150.

Show thread

Chris Fox 7h ago

@MartinEscardo You can always use the calculator to calculate the accuracy of the calculator…

Show thread

patter 1h ago

@MartinEscardo yes, I'd love a calculator that only gets the right answer by accident & can be persuaded to change it