Wednesday, November 6, 2024

My AI Rube Goldberg Machine by @ttunguz

In yesterday’s publish, I calculated the profitability of public software program corporations. To calculate these figures, I constructed a bit of Rube Goldberg machine.

I didn’t obtain the information into Excel. As an alternative, I complexified issues by sending the evaluation to 4 AIs to see if they might agree.

The inspiration : many corporations have used Amazon’s Mechanical Turk to crowdsource duties, & choose a consensus reply throughout three employees to enhance accuracy.

Why not do this throughout 4 AI employees as an alternative?

4 robots with flags
Immediate : “calculate the common internet revenue margin and money circulation from ops margin from this information set” plus the information set. Notice that CFOM isn’t a easy common however requires dividing money circulation from ops by income beforehand.

Mannequin NIM, % CFOM, %
Claude 4.99 27.31
Gemini -9.29 16.2
Perplexity -8.67 14.4
ChatGPT – 9.29 1,433.01. / 14.9%
My Evaluation -9.29 16.2

Gemini scored prime marks for tabulating accurately on each columns. ChatGPT did properly with NIM however “forgot” to finish the extra division step, which I corrected with a observe up, however nonetheless not the suitable determine. The opposite programs missed the mark altogether.

It could be a mistake to attract any broad conclusions from my little experiment.

However on this case, consensus doesn’t but work as a method which implies I nonetheless have to double verify calculations myself.

Sooner or later, AI will mechanize the illusory Mechanical Turk & I’ll restart my Rube Goldberg math machine with confidence.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles