Wednesday, November 6, 2024

Chatting With Her – The ChatGPT App on Mac by @ttunguz

image
For the previous few days, I’ve been utilizing the Mac ChatGPT app OpenAI demonstrated final Monday.

It’s unquestionably the way forward for human-computer interplay. Conversing with a pc is far more pure than typing. Think about talking t a colleague with the complete web at their disposal. But additionally a verbose colleague with out a lot sense of social cues.

Tapping a keyboard shortcut, the ChatGPT app hundreds & 4 little bars paying homage to Google transcription software program seem within the app. You possibly can select from 5 mellifluous voices who communicate fluidly & naturally.

The pc is affected person with unstructured ideas. I used it to stipulate this put up. I switched between the strengths & challenges of the product randomly. The software program categorized & organized, remodeling the rambling mess of sentences to a top level view.

Verlyn Klinkenborg would nonetheless have some work to do, modifying the define. Simply as most LLMs are verbose so is that this app. For preliminary content material evaluate, that’s effective, however after the fifth or sixth iteration, it’s sooner to interrupt the speaker by clicking.

A hybrid voice & textual content mode would enhance right this moment’s both/or expertise. Seeing the doc evolving because the speaker chats would assist with modifying.

Generally the dialog phantasm breaks. The voice interrupts & says she couldn’t hear all of my sentences. Or the size of the enter speech is just too lengthy & the app retries a number of instances, processing smaller chunks of voice, leaving the consumer unclear on how a lot of the dialog has been captured. At the least, this was my notion.

Different instances, the app isn’t affected person with an um or a considerate pause. I think about this a difficult drawback : when is a pause a sign to talk or wait?

However general it’s clear the place assistants like this can go : draft an e-mail & ship it. Delegate a job in Asana with a due date. Evaluate an online web page & summarize it in a weblog put up with some commentary, then verify the grammar & publish it. All by means of voice.

Years in the past, I wrote The Daybreak of the Voice-to-Textual content Period & The Quickest Consumer Interface on why I believe voice is the long run dominant consumer interface.

We’re a lot nearer to that imaginative and prescient than ever.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles