Standalone OpenAI transcription Python script

OpenAI announced some new audio models yesterday, including new transcription models(gpt-4o-transcribe and gpt-4o-mini-transcribe). This model performs better than Whisper, though because they are built on top of language models, they're prone to prompt injection attacks. You can also access it using the Realtime API. After a bit…

Assorted links(February 2025)

Import AI 397: DeepSeek means AI proliferation is guaranteed; maritime wardrones; and more evidence of LLM capability overhangsWelcome to Import AI, a newsletter about AI research.Import AIJack Clark But beneath all of this I have a sense of lurking horror - AI systems have got so useful that the…

Running ML models in Elixir using Pythonx

I just discovered Pythonx, which runs a Python interpreter in the same OS process as Elixir. Looks like it also wraps uv so it sets up a virtual environment with all the Python dependencies your code needs. Here's how I got the MLX version of the recently released…

Book mini-review: "Children of Ruin"

Children of Ruin is the second book in Adrian Tchaikovsky's Children of Time series. It follows another branch of the terraforming project that began in the first book. Interesting read with non-human intelligences which feels relevant because of all the recent AI buzz and the apparent imminence of…

Building a Membrane Pipeline to talk to Google Gemini

This post was also published in the Software Mansion blog. Large Language Models like Google’s Gemini 2.0 Flash and OpenAI’s GPT-4o Realtime are multimodal, meaning users can chat with them via text, talk to them directly like in a conversation, or even send a live video feed.…

Assorted links(January 2025)

Speed matters: Why working quickly is more important than it seems « the jsomers.net blogJames Somers, July 26, 2015REVIEW: Reentry, by Eric BergerReentry: SpaceX, Elon Musk, and the Reusable Rockets that Launched a Second Space Age, Eric Berger (BenBella Books, 2024).Mr. and Mrs. Psmith’s BookshelfJohn PsmithSchool is Not…

TIL: Avoid modifying headers with Dart HTTP package

I'm using the http package to make a POST request to OpenAI, and the API seems to only strictly accept application/sdp as the Content-Type: flutter: Response body: {"error":{"message":"Unsupported content type. This API method only accepts 'application/sdp' requests,…