UI Automation beat my screenshot loop by 17x
I had built a Windows desktop agent around the obvious loop: take a screenshot, send the PNG to a vision LLM, ask where to click, click it, repeat. It…
Read →Category
11 posts tagged windows.
I had built a Windows desktop agent around the obvious loop: take a screenshot, send the PNG to a vision LLM, ask where to click, click it, repeat. It…
Read →I needed to slipstream a Windows 11 install across a handful of VMs on Proxmox without sitting in front of a noVNC window pressing Enter for an hour. The…
Read →Swapped a USB-to-Ethernet adapter on a ThinkPad to fix a flaky link. After the swap, the laptop dropped off the network entirely. No reservation hit in…
Read →I had a fleet-status dashboard for some Windows boxes that pulled disk encryption state via manage-bde -status. Several rows were reporting hard errors…
Read →I wanted to PXE-boot a fully unattended Windows 11 25H2 install onto a VM with no TPM, no Secure Boot, and not enough RAM by Microsoft's reckoning. The…
Read →PXE booting a Windows 11 install is fine; getting a freshly installed VM to accept SSH from my homelab without me touching it is the actual interesting…
Read →The macOS companion for a personal safety app I run does one small thing: every five minutes, if the user was active in the last minute, it POSTs a…
Read →I needed to download Windows 11 25H2 ISO bits via the UUP build flow. Aria2c started fine, pulled the small metadata files (a few hundred KB total), and…
Read →I was building a custom Windows 11 install ISO and trying to do a clever registry tweak inside boot.wim to make in-place upgrade scenarios work the way I…
Read →I write deployment scripts for Zebra ZQ521 mobile printers running CPCL. Two of the standard config knobs — disable bidirectional communication and…
Read →I shipped a small side-project that wraps Windows' Unified Write Filter (UWF) — the feature that lets you "freeze" a system drive so nothing written to…
Read →