Could a shared dataset of real mobile apps accelerate ´vibe coding´ with large language models?

A public dataset of real mobile app UIs could transform how large language models tackle ´vibe coding´ and automate mobile workflows, says Droidrun’s creator.

As the Droidrun team undertakes the challenge of building an agent framework capable of autonomously navigating mobile apps through their real UI structure, a pivotal obstacle has emerged: the absence of a publicly accessible dataset that captures real Android app UI hierarchies, screen flows, and associated metadata. This gap in data not only hinders Droidrun but creates friction for the broader community working to empower large language models with a more grounded, context-rich understanding of mobile interfaces.

The founder poses a key question to the maker and developer community: would a comprehensive dataset aggregating real-world app UI trees, screen transitions, component types, and contextual metadata enable ´vibe coding´ agents to generalize more effectively across varied applications? Despite recent advancements, developing agents that intuitively interact with mobile UIs still requires significant manual tuning and repetition, as current models rely heavily on prompts and heuristics to ´feel right´ when navigating distinct app experiences. The lack of shared data keeps teams siloed and slows progress.

Envisioning a curated repository that spans categories like shopping, social, finance, and utilities, complete with detailed structural metadata—such as buttons, lists, inputs, navigation flows, and UX patterns—the author invites feedback: would access to such a dataset reduce the time spent on prompt tuning or help achieve more consistent agent alignment? Or, conversely, is the effort to amass this data unlikely to move the needle on reliable agent behavior? Community members are encouraged to share reflections, frustrations, and past experiences, potentially shaping the future of large language model-driven automation in mobile app contexts.

68

Impact Score

Google and other chatbots surface real phone numbers

Generative Artificial Intelligence chatbots are surfacing real phone numbers and other personal details, sometimes by pulling from obscure public sources and sometimes by inventing plausible but wrong contact information. Privacy experts say users have few reliable ways to find out whether their data is in model training sets or to force its removal.

U.S. and China revisit Artificial Intelligence emergency talks

Washington and Beijing are exploring renewed talks on an emergency communication channel for Artificial Intelligence as fears grow over the capabilities of Anthropic’s Mythos model. The shift reflects rising concern in both capitals that competitive pressure is outpacing safeguards.

Artificial Intelligence divides employers as hiring and headcount shift

U.S. hiring beat expectations in April, but employers remain split on whether Artificial Intelligence should drive layoffs, productivity gains, or internal redeployment. At the same time, candidate use of Artificial Intelligence is outpacing employer adoption in hiring, adding new pressure to screening and entry-level recruiting.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.