Qwen3.5 recipes shared for Jetson Thor

A Jetson Thor forum post shares setup recipes for running multiple Qwen3.5 models with NVIDIA's latest vllm repository for Thor. The largest reported working model is Qwen3.5-122B-A10B, with notes on NVFP4 and INT4 tradeoffs.

A new set of community recipes outlines how to run various models from the Qwen3.5 family on Jetson Thor. The shared guidance is based on NVIDIA’s latest vllm repository for Thor and is published through a linked GitHub repository intended for model deployment and experimentation.

The largest model reported to run is Qwen3.5-122B-A10B. The resharded NVFP4 version is described as a bit slower because it does not have MTP, but it is also described from experience as consistently better than the INT4 version. The note frames the recipes as practical instructions for users trying to evaluate performance and quality tradeoffs across quantized variants on Thor hardware.

The forum exchange is brief and focused on sharing working configurations rather than benchmarking details or broader documentation. A reply from an NVIDIA forum participant thanks the contributor for sharing the recipes, signaling community interest in reproducible ways to run large language models on Jetson Thor.

The surrounding forum activity shows sustained attention on generative model deployment on Thor, including discussions about Qwen variants, Nemotron, vllm containers, compatibility issues, and performance comparisons. That context positions the Qwen3.5 recipes as part of a larger effort by developers to tune local inference workflows for edge and robotics computing systems built on Jetson Thor.

55

Impact Score

Google and other chatbots surface real phone numbers

Generative Artificial Intelligence chatbots are surfacing real phone numbers and other personal details, sometimes by pulling from obscure public sources and sometimes by inventing plausible but wrong contact information. Privacy experts say users have few reliable ways to find out whether their data is in model training sets or to force its removal.

U.S. and China revisit Artificial Intelligence emergency talks

Washington and Beijing are exploring renewed talks on an emergency communication channel for Artificial Intelligence as fears grow over the capabilities of Anthropic’s Mythos model. The shift reflects rising concern in both capitals that competitive pressure is outpacing safeguards.

Artificial Intelligence divides employers as hiring and headcount shift

U.S. hiring beat expectations in April, but employers remain split on whether Artificial Intelligence should drive layoffs, productivity gains, or internal redeployment. At the same time, candidate use of Artificial Intelligence is outpacing employer adoption in hiring, adding new pressure to screening and entry-level recruiting.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.