Vibe-Codin’: Photographic Image Quality Wrangling – Stream of Consciousness

Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses – 9to5google

Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses

Abner Li | Jan 27 2026 – 11:40 am PT

1 Comment

Agentic Vision is a new capability for the Gemini 3 Flash model to make image-related tasks more accurate by “grounding answers in visual evidence.”

Frontier AI models like Gemini typically process the world in a single, static glance. If they miss a fine-grained detail — like a serial number on a microchip or a distant street sign — they are forced to guess.

This new approach “treats vision as an active investigation” by combining visual reasoning with code execution and other tools in the future.

To answer prompts with images, Gemini 3 Flash will formulate “plans to zoom in, inspect and manipulate images step-by-step.” Specifically, Agentic Vision leverages a “Think, Act, Observe loop.”

  • Think: the model analyzes the user query and the initial image, formulating a multi-step plan.
  • Act: The model generates and executes Python code to actively manipulate images (e.g. cropping, rotating, annotating) or analyze them (e.g. running calculations, counting bounding boxes, etc).
  • Observe: The transformed image is appended to the model’s context window. This allows the model to inspect the new data with better context before generating a final response.
  • Instead of just describing an image it’s given, Gemini 3 Flash “can execute code to draw directly on the canvas to ground its reasoning.” One example of this image annotation in the Gemini app is asking “to count the digits on a hand.”

    To avoid counting errors, it uses Python to draw bounding boxes and numeric labels over each finger it identifies. This “visual scratchpad” ensures that its final answer is based on pixel-perfect understanding.

    Meanwhile, Gemini 3 Flash will zoom in when it detects fine-grained details in the image. Agentic Vision can also “parse high-density tables and execute Python code to visualize the findings.”

    Agentic Vision results in a “consistent 5-10% quality boost across most vision benchmarks” for Gemini 3 Flash.

    This is starting to roll out to the Gemini app with the Thinking model. For developers, it’s available today with the Gemini API in Google AI Studio and Vertex AI. 

    Continue/Read Original Article Here: Gemini 3 Flash’s new ‘Agentic Vision’ improves image responses

    #9to5GoogleCom #AgenticVision #ExecuteCode #Gemini #Gemini3Flash #GeminiApp #Google #ImageQuality #NewFromGemini
    It caught me off guard twice already: If I use shutter priority mode but forget to set my ISO properly, I can get overexposed jpeg files like these, even though it looks fine in the viewfinder. The only warning you get is a blinking focal length setting. The histogram probably looked odd too but histogram itself is usually too faint for me to notice the problem.

    I switched to manual mode and it works for me so far. Once I find out how to set auto ISO, I should be able to avoid this situation in the future.

    #omsystem
    #omsystemom3
    #isosetting
    #imagequality
    I’ve been having trouble nailing focus with my OM-3. For example, in this shot the pergola keeps going out of focus, while the tree in the foreground ends up sharp instead. 😔

    It explains why I felt the image quality of OM-3 is not as good as my #fujifilm cameras. At least I know why now.

    #om3
    #omsystemom3
    #focus
    #camera
    #imagequality
    I think I am spoiled by Fujifilm's 40M+ sensor. OM-3 is a capable and good-looking camera. On a smaller screen, the photos captured by OM-3 look impressive. However when I view them on my 27" monitor, I cannot help but think: my fujifilm camera can do better. 😊

    At the end of the day, M43 is really compact. I am going to invest more time to get proficient with OM-3.

    #omsystem
    #om3
    #imagequality
    We Are Post ISO Now

    A camera's ISO range matters a lot less than it used.

    PetaPixel
    🎨📸 Look, #Apple has reinvented the wheel again! Now you too can decipher the arcane magic of #HDR with a string of random numbers and letters. Who knew image quality relied on summoning the ancient text of #JFIF and AMPF? 🙄📜
    https://walzr.com/HDR2.jpg #AMPF #imagequality #innovation #HackerNews #ngated
    How NASA Is Testing AI to Make Earth-Observing Satellites Smarter
    --
    https://www.jpl.nasa.gov/news/how-nasa-is-testing-ai-to-make-earth-observing-satellites-smarter/ <-- shared technical article
    --
    https://youtu.be/1tQ03iCorXk?si=nV1s7atllo5HTHhs <-- video overview / summary
    --
    "A technology called Dynamic Targeting could enable spacecraft to decide, autonomously and within seconds, where to best make science observations from orbit..."
    #GIS #spatial #mapping #satellite #remotesensing #earthobservation #targeting #technology #AI #dynamictargeting #satellite #imagery #sciencedata #targeting #JPL #clouds #wildfire #artificalintelligence #deeplearning #cloudcover #imagequality
    #NASA
    The Panasonic S1 II Is Secretly a Groundbreaking Full-Frame Camera

    The magic of dual gain output.

    PetaPixel
    Glass Imaging Raises $20 Million Funding Round to Expand AI Imaging Tech

    Glass Imaging could transform smartphone image quality.

    PetaPixel