Insights from Stack Overflow Developer Survey Using AI - Data Science Agent in Google Colab

AI Prompt Interaction
🤖 What is Google Colab?

Google Colab is a free online platform that lets you write and execute Python code in the browser — with zero setup. It's ideal for data science, machine learning, and AI-powered tools like the Data Science Agent.

Learn About Colab

Explore how developers and data analysts can extract meaningful insights from the Stack Overflow Annual Developer Survey using Google's Colab Data Science Agent. This guide demonstrates how to upload the CSV dataset, ask natural language questions, and receive intelligent analysis instantly — all without writing a single line of code. Perfect for beginners and professionals looking to quickly uncover trends in developer tools, AI usage, and tech preferences.

📊 Download the Survey Data

To explore developer trends and analyze real-world data, visit the official Stack Overflow Developer Survey site. You can download the full dataset in CSV format and start your own AI-driven analysis.

Visit Stack Overflow Survey

🔍 Step-by-Step: How the Colab Data Science Agent Works

Colab Gemini for  Data Science Agent
  1. Upload Your Data File: Begin by uploading your dataset in CSV format. This provides the foundation for the AI's analysis.
  2. Ask a Question (Prompt): Enter a natural language question — like “What is the average salary by country?” or “Show top 5 countries by developer count.”
  3. Execute the Plan: The AI will first generate a plan based on your prompt. Review it, then click the “Execute Plan” button to proceed.
  4. Data Cleaning: The agent prepares the data by handling missing values, standardizing formats, and ensuring it's analysis-ready.
  5. Reasoning & Analysis: The AI interprets your intent, selects relevant columns, and builds a logical reasoning chain to guide the analysis.
  6. Python Code Execution: Based on the reasoning, Python code is auto-generated, executed, and the results are displayed — often with charts or tables depending on your query.

After entering your prompt, the Colab Data Science Agent will first present a structured plan. You’ll be prompted to click the “Execute Plan” button to proceed. During execution, if the requested information isn’t available within the uploaded dataset—or if relevant columns aren’t found—the agent may attempt to retrieve external data to complete the analysis. However, you can restrict the agent to work strictly within the given data table. In such cases, if the necessary data or columns are missing, the agent will notify you that the required information is not available in the dataset.

AI agent going to Internet for data

AI Agents in Data Science: Smarter Insights with Less Effort #colab #stackoverflow

To help you explore the Stack Overflow survey data more effectively, we've included a curated list of natural language prompts. These sample questions are designed to work seamlessly with Colab’s Data Science Agent, allowing you to uncover trends, compare tools, and generate insights without writing complex code. Simply ask, and the AI will analyze the data for you.
AI will give you Python code which you can pass to a cell and execute the same to get the result.

🧑‍💼 Demographics & Background

  1. What is the distribution of respondents by country? (Field: Country)
  2. What age groups do most developers fall into? (Field: Age)
  3. What is the highest level of education achieved? (Field: EdLevel)

💻 Professional Experience & Roles

  1. What are the most common developer roles? (Field: DevType)
  2. What is the distribution of coding experience among respondents? (Field: YearsCode)
  3. How many years have participants been coding professionally? (Field: YearsCodePro)
  4. What types of organizations do developers work in? (Field: OrgSize)
  5. What are the most common employment types? (Field: Employment)

📚 Learning & Education

  1. How do developers typically learn to code? (Field: LearnCode)
  2. What platforms are most used for learning coding online? (Field: LearnCodeOnline)

🛠️ Technology & Tools

  1. Which programming languages are most commonly used? (Field: LanguageHaveWorkedWith)
  2. What languages do developers want to work with next? (Field: LanguageWantToWorkWith)
  3. Which web frameworks are widely used? (Field: WebframeHaveWorkedWith)
  4. What databases are developers using? (Field: DatabaseHaveWorkedWith)
  5. What tools do developers want to explore? (Field: ToolsTechWantToWorkWith)

🤖 AI & Machine Learning Tools

🛠️ Tools Usage: Actual Experience (AISearchDevHaveWorkedWith)

  1. Which AI developer search tools have respondents used in their workflow?
  2. What are the most commonly used AI-powered search tools among developers?
  3. How does AISearchDevHaveWorkedWith vary across different DevType roles?
  4. Are certain tools more commonly used in specific countries (Country) or industries (OrgSize)?

🌟 Aspiration: Tools Developers Want to Use (AISearchDevWantToWorkWith)

  1. Which AI search tools are most desired by developers?
  2. What tools do developers aspire to work with but haven’t yet used?
  3. How do preferences vary by experience level (YearsCode, YearsCodePro)?
  4. Which tools show the largest gap between current use and future interest?

💬 Perception: Admired Tools (AISearchDevAdmired)

  1. Which AI tools are most admired by the developer community?
  2. What are the most admired tools that have low actual usage?
  3. How does admiration differ across demographics like EdLevel, or Country?

🔍 Comparative Insights

  1. Which tools appear in all three columns: used, wanted, and admired?
  2. Which tools are admired but not widely used or requested?
  3. What trends emerge when comparing AISearchDevHaveWorkedWith vs. AISearchDevAdmired?

📊 Cross-Dimensional Relationships

  1. Do developers who admire certain AI tools (from AISearchDevAdmired) report higher satisfaction (JobSat)?
  2. Is admiration or usage of specific tools related to compensation levels (CompTotal)?
  3. How does admiration differ by remote work preference (RemoteWork)?

🌐 Work Environment & Preferences

  1. What are the most common operating systems for development? (Field: OpSysProfessional use)
  2. What are the primary work environments (remote, hybrid, in-office)? (Field: RemoteWork)

🔍 Try with Other Datasets

Once you're comfortable analyzing the Stack Overflow survey, take it a step further! Apply the same natural language approach to other popular datasets available in public repositories. Some great examples include:

  • titanic.csv – Predict survival based on passenger data
  • iris.csv – Explore flower classification with simple features
  • diabetes.csv – Analyze health indicators and outcomes
  • housing.csv – Understand housing price patterns and predictions

You can upload these datasets to Google Colab and use the same Colab Data Science Agent to ask questions and generate Python code automatically.



💡 Try it yourself — no coding required!

You can explore all the SQL query solutions interactively using the Google Colab Data Agent by simply typing your questions ( List of Queries for Student table is here ) in plain English.

Open Colab Notebook on GitHub


Query Excel, CSV & SQLite with Plain English Using Google Colab Data Agent | No Code SQL Demo

✅ Conclusion

AI-powered data analysis is no longer just for coders — with tools like Colab’s Data Science Agent, anyone can gain meaningful insights from complex datasets using simple language. Whether you're a student, data enthusiast, or business analyst, this no-code approach unlocks new possibilities for exploring data. Start experimenting today and let AI do the heavy lifting.


Author Image

👤 About the Author

Subhendu Mohapatra is the creator of Plus2net.com and a dedicated developer focused on AI-powered tools, data analysis, and content automation. He regularly experiments with platforms like Google Colab, Python data workflows, and prompt engineering to explore practical uses of AI in digital content and analytics.

Driven by a passion for knowledge sharing, he helps others build technical skills and leverage AI more effectively in their personal and professional workflows—often contributing on a voluntary basis through tutorials, code samples, and real-world guidance.

🎥 Join me live on YouTube LinkedIn X (Twitter)
Subscribe to our YouTube Channel here