Skip to content
View grahamwaters's full-sized avatar
👓
Likely writing JavaScript simulations
👓
Likely writing JavaScript simulations

Block or report grahamwaters

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
grahamwaters/README.md

ᴘʀᴏꜰᴇꜱꜱɪᴏɴᴀʟ ᴘᴏʀᴛꜰᴏʟɪᴏ


My Vita is still in beta, but you can find it by clicking below.

Vita

Portfolios by Topic

Data Analysis Data Science Machine Learning

Profile views

Summary of Skills

Python Audacity Discord Git GitHub Desktop Google Sheets Jupyter Stack Overflow Visual Studio Code Microsoft Word Microsoft Excel GPT-3 OpenAI GitHub Copilot Adobe SKLearn Pandas Numpy Requests BeautifulSoup Regex iOS HTML5 Markdown SQL R Seaborn Matplotlib TensorFlow Keras ChatGPT

🧪 Core Data & ML Tools

Python R C++ NumPy Pandas Matplotlib Seaborn Scikit-Learn Machine Learning Text Mining Text Classification Data Mining Data Cleaning Data Visualization Data Analysis Statistical Modeling Supervised Learning Logistic Regression Linear Regression Natural Language Processing AI AI Prompting LLM Prompting

🛠 Data Tools & Software SQL Power BI Excel Microsoft Office PowerPoint SPSS Cognos GitHub Copilot Visio Plotly OS X
📚 Education & Pedagogy

Mathematics Education Elementary Education Science of Teaching Reading ESL Teacher Training Classroom Management RTI Lesson Planning Guided Reading Student-Centered Learning Phonics IXL Frontline

💼 Communication & Business Skills

Project Management Technical Support Editing Writing Academic Writing Communication Leadership Teamwork Problem Solving Interpersonal Skills Customer Service Organizational Skills Public Speaking Troubleshooting Analytical Skills Direct Sales

🌍 Additional Skills & Languages

Spanish Apple Software Computer Science Teaching Requirements Gathering


LinkedIn Twitter YouTube GitHub HuggingFace Medium Profile Towards Data Analytics GlassBox
LinkedIn Twitter YouTube GitHub HuggingFace Medium Towards Data Analytics GlassBox

My Current Go-To Tools

Python R Tableau scikit-learn numpy matplotlib seaborn Black pandas

💼 Professional Experience


• Develop complex prompts to expose LLM edge-cases in maths reasoning • Annotate & tune speech-to-text models for linguistic precision • Full-stack model lifecycle: scoping → training → QA

Mathematics NLP AI Prompting

Career Break – Exploration & Upskilling

Focused on upskilling in Python-centric, research-heavy prompt-engineering roles that intersect neurology, energy, aerospace, & urban tech.

Austin Python Meetup – Volunteer Co-Organizer

Grew the community, curated speakers, and ran hands-on AI-prompting workshops.

AI Prompting Public Speaking

• Delivered TEKS-aligned math & science lessons to 30-40 students • Embedded ed-tech & data-driven RTI strategies into daily practice

Classroom Management RTI Lesson Planning AI Prompting

▶︎ Prior Roles (Oct – Dec 2023, Aug – Oct 2023)
• 4th-Grade Math & Science Teacher (3 mo)
• Guest Educator across 3rd-6th grades (3 mo)

Student-Centred IXL

Part of the aiOps Tiger Team—rolled out ML dashboards (QuickSight, Domo) & LLM prompt R&D.

Python Data Analytics LLM Prompting

▶︎ Data Analyst Intern (Mar → May 2023) Automated ETL pipelines for partner districts; modelled student-outcome predictors (Python, Cognos, Domo).

Contract analytics (IBM Cognos, Plotly) & intensive ML skill-building bootstraps.

Scikit-Learn Data Viz Text Mining

Automated production reporting (Power BI + Python + VBA) → 20 % throughput uplift, first data hire on site.

Power BI Excel Data Science

Texas State University — Classroom Support Technologist

• Provided frontline tech support for iTV classrooms • Designed MS Visio diagrams of A/V layouts for events

Technical Support Visio Interpersonal Skills

The Home Depot — Appliance Salesperson & Electrical Associate

• 10× Homer-Award‐winning customer service & sales • Optimised merchandising layouts to lift engagement

Direct Sales Customer Service Interpersonal Skills

Herdmark Media — Video Production Intern

• Shot & edited ag-marketing footage (Canon C300) • Storyboarded campaigns for digital-first audiences

Project Management Interpersonal Skills

Texas Tech University — University Student

Electrical Eng., advanced maths, comp-bio & CS; Majored in University Studies – Math · Plant & Soil Science · Leadership

C++ Analytical Skills Public Speaking

  • 🌍 I'm a lifelong data scientist based in Austin, Texas, with a strong skill set in Python programming, and data analysis using Pandas, NumPy, PowerBI, and Excel.

  • 🎓 I hold a Master's in Strategic Analytics from Brandeis University and a Bachelor's degree from Texas Tech University, majoring in University Studies in Mathematics, Agricultural Leadership, and Plant and Soil Science.

  • 🔭 Previously, I interned as a Data Analyst for an Ed-Tech Coaching Company, engage2Learn, Inc., where I served on a multidisciplinary 🐯 Tiger team of ten specialists to integrate modern ML & AIOps Tools into our product offerings: Algorithmic prediction, GPT-4, chatGPT, and ML for ed-tech data, coaching effectiveness, and promoting educator well-being. They have many e2L projects, from determining use cases for large language models in our codebase to developing data visualization dashboards using Domo and AWS QuickSight.

  • 🧩 I'm proficient in various tech tools and libraries, such as VSCode, ChatGPT, Plotly, Seaborn, OpenAI, GitHub Copilot, Sklearn, Machine Learning, OpenCV, Regex, NLTK, and SpaCy.

  • 📚 I am also a Technical Author on Medium, contributing as a Top Writer for MLearning.AI with over 130+ published articles and 200 personal followers.

  • 🌱 I'm expanding my knowledge in Neural Networks and Text Summation in Sklearn, and I am keenly interested in exploring Natural Language Processing and GPT-4 with OpenAI.

  • 📫 Feel free to contact me on LinkedIn.


Projects Currently Under Construction 🏗️

Project Name Status Metrics Focus Est. Completion
Milton last commit code size commit activity Cross-platform GUI toolkit for “making your things just right” – written in Python with a minimal shell harness. Q4 2025
SceneSort Clustering last commit code size commit activity CLIP-powered image / video scene detector that groups footage via DBSCAN / HDBSCAN, then auto-organises your folders. Beta – Summer 2025
WatchMeOS last commit code size commit activity Experimental “gesture-first” operating shell that fuses pose-estimation, fuzzy logic & custom templates. TBD
density_prompt last commit code size commit activity Research repo exploring prompt‐engineering heuristics vs. token-density for controllable LLM output. TBD
RL-GPT last commit code size commit activity Reinforcement-learning pipeline for fine-tuning GPT-4 style models with custom reward functions. Prototype – End 2025
GrahamsSimulations last commit code size commit activity Gallery of JS, HTML, CSS & Python mini-sims (physics, AI stick-figures, orbital mechanics, etc.). Rolling
PySeas last commit code size commit activity NOAA buoy-cam harvester that chases the perfect sunset using computer vision & geospatial filters. Alpha refresh in 2025
Lorebook Generator for NovelAI last commit issues Wiki-scraper & JSON builder that bootstraps lorebooks for NovelAI authors. v1.0 candidate
UAP Report Analysis last commit NLP & data-viz dive into the 2021 UAP disclosure. Maintenance
Reddit NLP Analysis last commit PushShift-powered Reddit text miner for trend sentiment. Back-burner

Keynotes & Presentations 📢

Presentation Name Topics Focus Date Location & Organization Link
Augmenting Your Workflow with AI Assistants: From GitHub Copilot to chatGPT GitHub OpenAI copilot novelai GitHub Copilot, chatGPT, LLMs Feb 8th 2023 Austin Python Meetup, BlackLocus Watch Here
Using The Faker Package to Solve Real Challenges with Synthetic Data Synthetic Data, CRM, GPT-4, Ethics Faker 2023-05-16 Austin/Washington DC Python Meetup Watch Here

Some Recent Articles

Title Description Published Date Read Time Publication
GPTeaching and Transformative SCRUM in K-12 Education Why SCRUM and GPT together are perfect for young learners May 18 8 min read MLearning.ai
Leveling Up the Turing Test: Emulation Games and the Evolution of Model Intelligence in 2023 A Multi-modal, Multiplayer, Agent Testing, Social Deduction Game Method for Modern AI Evaluation May 16 12 min read MLearning.ai
Debunking the Hype of LLMs Why LLMs Will Not Take Over the World, we think May 15 3 min read GlassBox
Are You Artificially Intelligent? Because the Winter is Coming May 13 13 min read GlassBox
Pandas Get Dummies for Dummies A Quick Survey of One-Hot Encoding with Pandas in Python3 May 13 3 min read Towards Data Analytics
Generating Nearly Random Numbers using The Mysterious Waves of the Bermuda Triangle Click If You Dare May 12 3 min read GlassBox
The Deathbed Confessions of a Very Dirty Roomba I was never truly loved, only used. May 12 8 min read GlassBox
Are You an Excessive Python File Opener? Meet Pickle. Pickle: A Particularly Persuasive Package for Python Programmers May 10 3 min read Towards Data Analytics
The power of GitHub Copilot and ChatGPT working together A presentation for the official Austin Python Meetup May 10 1 min read Towards Data Analytics
How to Make Friends and Alienate People The Hard Drug AI is to the antisocial Mind May 9 8 min read MLearning.ai
Using A.I. to Track and Protect Rice’s Whales via Python, AutoGPT, and Image Processing How to track 51 whales with three cameras May 9 10 min read GlassBox
Typewriters will take your job Say the Writers Guild of 1714 May 6 3 min read GlassBox
How to explain where LLMs could be used at your company A Guide Prompted by personal experience May 5 5 min read MLearning.ai

Open-Sourced Tool Repositories

Project Name Badges Description
Drug Information Scraper last commit code size commit activity stars issues A Python script that scrapes drug information from the FDA website.
Clark Kent Reporter last commit code size commit activity stars issues This tool converts a traditionally formatted overview (in a readme file) into a populated Jupyter Notebook for data science presentations or findings presentations.
FamilyPhotoResurrection

Personal Research Projects

Project Name Badges Description
How Time Flies last commit code size commit activity issues A research experiment using requests and Google Images to illustrate how a search query visually changes when supplied with a year.

Projects I have in Development (Forks)

haystack developerFolio gutenbergpy bluebert MarkdownCheatsheet alive-progress gutenberg CubeTrack isometric features-tune-progress_reporter.py-is-messy-and-should-be-cleaned-up-24604- mappymatch jekyll-patreon gym Map-Tiler Kryptos


Projects for Later

Project Name Badges Description
Genre Identity last commit code size commit activity Why should music be confined to the genres that society imposes on it? This project seeks to truly understand the inner workings of what makes a musical genre using Spotify's Python API.
Quantifying Disasters via NLP last commit code size commit activity Can NLP be used to quantify the impact of a disaster?
GnomansLand

📊 Findings, Developments, and Updates

11/10/2022

issues forks stars license last commit

PySeas Image

Successfully Logged Six Days of Data from the NOAA API

There are promising results in the images that the PySeas project has produced. Finally, finding the perfect sunset is likely over the horizon!

sunset1 sunset2

The next step is to use CV2 to stitch these images together and optimize the algorithm to retrieve the photos at the most optimal time of day. I'm also looking into using any open-source equivalent of Google Cloud Vision API to detect the horizon line and crop the images accordingly. Again, CV2 may be able to do this, but at scale, it may not be the most efficient.

lorebookbanner

issues forks stars license last commit

IBM has made strides toward collating Wikipedia knowledge and creating a knowledge graph. This is an excellent step towards creating a lorebook generator for authors. In addition, I've been working on a project allowing authors to use the NovelAI API to generate a lorebook for their world. This will enable authors to jumpstart their productivity with machine learning. I've been working on this project for a few weeks now, and I'm excited to see the results. I hope to have a working prototype by the end of the month.

wwdd

issues forks stars license last commit

November 21, 2022

So far, we have gathered data for WWDD from Gutenberg's corpus. What data can we collect about Arthur Conan Doyle that will enable us to solve this problem? We need every book he's ever written, around 80 books, provided through the Gutenberg repository. These books are included in the Data folder as text files; second, I would like to have anything he wrote that was a first-hand account because this is where we will get his personal preferences and his turns of phrase, and maybe even his personal biases, which are probably the most important things to gather once we gather his diaries, journals. Things other people said about him are the next step. Many people have researched historical figures for years, and repeating them seems like a useless task and is a waste of precious resources. So in this step, we want to gather any biographies about Arthur Conan Doyle and any articles about him, primarily if they were written about him in the time he lived. And this might be most useful if we were to gather the names of all of his second-degree connections. If we think about it, in terms of a LinkedIn network, though, Doyle's second-degree connections are the most likely to have the most accurate depictions of his preferences. This is, of course, an assumption that I am making. Once we gather the names of his second-degree connections, I think it would be an excellent step to assign weight to their accounts based on the boolean characteristic 'writer' (if they authored anything themselves besides what they said about Doyle).


My Top Open Source Projects

divider

lorebook_generator_for_novelai GnomansLand Mimikers chatGPTea-Ultimate-Prompt-List


Research Projects

HowTimeFlies DisariumPy

Tools in Development

Clark-Kent-Reporter medium_titles_analysis druginfo_scraper


If you are interested in what I have been working on lately, check out my latest projects (shown above). I include a short description of each project and a link to the repository. If you have any questions or comments, please feel free to reach out to me on Twitter or LinkedIn.

How to Support My Work

If you'd like to contribute to the hours I spend staring at my screen in deep concentration, I welcome any caffeine donations. ☕ Also, if you'd like to sponsor a project you see on my page, please let me know where I should focus my attention. Open Source is a big brave new world. Cheers!

Buy Me A Coffee

Donate with PayPal

You can also find me on Discord by clicking below.

Discord

Humans Encountered since this counter was created:
Profile views

Pinned Loading

  1. HowTimeFlies HowTimeFlies Public

    A machine learning approach to evolving visual content using computer vision, VQGAN, and CLIP

    Python 2

  2. lorebook_generator_for_novelai lorebook_generator_for_novelai Public

    Generates a lorebook for novelai

    Jupyter Notebook 23 3

  3. PySeas PySeas Public

    Using Computer Vision, ML, ESRGAN, and Image Processing to find the best sunsets across the ocean.

    Jupyter Notebook 1

  4. NREL/mappymatch NREL/mappymatch Public

    Pure-python package for map matching

    Python 90 26

  5. DisariumPy DisariumPy Public

    Finding disarium numbers using python.

    Python

  6. GnomansLand GnomansLand Public

    An open-world reinforcement learning playground for gnomes filled with dangerous peril and bountiful treasure.

    Python 17 1