Engaging the Open Energy Modeling & Data community

zaneselvans · November 17, 2023, 8:33pm

Hi y’all,

I’m part of a little employee-owned data & software engineering org called Catalyst Cooperative. We mostly work with US energy system data and support researchers and policy NGOs trying to accelerate the transition away from fossil fuels.

We’re planning to do a bunch of community outreach in 2024 (hopefully with support from the NSF POSE grant program) to get more people familiar with the open data that we publish, and to help the community get more familiar with best practices in reproducible data processing and the kinds of software tooling that make it easy to work with tables that contain millions to billions of rows.

All of our own tooling is written in Python, but we’re moving to a model of just distributing tabular data as SQLite databases or Parquet files so folks can use R, DuckDB, or whatever other tooling they’re most familiar with, and not have to worry about installing or running the huge pile of dependencies we need to produce the data.
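As a rough illustration of why a plain SQLite file is convenient for downstream users, here is a minimal sketch that queries one using only the Python standard library. The `generators` table, its columns, and the rows are all made up for the example and are not taken from the actual published data; an in-memory database stands in for a downloaded file.

```python
import sqlite3


def total_capacity_by_fuel(conn):
    """Return {fuel_type: summed capacity in MW} from a hypothetical `generators` table."""
    rows = conn.execute(
        "SELECT fuel_type, SUM(capacity_mw) "
        "FROM generators GROUP BY fuel_type ORDER BY fuel_type"
    ).fetchall()
    return dict(rows)


# Stand-in for a downloaded database file. With a real file you would
# pass its path to sqlite3.connect() instead of ":memory:".
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE generators (plant_id INTEGER, fuel_type TEXT, capacity_mw REAL)"
)
# Made-up toy rows, for illustration only.
conn.executemany(
    "INSERT INTO generators VALUES (?, ?, ?)",
    [(1, "coal", 500.0), (2, "wind", 150.0), (3, "wind", 50.0)],
)
conn.commit()

totals = total_capacity_by_fuel(conn)
print(totals)
```

The same query could be run from R, DuckDB, or any other tool with a SQLite driver, without installing any of the software used to build the database.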

We’re interested in developing a series of example notebooks or other tutorial materials that can help students (really anyone from undergrads to post-docs) working in energy systems get up to speed with doing data analysis in Python, while working with relevant data that they’ll hopefully find interesting and useful in their research.

Right now we have nightly data builds that deploy their outputs to a Datasette instance and to the AWS Open Data Registry, and from there to a Kaggle dataset and a simple example notebook.

This feels like it might be adjacent to the work that PyOpenSci is doing, and we’re wondering how we might learn from / participate in the community development work you’re doing, to get the open energy modeling & data community better organized and more familiar with reproducible open science standards and working as a more coherent open source ecosystem.

Would this make sense as a potential organizational collaboration? We’re also reaching out to The Carpentries and were thinking about talking to the US RSE as well.

lwasser · November 20, 2023, 5:10pm

hey there @zaneselvans
welcome to pyOpenSci!

i’m happy to chat with you more about this. I’d like to better understand what you are doing so i can identify where the points of partnership might lie. can you email me? leah at pyopensci.org and we can find a time to chat? december might be best if you are around then.

leah

zaneselvans · November 20, 2023, 11:31pm

Okay great! I sent an email.
