Glenn Research Center - Refactor Data Pipelines Using Scientific Workflows
December 6, 2021 - May 13, 2022
Final Goals of your project/s:
Plume ion cloud researchers simulate experiments using 3D modeling data. Typically, the simulation is fed by processed experimental data. Unfortunately, this does not allow researchers to look at multiple files as an aggregate. The primary goal of my internship was to create a process that turns multiple test runs into a single run.
Describe what you did during the internship:
Initially, I explored the processed data to discover trends. Once I had a solid understanding of the data and its limitations, I used rejection sampling to combine data from multiple test runs into a single test run.
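The rejection-sampling idea can be illustrated with a minimal Python sketch. This is not the internship script itself: the function name aggregate_runs, the per-run weights, and the layout of a run as a plain list of records are all assumptions made for illustration. A record is proposed uniformly from the pooled runs and accepted with a probability proportional to the weight of the run it came from.

```python
import random


def aggregate_runs(runs, weights, n_out, seed=0):
    """Combine several test runs into one run of n_out records.

    runs    -- list of runs, each a list of records (hypothetical layout)
    weights -- one non-negative weight per run (assumed input, e.g. how
               strongly each run should be represented in the result)
    """
    if len(runs) != len(weights):
        raise ValueError("need exactly one weight per run")
    if any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative")

    # Pool every record together with the index of the run it came from.
    pooled = [(i, rec) for i, run in enumerate(runs) for rec in run]
    if not any(weights[i] > 0 for i, _ in pooled):
        raise ValueError("no record has a positive weight")

    rng = random.Random(seed)
    w_max = max(weights)
    combined = []
    while len(combined) < n_out:
        # Proposal step: pick one pooled record uniformly at random.
        run_idx, rec = rng.choice(pooled)
        # Accept/reject step: keep it with probability weight / max weight.
        if rng.random() < weights[run_idx] / w_max:
            combined.append(rec)
    return combined


# Usage with made-up numbers: the second run is trusted half as much.
run_a = [1.0, 1.2, 0.9]
run_b = [5.0, 5.1]
single_run = aggregate_runs([run_a, run_b], [1.0, 0.5], n_out=10)
```

Because records are drawn with replacement, the combined run can be larger or smaller than any input run. A fixed seed keeps the result reproducible.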
Did you achieve your goals? What were the results?:
By the end of the semester, my script was capable of producing an aggregated set of data. This allows researchers to combine multiple files for a plume ion cloud simulation.
Describe positive lessons learned:
– The initial way a problem is described may not be what the actual problem is.
– Sometimes, exploring an out-of-the-box idea can pay off!
Describe negative lessons learned:
– Managing many sets of data can be very difficult and requires precise naming conventions.
Overall, I had a very positive internship experience. Thank you for helping make this possible!