CV
Professional Experience
- Senior Developer Relations Engineer
Anaconda (Oct. 2025 - )
Act as a technical bridge between Anaconda’s engineering teams and developer community; advocate for and contribute to open source projects - Lead Data Scientist III
Arch Systems | U.S. Administration for Children & Families (Mar. 2025 - Sep. 2025)
Led the development of AI evaluation frameworks and large-scale document analyses for sensitive populations as part of ACF’s Data Surge Team - Data Scientist
U.S. General Services Administration (Dec. 2024 - Feb. 2025)
Maintained enterprise data pipelines and provided analytical consulting for federal acquisition analysis - Data Operations & Research Manager
National Parks Conservation Association (Mar. 2023 - Dec. 2024)
Built enterprise data warehouse; developed automations and ML models supporting a $300M fundraising campaign - Head Librarian
Harvard-Smithsonian Center for Astrophysics (Oct. 2015 - Feb. 2023)
Managed the world’s largest astrophysics library; secured over $1M in grants for open science and astronomical data preservation work along with software citation initiatives - Data and Metadata Librarian
Cornell University, Weill Cornell Medical College (May 2013 - Sep. 2015)
Co-instructed graduate-level computational health informatics courses; developed resources to support researchers and establish data management best practices
Education
- Master of Science in Data Science
CUNY Graduate School and University Center (Jan. 2018)
Thesis: Generating Domain-Specific Language Using a Long Short-Term Memory Recurrent Neural Network to Infer Relative Linguistic Complexity Between Astrophysics Sub-Disciplines - Master of Science in Library and Information Science
Syracuse University (May 2013) - Graduate Certificate of Advanced Study in Data Science
Syracuse University (May 2013) - Bachelor of Arts in English, Pre-Medicine
State University of New York at Geneseo (May 2011)
Minors: Biology, Psychology; Honors: Phi Beta Kappa
Technical Skills
Category | Technologies & Tools |
---|---|
Data Science & Machine Learning | Python, R, SQL, statistics, experimental design, large-scale data processing and pipeline development, TensorFlow, PyTorch, scikit-learn, Transformers |
Large Language Models & AI Safety | Prompt engineering, LLM governance protocols, constitutional AI implementation, red teaming, LLM evaluation frameworks (Promptfoo, DeepEval) |
Graph Technologies & Semantic Data | Neo4j, cypher, SPARQL, knowledge graphs, semantic data modeling, RDF, JSON-LD, controlled vocabularies, schema design, graph machine learning, network analytics |
Cloud Infrastructure | AWS (S3, Lambda, ECR, EC2, Redshift, Neptune, SageMaker, CloudWatch), Snowflake, enterprise security and compliance protocols |
Visualization | D3.js, Observable, Plotly, ggplot2, Tableau |
Development Tools | GitHub, GitLab, Hugging Face, Docker, Claude Code |
Awards as Principal Investigator
- Smithsonian Institution: $270,500
American Women’s History Initiative
Credit Where Credit is Due: Undoing the Ongoing Erasure of Women’s Labor in Astronomy (2022 - 2024) - Institute of Museum and Library Services: $99,372
National Leadership Award
Software Citation Implementation: Action Plan Development (2022 - 2024) - Harvard University: $9,994
Advancing Open Knowledge Program
The Figures in the Sky Initiative: Elevating Observers of the Night Sky (2021 - 2022) - Alfred P. Sloan Foundation: $390,634
Infrastructure for Open Space Missions (2019 - 2021) - National Science Foundation (Co-Principal Investigator): $215,018
Smithsonian Astrophysical Observatory Latino Initiative Internship Program (2017 - 2022) - Smithsonian Institution: $11,740
Smithsonian Women’s Committee
Conservation and Promotion of Astronomical Heritage (2017 - 2019) - American Astronomical Society: $3,947
Professional Development Through Access to Diverse Viewpoints (2017 - 2018) - Medical Library Association: $1,000
NY-NJ Chapter Research Award
Data Literacy in Biomedical Research Environments (2015)
Mentorship and Volunteering
- Break Through Tech AI Mentor, Massachusetts Institute of Technology (2025)
- Mozilla Open Leaders Expert, Mozilla Foundation (2017)
- Certified Software and Data Carpentry Instructor, The Carpentries (2016)
Professional Development and Certifications
- Neo4j Graph Data Science Certification, Neo4j (2025)
- Tableau Desktop I & II, Trailhead Academy (2025)
- Certified Scrum Product Owner, Scrum Alliance (2023)
- Certified Agile Leader, Scrum Alliance (2023)
- Library Leadership in the Digital Age, Harvard Graduate School of Education (2018)
- Management for Faculty of Arts & Sciences, Harvard University (2016)
- Scholar, Institute for Research Design in Librarianship (2014)
Professional Activities and Commitee Service
- National Parks Conservation Association - Community Engagement Working Group (2023 - 2024)
- Harvard University - Special Collections and Archives Council Leadership Group (2022 - 2023), Science Libraries Council (2016 - 2023) - Chair (2018 - 2019), Stewardship Standing Committee (2016 - 2018)
- Smithsonian Institution - Arts and Industries Building Committee (2022 - 2023), American Women’s History Initiative Curatorial Committee (2017 - 2023)
- NASA SmallSat Reliability Initiative (2020 - 2023)
- arXiv Next Generation IT Advisory Group (2018 - 2023)
- Smithsonian Astrophysical Observatory (SAO) SAO Council (2018 - 2021) - Chair (2019 - 2021)
- Society for the History of Astronomy Member (2018 - 2023)
- Software Preservation Network Member (2018 - 2023), Steering Committee (2018 - 2020)
- Harvard-Smithsonian Center for Astrophysics Scientific Computation Advisory Committee (2017 - 2023)
- FORCE11 Software Citation Implementation Working Group (2017 - 2023)
- American Astronomical Society History of Astronomy Division (2016 - 2023)
- Unified Astronomy Thesaurus Steering Committee (2015 - 2023)
- Special Library Association Physics Astronomy Math Division (2015 - 2023)
- Cornell University Medical Library Communications Committee - Co-Chair (2014 - 2015)
Publications
Whitten, K., Boegen, L., Levine, S., Osborn, W., Griffin, E., Barker, T., Bartlett, J., Bouquin, D., Smith-Zrull, L., Kern, B. (2024). Astronomy’s Photographic Glass Plates: Demonstrating Value Through Use Cases. Issues in Science and Technology Librarianship, 105. https://doi.org/10.29173/istl2759
Bouquin, D., Trisovic, A., Bertuch, O., Colón-Marrero, E. (2023). Advancing Software Citation Implementation. arXiv. https://doi.org/10.48550/arXiv.2302.07500
Carver, N., Correia, S., Frey, K., Bouquin, D., & McEachern, M. (2022). Embracing the Crowd: People-powered Research to Preserve the History of Astronomy. Bulletin of the American Astronomical Society, 54(2). https://doi.org/10.3847/25c2cfeb.3e8f7cf7
Bouquin, D., Carver, N., Frey, K., Williams, A., Damkalis, A.-P., Papadeas, P., & Tsiligiannis, V. (2022). LSTN: The Library Space Technology Network. Bulletin of the American Astronomical Society, 54(2). https://doi.org/10.3847/25c2cfeb.e5818322
Katz D.S., Chue Hong N.P., Clark T., Muench, A., Stall, S., Bouquin, D., Cannon, M., Edmunds, S., Faez, T., Feeney, P., Fenner, M., Friedman, M., Grenier, G., Harrison, M., Heber, J., Leary, A., MacCallum, C., Murray, H., Pastrana, E., Perry, K., Schuster, D., Stockhause, M., Yeston. (2020). The importance of software citation. F1000Research. https://doi.org/10.12688/f1000research.26932.2
Carver, N., Bouquin, D., & Frey, K. (2020). LSTN Handbook (1.0) Zenodo. https://doi.org/10.5281/zenodo.3776518
Bouquin, D., Chivvis, D., Henneken, E., Lockhart, K., Muench, A., Koch, J. (2020). Credit Lost: Two Decades of Software Citation in Astronomy. The Astrophysical Journal Supplement Series: ApJS, 249(8). https://doi.org/10.3847/1538-4365/ab7be6
Albert, K., Bouquin, D., Farber, A., Hoover, R. (2019). The Copyright Guide for Scientific Software. https://doi.org/10.5281/zenodo.3581326
Bouquin, D., Muench, A., Cruz, K., Chivvis, D., & Hennek, E. (2019). Citing Astronomy Software [blog]. AstroBetter. https://perma.cc/DS73-5BZE
Katz, D. S., Bouquin, D., Chue Hong, N. P., Hausman, J., Jones, C., Chivvis, D., … Zhang, Q. (2019). Software Citation Implementation Challenges [FORCE11 report]. https://arxiv.org/abs/1905.08674
Bouquin, D., Hou, S., Benzing, M., Wilson, L. (2019). Jupyter Notebooks: A Primer for Curators (v1.0). DCN Data Curation Primers. http://doi.org/10.5281/zenodo.2591580
Katz, D., Bouquin, D., Chue Hong, N. (2019). Towards Software Non-Creator-Instigated Identification (NCI) and Citation [blog]. https://perma.cc/L2ZZ-M78X
Bouquin, D., Frey, K., Henneken, E., McEachern, M., McGrath, A., Guarracino, D., Koch, J., Damon, J., Brownell, E., Zrull, L. S. (2018). Project PHaEDRA: Preserving Harvard’s Early Data and Research in Astronomy. Proceedings of the American Astronomical Society, 231, id. 143.03. https://ui.adsabs.harvard.edu/abs/2018AAS…23114303B
Bouquin, D., Ball, C., Lamprecht, A., Jones, C., Skluzacek, T. (2018). Abandon hope, we are archiving things we can’t use: considerations for documenting complex objects [blog]. Software Sustainability Institute: WSSSPE6.1. https://perma.cc/G42X-DZ5H
Bouquin, D., Katz, D., Henneken, E., Muench, A., Frey, K., Smith, A. (2018). Software Citation Implementation in Astronomy [whitepaper]. Astronomy Computing Today. https://perma.cc/2ZNC-YUNM
Gordon, I. E., Bouquin, D., Potterbusch, M. R., Erdmann, C. C., Wilzewski, J. S., and Rothman, L. S. (2016). Are your spectroscopic data being used? Journal of Molecular Spectroscopy, 327, 232-238. http://doi.org/10.1016/j.jms.2016.03.011
Richardson, J. E., Bouquin, D. R., Tmanova, L. L., Wright, D. (2015). Information and informatics literacies of first-year medical students. Journal of the Medical Library Association: JMLA, 103(4), 198–202. http://doi.org/10.3163/1536-5050.103.4.008
Bouquin, D. (2015). GitHub. Journal of the Medical Library Association: JMLA, 103(3), 166–167. http://doi.org/10.3163/1536-5050.103.3.019
Halpern, S., Eaker, C., Jackson, J., Bouquin, D. (2015). #DitchTheSurvey: Expanding Methodological Diversity in LIS Research. In the Library with the Lead Pipe. https://perma.cc/X3DT-FBSE
Selected Presentations
Bouquin, D. (2024). Metadata, Meaning, and Erasure in Astronomy. Presented as SUNY Geneseo’s 2024 Phi Beta Kappa Lecture. Geneseo, NY. https://perma.cc/QDY4-2EG5
Bouquin, D. (2021). How to Lose a Legacy. Presented virtually at the Space Telescope Science Institute’s Engineering and Technology Colloquium. https://slides.com/dbouquin/stsci-2021
Bouquin, D., Chivvis, D., Carver, N. (2019). Metasat: Open Metadata for Small Satellite Missions. Presented at the Committee on Space Research (COSPAR) Symposium: Small Satellites for Sustainable Science and Development. Herzliya, Israel. https://slides.com/dbouquin/cospar_2019
Bouquin, D. (2019). MetaSat: Metadata for Good. Presented at the 9th Annual GNU Radio Conference. Huntsville, Alabama, USA. https://perma.cc/9B6M-GDDV
Bouquin, D. (2019). Legacy: Preservation and Scientific Software. Colloquium presentation at Greenbank Observatory. Greenbank, West Virginia, USA. https://slides.com/dbouquin/greenbank_2019
Bouquin, D. (2019). Acknowledging Scientific Software to Ensure the Future and Legacy of Scientific Research. Presented at the Society for Industrial and Applied Mathematics (SIAM) Conference on Computational Science and Engineering. Spokane, Washington, USA. https://perma.cc/N3CW-K76C
Bouquin, D., Chivvis, D. (2019). Software Citation Implementation in Astronomy: An Update. Presented at the 233rd Meeting of the American Astronomical Society. Seattle, Washington, USA. https://slides.com/dbouquin/aas_233
Bouquin, D. (2018). Stochastic Gradient Boosting Machines: Core Concepts. Presented at the Chandra X-Ray Observatory Symposium: Classification in the Golden Era of X-ray Catalogs. Cambridge, Massachusetts, USA. https://slides.com/dbouquin/sgbm
Bouquin, D. (2018). Considerations for Sharing and Preserving Research Software and Data. Presented at the 15th International High-Resolution Transmission Molecular Absorption Database Conference (HITRAN). Cambridge, Massachusetts, USA. https://perma.cc/FSH9-7J2T
Bouquin, D., Frey, K., McEachern, M., Damon, J., Guarracino, D., McGrath, A., Henneken, E., Smith-Zrull, L. (2018). Project PHAEDRA: Preserving Harvard’s Early Data and Research in Astronomy. Presented at the 8th International Library and Information Services in Astronomy Conference: Astronomy Librarianship in the Era of Big Data and Open Science. Strasbourg, France. https://doi.org/10.1051/epjconf/201818607003
Bouquin, D., DeBlase, J. (2017). Beautiful Graphs Made Simpler with Jupyter. Presented at JupyterCon 2017. New York, NY. https://pyvideo.org/jupytercon-2017/beautiful-networks-and-network-analytics-made-simpler-with-jupyter-daina-bouquin-john-deblase.html
Bouquin, D. (2016). Libraries for Astronomers. Presented at dotAstronomy 8: University of Oxford. Oxford, UK. http://slides.com/dbouquin/libraries-for-astronomers
Last updated on September 8, 2025