Job Details  

Programmer I
Click here to apply for this job
Job ID 74483
Job Funding Source Work-Study, Non-Work-Study
Employer Information, School of
Category Professional/Administrative
Job Description

We are seeking a motivated Student Programmer to assist with data collection, cleaning, and management tasks for an active research project focused on Knowledge Graphs (KGs) of scientific activity.

The goal of this project is to model the scientific ecosystem (including datasets, research papers, authors, institutions, and funding sources) as a large-scale knowledge graph to facilitate new kinds of data-driven analysis and discovery. You will be working directly with faculty and other student researchers.

The Team

Responsibilities:

The programmer will primarily focus on tasks essential for building and maintaining the knowledge graph.

  • API Integration & Data Collection: Develop and maintain Python scripts to interact with various APIs (e.g., scholarly databases, institutional repositories) to collect structured and semi-structured scientific metadata.

  • Data Cleaning and Transformation: Implement data processing routines in Python to clean, normalize, and transform raw data into a format suitable for graph modeling.

  • Graph Database Management: Load, update, and manage data within the Neo4j graph database. This includes writing and optimizing Cypher queries for data manipulation and ensuring data integrity.

  • Documentation: Maintain clear and thorough documentation of data sources, collection scripts, and data schema.

Educational Value

Real-world Python development

Data collection and management experience

Job Requirements

Required Qualifications:

  • Proficiency in Python (including experience with libraries for data manipulation like pandas) and Jupyter Notebook.

  • Experience working with APIs for data retrieval (e.g., using requests library).

  • Familiarity with or willingness to rapidly learn Neo4j and the Cypher query language.

  • Experience with version control systems, especially Git.

  • Solid understanding of data structures, data modeling concepts, and basic database principles.

  • Reliable, detail-oriented, and able to work independently on assigned tasks.

Required application materials

Make sure you indicate in the online application form why you're interested in the job. You must also include a link to an example of your work - e.g., a github repo, an online portfolio - in either that box or on the resume you upload.

Hourly Rate $25.00/hour
Hours 5.0 to 20.0 hours per week
Time Frame Fall/Winter/Spring/Summer
Start Date ASAP
End Date Available through summer 2027
Primary Contact Libby Hemphill
Primary Contact's Email N/A
Supervisor N/A
Work Location Remote - meetings will occur in Leinweber Building
Phone N/A
Fax N/A