Submission Number: 134
Submission ID: 237
Submission UUID: 56e75a52-dd01-49dd-bb82-8616ea97d9f3
Submission URI: /form/project

Created: Thu, 01/13/2022 - 11:07
Completed: Thu, 01/13/2022 - 11:07
Changed: Wed, 07/06/2022 - 15:09

Remote IP address: 192.112.102.251
Submitted by: Gerald Kruse
Language: English

Is draft: No
Webform: Project
Project Title Configuring a high-performance cluster, with virtual machines, to simulate Hadoop multi-node system for Data Science experiences
Program CAREERS
Project Image
Tags cluster-management (495), hadoop (12), software-installation (211), unix-environment (60)
Status Halted
Project Leader Gerald Kruse
Email kruse@juniata.edu
Mobile Phone 814-644-9206
Work Phone 814-641-3595
Mentor(s)
Student-facilitator(s)
Mentee(s)
Project Description Our Data Science high-performance cluster was delivered in Jan 2020. It is a Cloudseek 1000 from PSSCLabs.
Unfortunately, Covid impacted our efforts to configure it for our Data Science courses (https://www.juniata.edu/academics/departments/data-science/curriculum.php). At Juniata, we offer a Major (our "Program of Emphasis"), a minor (our "Secondary Emphasis"), and an online graduate degree in Data Science. We've been able to get by, but with a Big Data course coming available, we need to configure this system. We would like funding for one of our students to work on this project. We have the name of a possible technical mentor, or at least someone who will need to be consulted.
It's been a challenge to get this cluster operational, and we would really appreciate any assistance.
Project Deliverables
Project Deliverables
Student Research Computing Facilitator Profile
Mentee Research Computing Profile
Student Facilitator Programming Skill Level
Mentee Programming Skill Level
Project Institution
Project Address
Anchor Institution CR-Penn State
Preferred Start Date
Start as soon as possible. No
Project Urgency Already behind3Start date is flexible
Expected Project Duration (in months)
Launch Presentation
Launch Presentation Date
Wrap Presentation
Wrap Presentation Date
Project Milestones
Github Contributions
Planned Portal Contributions (if any)
Planned Publications (if any)
What will the student learn?
What will the mentee learn?
What will the Cyberteam program learn from this project?
HPC resources needed to complete this project?
Notes
What is the impact on the development of the principal discipline(s) of the project?
What is the impact on other disciplines?
Is there an impact physical resources that form infrastructure?
Is there an impact on the development of human resources for research computing?
Is there an impact on institutional resources that form infrastructure?
Is there an impact on information resources that form infrastructure?
Is there an impact on technology transfer?
Is there an impact on society beyond science and technology?
Lessons Learned
Overall results