Submission information
Submission Number: 129
Submission ID: 226
Submission UUID: 555a8c58-506b-47fa-8113-46e20a781360
Submission URI: /form/project
Created: Tue, 11/23/2021 - 15:06
Completed: Tue, 11/23/2021 - 15:06
Changed: Mon, 06/03/2024 - 13:02
Remote IP address: 128.118.7.103
Submitted by: Rob Mathers
Language: English
Is draft: No
Webform: Project
Project Title | Calculation of Polymer Hydrophobicity |
---|---|
Program | CAREERS |
Project Image | |
Tags | cleanup (368), git (457), github (490), optimization (509), python (69) |
Status | Complete |
Project Leader | Rob Mathers |
rtm11@psu.edu | |
Mobile Phone | 412-779-7738 |
Work Phone | |
Mentor(s) | Thomas Langford |
Student-facilitator(s) | Sander Cohen-Janes |
Mentee(s) | |
Project Description | After writing python code to calculate physical properties of polymer molecules in 2021, we are interested in cleaning up the code, addressing some calculation issues, and putting the code on GitHub. The code is written using an open-source cheminformatics package called RDKit. Prior to using RDKit, we had been using commercial software (Materials Studio, Chem3D) from 2014 to 2019. The physical property of interest relates to hydrophobicity or the oil-like characteristics of polymers. Our method is inspired by the medicinal chemistry approach to describe drug-like molecules using partition coefficients. These coefficients, which are often referred to as LogP values, can be positive or negative. Positive LogP values indicate oil solubility while negative LogP values suggest water soluble molecules. Since the 1980s, the pharmaceutical industry has spawned many computational methods to calculate LogP. Our method constructs SMILES strings for a short segment of a polymer. These SMILES strings represent 3D chemical structures using ACSII symbols. Then, we use RDKit to convert the SMILES string to a 3D molecule, optimize the conformation, and calculate the surface area (SA). Afterwards, we calculate LogP. The resulting ratio of LogP/SA has provided predictive capability in a number of collaborative projects. Since 2015, we have published 18 journal articles that use LogP and LogP/SA values. |
Project Deliverables | The goals of the project include the following: 1) Clean up code 2) Optimize calculation method a. Reduce time needed for jobs b. Automatically adjust to available hardware resources c. Determine how many conformations are needed 3) Data output a. Output graph or list values (csv etc) b. Provide options for selecting axes for graph (x-axis could be number of monomer units (N), 1/N, logN, N^1/3, N^1/2 etc.) 4) Put code on GitHub |
Project Deliverables | |
Student Research Computing Facilitator Profile | - Experience with Python - Experience with or interested in learning Git and using GitHub |
Mentee Research Computing Profile | |
Student Facilitator Programming Skill Level | Some hands-on experience |
Mentee Programming Skill Level | |
Project Institution | Penn State-New Kensington |
Project Address | New Kensington, Pennsylvania |
Anchor Institution | CR-Penn State |
Preferred Start Date | |
Start as soon as possible. | Yes |
Project Urgency | Already behind3Start date is flexible |
Expected Project Duration (in months) | 5 |
Launch Presentation | |
Launch Presentation Date | 07/20/2022 |
Wrap Presentation | |
Wrap Presentation Date | |
Project Milestones |
|
Github Contributions | |
Planned Portal Contributions (if any) | |
Planned Publications (if any) | |
What will the student learn? | |
What will the mentee learn? | |
What will the Cyberteam program learn from this project? | |
HPC resources needed to complete this project? | |
Notes | |
What is the impact on the development of the principal discipline(s) of the project? | |
What is the impact on other disciplines? | |
Is there an impact physical resources that form infrastructure? | |
Is there an impact on the development of human resources for research computing? | |
Is there an impact on institutional resources that form infrastructure? | |
Is there an impact on information resources that form infrastructure? | |
Is there an impact on technology transfer? | |
Is there an impact on society beyond science and technology? | |
Lessons Learned | |
Overall results | From the Project PI: "So far, we have created The Hydrophobicity Project on GitHub to build a community of users. Sanders code is available on this site. https://github.com/TheHydrophobicityProject We have been testing and using the code since last October. A manuscript is in preparation." From the student facilitator: "This was the first project I worked on where I used version control. Now I can't live without it. This was also the first project in which I was creating a "product" for general consumption, rather than internal tools. It was useful to collaborate with people that weren't active coders on the project so I could get usability feedback off of which I could iterate. Those are two things I will keep with me regardless of the other technologies I use in my future projects." |