Virtual Screening


VDSers – you are encouraged to continue screening the rest of the libraries in the DatabaseVDS directory.

Ideally, you should have something running all the time instead of waiting to analyze each one (but don’t run more than one library job at once)

Cluster Cleaning on DDFE


VDSers,  since some jobs seem to be acting weird and not running fully on the blades – I would suggest doing the Cluster Cleaning protocol that is in the DatabasesVDS folder for EACH blade that you want to run on. You would do this before runnning your job.

Check with a mentor or Dr. B before you do this the first time though.

 

 

How to Clean a Cluster/Rid it of Gold Jobs

By: Adam Nguyen, Clifford Ho, and Kathryn Pendleton

Someone please publish this paper anybody!

This paper is now copyrighted, if you plagiarize from it, we will find you. And kill you.

****************Commands listed to be entered with description of their functions in parentheses********************

1) login to SSH Secure Shell (Host: ddfe.cm.utexas.edu)

2) ganglia cpu_user (should only be running jobs on blades 9-16)

3) ps -u username (ex. ps -u plkrs777)
(This should show you if you have any processes running)

4) ssh compute-0-X (where X=blade number i.e. 9-16, do not input a zero before single digit clusters!!! Enter the       number for computer node that your job is found on, this command will take you to the node )

5) ps -u username (ex. ps -u plkrs777, to see if there are any processes that can be killed from the specific    computer node)

6) kill -9 ######  (To kill a process, use this command, where ###### is the PID number on the left hand side for     the specific process you are trying to get rid of)

7) ps -u username (ex. ps -u plkrs777, this time, to verify that the process is no longer in the list of the       specific computer node)

8) cd /  (This changes your directory to the root directory)

9) cd tmp (This takes you to the temporary directory)

10) ls -l (This displays the long names for each file in the temporary directory, determine which pmvd file is         yours, ie. has your user id)

11) rm pvmd.#### (This command removes the file with the filename specified, ie in this ex it is pmvd.####  WARNING      ONLY DELETE THE pmvd file THAT IS UNDER YOURE USERNAME, DO NOT DELETE SOMEONE ELSES. OR WE WILL FIND YOU AGAIN.      AND KILL YOU AGAIN)

12) rm pvml.#### (This removes the license file)

13) ls -l (To verify the deletion of the file from the temporary directory)

14) exit (to leave the computer node)

15) Repeat steps 1-13 for other nodes requiring cleaning

Suggested order for Virtual Screening Libraries


Here is a list of the libraries we have in the Databases VDS folder as sorted by size.

I would recommend starting with the smaller ones. Only do ones that are in 3d format.

Don’t do the .inx ones. Those are for ICM not GOLD.

This is the size in disk space – not in ligands. So, you will still need to do the LigandCountingInstructions.txt to figure out how many are in there so that you can save 10%.

SIZE                 Name

79091              ION_Channel.inx

937397            cb_306_3d.sdf

1861731          HF9PlatesPlates5_9.sdf

1918122          NIH_ClinCol3Ded.sdf

2197560          MicroFormats.inx

2566387          zincSigmaLibrary3D.sdf

3417571          LOPAC_2d.sdf

3615248          conformersLOPAC_3d.sdf

6314046          LOPAC_3d.sdf

7368555          Fragment-set_3D.sdf

8720108          FragmentLibrary.sdf

17157134        ION_Channel.sdf

18465062        CB-kin_UT.sdf  (this is already 3D)

29809172        ION_Channel3D.sdf

36068862        KINASet.sdf

61269537        KINASet3d.sdf

64466934        HF9_180_Plates_1um3D_catnum.sdf

71529407        MW-set_3D.sdf

76018176        MolecularWeightSet.sdf

123092278      NovaCorePharmaCophore_2.sdf

123632902      NovaCorePharmaCophore_1.sdf

123989001      DIVERSet.sdf

127822631      NovaCorePharmaCophore_4.sdf

134109995      NovaCorePharmaCophore_3.sdf

139679653      ChemBridge-diversity3D.sdf

139679653      ChemBridge-diversity.sdf

153754871      CNS-Set.sdf

181460186      MayBridge50k_3D.sdf

271201171      CNS-Set3d.sdf

499025027      MicroFormats.sdf

508657806      NOVACore-PHARMACophore_Combined.sdf

900813568      MicroFormats3D.sdf

SIZE Name
79091 ION_Channel.inx
937397 cb_306_3d.sdf
1861731 HF9PlatesPlates5_9.sdf
1918122 NIH_ClinCol3Ded.sdf
2197560 MicroFormats.inx
2566387 zincSigmaLibrary.sdf
3417571 LOPAC_2d.sdf
3615248 conformersLOPAC_3d.sdf
6314046 LOPAC_3d.sdf
7368555 Fragment-set_3D.sdf
8720108 FragmentLibrary.sdf
17157134 ION_Channel.sdf
18465062 CB-kin_UT.sdf
29809172 ION_Channel3D.sdf
36068862 KINASet.sdf
61269537 KINASet3d.sdf
64466934 HF9_180_Plates_1um3D_catnum.sdf
71529407 MW-set_3D.sdf
76018176 MolecularWeightSet.sdf
123092278 NovaCorePharmaCophore_2.sdf
123632902 NovaCorePharmaCophore_1.sdf
123989001 DIVERSet.sdf
127822631 NovaCorePharmaCophore_4.sdf
134109995 NovaCorePharmaCophore_3.sdf
139679653 ChemBridge-diversity3D.sdf
139679653 ChemBridge-diversity.sdf
153754871 CNS-Set.sdf
181460186 MayBridge50k.sdf
271201171 CNS-Set3d.sdf
499025027 MicroFormats.sdf
508657806 NOVACore-PHARMACophore_Combined.sdf
900813568 MicroFormats3D.sdf