用POVME3.0计算蛋白质口袋和聚类

https://github.com/POVME/POVME

首先按照教程安装povme3.0,每次使用的时候,用一个单独的python27环境,source 路径/miniconda2/bin/activate

(我GPU工作站设置了alias povme,直接输入povme即到python27环境)

第一步:将同一个蛋白(不同轨迹)align好,生成pdb轨迹

gmx trjconv -s tpr 为reference structure,align的时候align到tpr结构上而不是第一帧

得到一个pdb轨迹

第二步:生成povme参数文件 *.ini

简化版(斜体为必须设置的,其他参数默认):

GridSpacing            1.0 (相邻点之间的距离,单位为埃。降低这个数字可以提高精度,但会牺牲计算时间

InclusionSphere  65.0 98.0 50.0 16.0  (添加一个由点组成的球体到包围区域,以(65.0 98.0 50.0)为中心,半径为16.0。(65.0 98.0 50.0)是提供的样品PDB轨迹文件中酶活性位点的位置(4NSS.pdb))

PDBFileName 4NSS.pdb 

DistanceCutoff              1.09 (默认值为1.09埃,因为这是氢原子的范德华半径。

ConvexHullExclusion        first   (first/max/each,详见下面解释

SeedSphere  67.0 102.0 57.0 4.0 (如果设置了该seed球,则测的即必须包含这个区域(如果有多个口袋的话);不设置,则测口袋总体积,即使有多个不相邻的口袋。不要随便用这个参数,当该设置中所含的体积里面没有口袋时,运算会报错

ContiguousPointsCriteria    3 (如果两个口袋卷至少共享这个ContiguousPointsCriteria的相邻点,则认为它们是“邻接的”,用默认即可

NumProcessors              16

OutputFilenamePrefix          ./POVME_test_run/example_  (所有POVME输出的文件都以这个前缀开头。POVME自动创建所需的目录(./POVME_test_run/在本例中))


#### The minimum input required for POVME to run is the input trajectory name and inclusion region.

第三步:看输出结果

We recommend that you visualize the results using VMD. Open the POVME output using VMD with the following command:

vmd -m POVME_test_run/example_volume_trajectory.pdb 4NSS.pdb

Under the Graphics-->Representations menu in VMD, show the 0: POVME_volume_trajectory.pdb molecule using the Drawing Method "VDW" (and consider reducing the sphere Scalevalue to something like0.3). Now press the play button in the bottom right corner of the VMD Main window to watch the pocket trajectory (For this short trajectory, it is probably better to advance manually through the frames).

第四步:PCA分析和聚类

(未完成。。)


详细参数:

# POVME 3.0 Sample Input File

# First, we need to define a point field that entirely encompasses all trajectory pockets. 

首先,我们需要定义一个完全包含所有轨迹口袋的点场

GridSpacing            1.0                                 # The distance, in Angstroms, between adjacent points. Making this number lower improves accuracy at the expense of computing time.

#相邻点之间的距离,单位为埃。降低这个数字可以提高精度,但会牺牲计算时间。

InclusionSphere  65.0 98.0 50.0 16.0                # Add a sphere of points to the pocket- encompassing region, centered on (65.0 98.0 50.0) with radius 16.0.  (65.0 98.0 50.0) is the location of the enzymatic active site in the sample PDB  trajectory file provided (4NSS.pdb).

添加一个由点组成的球体到包围区域,以(65.0 98.0 50.0)为中心,半径为16.0。(65.0 98.0 50.0)是提供的样品PDB轨迹文件中酶活性位点的位置(4NSS.pdb)。

#InclusionSphere  -100.0 -100.0 -100.0 10.0        # Add a second sphere of points. Note that this sphere is included only for demonstration purposes. The point (-100.0 -100.0 -100.0) is actually far from the 4NSS.pdb structure.

添加第二个点球体。请注意,包含此范围仅用于演示目的。这个点(-100.0 -100.0 -100.0)实际上离4NSS pdb的结构很远。

#InclusionBox    100.0 100.0 100.0 10.0 10.0 10.0    # Add a rectangular prism ("box") of  points to the pocket-encompassing region, centered on  (100.0 100.0 100.0) and spanning 10.0 Angstroms in the x, y, and z directions, respectively.    Again, this box is far from the 4NSS.pdb structure and is included only for demonstration purposes.

#添加一个矩形棱柱(“方框”)到包含口袋的区域,以(100.0 100.0 100.0)为中心,在x、y和z方向上分别跨越10.0埃。同样,这个盒子离4NSS很远,仅用于演示目的。

InclusionCylinder  65.0 98.0 50.0 1.0 1.0 1.0 16.0 10.0  # Add a cylinder of points centered at [65.0 98.0 50.0], with its length along the axis    [1.0 1.0 1.0], a radius of 16.0 angstroms, and a height of 10.0 angstroms.

#添加一个以[65.0 98.0 50.0]为中心的点圆柱体,其长度沿轴[1.0 1.0 1.0],半径16.0埃,高度10.0埃。

#ExclusionSphere  -100.0 -100.0 -100.0 10.0          # Remove all points from the pocket- encompassing region that fall within a sphere centered at  (-100.0 -100.0 -100.0) with radius 10.0.

#从环绕口袋的区域中移除所有位于半径为10.0圆心(-100.0 -100.0 -100.0)的球体内的点。

#ExclusionBox    100.0 100.0 100.0 10.0 10.0 10.0    # Remove all points from the pocket-encompassing region that fall within a box, centered at   (100.0 100.0 100.0) and spanning 10.0 Angstroms in the x, y, and z directions, respectively.

#从包含口袋的区域中移除位于方框内的所有点,在x、y和z方向上分别以(100.0 100.0 100.0)为中心和跨越10.0埃。

# Saving and loading the POVME points of the pocket-encompassing region.

#SavePoints       true                                    # You can optionally save the point field to a PDB file. As you can imagine, identifying just the right set of inclusion and exclusion spheres and boxes to encompass the binding pocket is challenging. On approach is to define an initial geometry, visualize that geometry together with the protein using a  program like VMD, and then iteratively add new inclusion and exclusion regions as required. The ability to save the points for visualization is helpful. Additionally, if your point field contains many points, generating the field may be computationally intensive. If you want to use the same field for multiple POVME runs, using a saved copy of the field rather than repeatedly generating it from scratch is more efficient. Note that POVME also saves a NPY file, which contains the same data as the PDB file but can be loaded much faster in subsequent POVME runs. The point-field PDB file is saved to   {PREFIX}point_field.pdb (see OutputFilenamePrefix below). Additionally, if you specify a  contiguous- pocket seed region (see ContiguousPocketSeedSphere and ContiguousPocketSeedBox  below), POVME will also save those points to {PREFIX}contiguous_pocket_seed_points.pdb  for  visualization.

您可以选择将POINT field 保存到PDB文件中。正如你可以想象的那样,确定正确的包含和排除球体和盒子来包含binding口袋是一项挑战。方法是定义一个初始的几何图形,使用像VMD这样的程序将该几何图形与蛋白质可视化,然后根据需要迭代地添加新的包含和排除区域。保存点以便可视化的能力很有帮助。此外,如果点场包含许多点,则生成的场可能会非常密集。如果您希望在多个POVME运行时使用相同的字段,那么使用保存的字段副本而不是从头重复生成该字段更有效。注意,POVME还保存了一个NPY文件,该文件包含与PDB文件相同的数据,但在随后的POVME运行中可以更快地加载点字段PDB文件被保存到{PREFIX}point_field.pdb(参见下面的OutputFilenamePrefix)。另外,如果您指定一个连续的口袋种子区域(参见下面的 ContiguousPocketSeedSphere和ContiguousPocketSeedBox ),POVME也会将这些点保存为{PREFIX}contiguous_pocket_seed_points.pdb来可视化。

#LoadPointsFilename    points.pdb.npy                      # You can optionally load previous point fields if you don't want to generate them using the comamnds above. Note that you should use the .pdb.npy file, not the pdb file.

 加载之前生成的point field 文件

# Load the PDB trajectory file

PDBFileName                4NSS.pdb                      # Load in the PDB trajectory file with the pocket you want to analyze.Tell POVME how to identify points that are within the binding pocket.

DistanceCutoff              1.09                   # Any point that comes within this distance of any receptor atom's van der Waals surface will not be considered part of the pocket volume. 1.09 Angstroms was chosen as the default value because that is the van der Waals radius of a hydrogen atom.

任何靠近任何受体原子的范德华表面这个距离内的点都不会被认为是口袋体积的一部分。默认值为1.09埃,因为这是氢原子的范德华半径。

ConvexHullExclusion        first                          # The convex hull is a method of determining where a pocket "ends" on the outside of the protein.  Portions of the inclusion region which lie outside of the convex hull are removed. In previous versions, this would be recalculated for each individual frame, however this led to significant numerical noise as the motion of the protein in different frames could radically redefine the convex hull. A consistent convex hull can now be used in all frames by giving the "first" or "max" keywords here. "first" applied the convex hull from  the first frame to all others."max" draws a convex hull around all the frames superimposed simultaneously (and may take a while on large trajectories). To reproduce previous behavior, the "each" keyword may be used. Any other keyword will not use the convex hull exclusion method.

可写参数:first max each none

凸壳是一种确定pocket在蛋白质外部“末端”位置的方法。所述凸包外部的部分包含区域被移除。在以前的版本中,对于每个单独的帧,这将被重新计算,但是这导致了显著的数值噪声,因为蛋白质在不同帧中的运动可能会从根本上重新定义凸包。一个一致的凸壳现在可以使用在所有帧通过给出“first”或“max”关键字在这里。“first”将第一帧中的凸包应用到所有其他帧中。“max”在同时叠加的所有帧周围画一个凸壳(在大轨迹上可能需要一段时间)。要重现以前的行为,可以使用“each”关键字。任何其他关键字都不会使用凸包排除方法。

SeedSphere  67.0 102.0 57.0 4.0                   # It's possible your pocket-encompassing point field defined above might include more than one pocket in at least some of the frames of your trajectory. You can instruct POVME to remove any points  that are not contiguous with a user-defined "contiguous pocket seed region." This region, which is typically just a small sphere placed in the center of your primary pocket of interest, tells POVME which pocket to measure. If no such regions are specified, POVME will calculate the total volume accross all pockets covered by your pocket-encompassing point field, even if they are not contiguous.

如果设置了该seed球,则测的即必须包含这个区域(如果有多个口袋的话);不设置,则测口袋总体积,即使有多个不相邻的口袋。

在你的轨迹的至少一些框架中,上面定义的包含口袋的点场可能包含多个口袋。您可以指示POVME删除不与用户定义的“contiguous pocket seed region”相邻的任何points。这个区域,通常是放在你感兴趣的主要口袋中心的一个小球体,告诉POVME要测量哪个口袋(如果有多个口袋的话)。如果没有指定这样的区域,POVME将计算包含口袋的点场所覆盖的所有口袋的总体积,即使它们不是相邻的。

ContiguousPointsCriteria    3                     # Two pocket volumes are considered "contiguous" if they share at least this number neighboring points in common. Note that points that are "kitty-corner" from each other count as neighbors.

如果两个口袋卷至少共享这个ContiguousPointsCriteria的相邻点,则认为它们是“邻接的”。注意,点是“kitty-corner”从对方算作相邻。

# Tell POVME how to perform the calculations.

NumProcessors              1                              # POVME can use multiple processors on Unix-based systems.

# Tell POVME how to save the output

OutputFilenamePrefix      ./POVME_test_run/example_  # All the files POVME outputs will start with this prefix. POVME automatically creates any   required directory (./POVME_test_run/ in this case).

所有POVME输出的文件都以这个前缀开头。POVME自动创建所需的目录(./POVME_test_run/在本例中)。

CompressOutput                false          # If you're short on disk space, POVME can automatically compress all output files using gz compression.



安装POVME3.0

wget https://repo.continuum.io/miniconda/Miniconda2-latest-Linux-x86_64.sh

bash Miniconda2-latest-Linux-x86_64.sh -b -p miniconda2

source miniconda2/bin/activate

pip install povme



https://github.com/POVME/POVME

Basic example

cd basic_example

POVME3.py sample_input.ini

This example shows the "classic" operation of POVME, using a geometrically-defined inclusion sphere. If you open the "sample_input.ini" text file, you will find the operating parameters. The minimum input required for POVME to run is the input trajectory name and inclusion region.

Once this runs, you will have an output directory named POVME_test_run.


Ligand-defined inclusion region example

cd ligand_example/

POVME3.py sample_POVME_input.ini

To visualize:

vmd -m POVME_test_run/POVME_volume_trajectory.pdb 1BYQ_every250.pdb

POVME 3.0 now allows users to define the inclusion region of a pocket using a ligand residue name. The pocket will then be defined in all grid points within 3 Angstroms of the ligand atoms in the loaded PDB trajectory. Note that this residue name must match the one given in the input PDB trajectory.

Clustering and PCA example

cd analysis_workflow_example/

source runWorkflow.sh

The bulk of the new capabilities of POVME 3.0 are in separate scripts. Three of these are showcased in the analysis workflow example.

This example runs POVME on 5 trajectories taken from the POVME 3.0 paper's HSP90 simulations. Each of these trajectory PDB files has 5 frames, and has had the ligand removed. After running POVME on these trajectories, three post-processing scripts are run:


binding_site_overlap.py calculates the similarity of all of the analyzed frames

cluster.py processes the binding_site_overlap matrix and performs hierarchical clustering.

1. This example is programmed to yield five clusters, as specified in the "-n" argument to cluster.py // 这个示例被编程为生成5个集群,这在cluster.py的“-n”参数中指定

2. A heatmap showing which frames belong to which cluster is displayed when cluster.py finishes running. //py结束运行时将显示一个热图,显示哪个帧属于哪个集群。

3. Combined and individual-simulation transition maps are displayed. //将显示组合和单独模拟的转换映射。

4. The most representative frames from each cluster are output in the 3-post_analysis/ALL/cluster#subdirectories. //每个集群中最具代表性的帧输出在3-post_analysis/ALL/cluster#子目录中。

5. The average pocket shape of each cluster can be visualized in vmd by running vmd -e visualizeAll.vmd in the 3-post_analysis/ALL subdirectory, and showing the second representation in each loaded object .//通过在3-post_analysis/ALL子目录运行vmd -e visualizeAll.vmd,可以在vmd中显示每个簇的平均 pocket shape,并在每个加载的对象中显示 second representation。

6. Text files of the cluster members and representatives are written, with each line corresponding to one cluster //text文件中写cluter成员和代表的结构,每一行对应一个集群

pocketPointsPca.py runs principal component analysis of the pocket points in the frames

1. Scatterplots of each simulations position in PC space are shown//给出了计算机空间中各模拟位置的散点图

2. A plot of the explained variance for each PC is shown//每个PC的方差解释图显示

3. The first 10 principal components can be visualized by running vmd -e loadAllPcs.vmd //显示前10个主成分





运行流程:

(base) $ source runWorkflow.sh ---------------------------------

START

---------------------------------

PDBFILENAME ../1-trajectories/4R3M_every50ns_aligned.pdb

GRIDSPACING 1.0

INCLUSIONSPHERE 38.2 -47.5 63.7 15

CONTIGUOUSPOINTSCRITERIA 3

DISTANCECUTOFF 1.09

CONVEXHULLEXCLUSION none

OUTPUTFILENAMEPREFIX ./4R3M_every50ns/4R3M_every50ns_

COMPRESSOUTPUT false

NUMPROCESSORS 1

If you use POVME in your research, please cite the following reference:  Durrant, J. D., C. A. de Oliveira, et al. (2011). "POVME: An algorithm for measuring binding-pocket volumes." J Mol Graph Model 29(5): 773-776.

Parameters:

OUTPUTFILENAMEPREFIX: ./4R3M_every50ns/4R3M_every50ns_

POINTSINCLUDEREGIONS: sphere at (38.2, -47.5, 63.7), radius = 15.0

NUMFRAMES: -1

OutputFrameFilenamePrefix: ./4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_

MAXGROWITERATIONS: 10000000000.0

GRIDSPACING: 1.0

NUMPROCESSORS: 1

DISTANCECUTOFF: 1.09

COMPRESSOUTPUT: False

PDBFILENAME: ../1-trajectories/4R3M_every50ns_aligned.pdb

CONVEXHULLEXCLUSION: none

SAVEREGIONS: False

CONTIGUOUSPOINTSCRITERIA: 3

OUTPUTBASENAME: 4R3M_every50ns_

---------------------------------

PARAMETERS DEFINED

---------------------------------

---------------------------------

ABOUT TO LOAD RECEPTORS

---------------------------------

Reading frames from ../1-trajectories/4R3M_every50ns_aligned.pdb

Further processing frame 1

Further processing frame 2

Further processing frame 3

Further processing frame 4

Further processing frame 5

---------------------------------

RECEPTORS LOADED

---------------------------------

Generating the pocket-encompassing point field

Saving the point field as a PDB and NPY file

Point field saved to ./4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_inclusion.pdb to permit visualization Point field saved to ./4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_inclusion.npy to optionally load for the volume calculation

Calculating the pocket volume of each frame

---------------------------------

STARTING CALC VOLUME

---------------------------------

Frame 1:  Volume 2399.0 A^3  Surf. A. 1136.0 A^2

---------------------------------

FINISHING CALC VOLUME

---------------------------------

---------------------------------

STARTING CALC VOLUME

---------------------------------

Frame 2:  Volume 2038.0 A^3  Surf. A. 1172.0 A^2

---------------------------------

FINISHING CALC VOLUME

---------------------------------

---------------------------------

STARTING CALC VOLUME

---------------------------------

Frame 3:  Volume 1925.0 A^3  Surf. A. 1153.0 A^2

---------------------------------

FINISHING CALC VOLUME

---------------------------------

---------------------------------

STARTING CALC VOLUME

---------------------------------

Frame 4:  Volume 2121.0 A^3  Surf. A. 1147.0 A^2

---------------------------------

FINISHING CALC VOLUME

---------------------------------

---------------------------------

STARTING CALC VOLUME

---------------------------------

Frame 5:  Volume 2255.0 A^3  Surf. A. 1177.0 A^2

---------------------------------

FINISHING CALC VOLUME

---------------------------------

---------------------------------

VOLUMES CALCULATED

---------------------------------

FRAME        | VOLUME (A^3) | SURF. A. (A^2)

-------------+--------------+----------------

1            |  2399.0    |  1136.0

2            |  2038.0    |  1172.0

3            |  1925.0    |  1153.0

4            |  2121.0    |  1147.0

5            |  2255.0    |  1177.0

Execution time = 83.1753201485 sec

---------------------------------

ABOUT TO CALCULATE OCCUPANCY AVERAGE

---------------------------------

---------------------------------

CALCULATED OCCUPANCY AVERAGE

---------------------------------

---------------------------------

ABOUT TO CALCULATE COLOR MAPS

---------------------------------

---------------------------------

CALCULATED COLOR MAPS

---------------------------------

---------------------------------

接下来的4个轨迹略,每个轨迹5帧

['../../2-POVME_analysis/4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_frame_1.npy'

, '../../2-POVME_analysis/1UYF_every50ns/1UYF_every50ns_frameInfo/1UYF_every50ns_frame_1.npy', '../../2-POVME_analysis/1UYL_every50ns/1UYL_every50ns_frameInfo/1UYL_every50ns_frame_1.npy', '../../2-POVME_analysis/3D0B_every50ns/3D0B_every50ns_frameInfo/3D0B_every50ns_frame_1.npy', '../../2-POVME_analysis/1UYI_every50ns/1UYI_every50ns_frameInfo/1UYI_every50ns_frame_1.npy', '../../2-POVME_analysis/4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_frame_2.npy', '../../2-POVME_analysis/1UYF_every50ns/1UYF_every50ns_frameInfo/1UYF_every50ns_frame_2.npy', '../../2-POVME_analysis/1UYL_every50ns/1UYL_every50ns_frameInfo/1UYL_every50ns_frame_2.npy', '../../2-POVME_analysis/3D0B_every50ns/3D0B_every50ns_frameInfo/3D0B_every50ns_frame_2.npy', '../../2-POVME_analysis/1UYI_every50ns/1UYI_every50ns_frameInfo/1UYI_every50ns_frame_2.npy', '../../2-POVME_analysis/4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_frame_3.npy', '../../2-POVME_analysis/1UYF_every50ns/1UYF_every50ns_frameInfo/1UYF_every50ns_frame_3.npy', '../../2-POVME_analysis/1UYL_every50ns/1UYL_every50ns_frameInfo/1UYL_every50ns_frame_3.npy', '../../2-POVME_analysis/3D0B_every50ns/3D0B_every50ns_frameInfo/3D0B_every50ns_frame_3.npy', '../../2-POVME_analysis/1UYI_every50ns/1UYI_every50ns_frameInfo/1UYI_every50ns_frame_3.npy', '../../2-POVME_analysis/4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_frame_4.npy', '../../2-POVME_analysis/1UYF_every50ns/1UYF_every50ns_frameInfo/1UYF_every50ns_frame_4.npy', '../../2-POVME_analysis/1UYL_every50ns/1UYL_every50ns_frameInfo/1UYL_every50ns_frame_4.npy', '../../2-POVME_analysis/3D0B_every50ns/3D0B_every50ns_frameInfo/3D0B_every50ns_frame_4.npy', '../../2-POVME_analysis/1UYI_every50ns/1UYI_every50ns_frameInfo/1UYI_every50ns_frame_4.npy', '../../2-POVME_analysis/4R3M_every50ns/4R3M_every50ns_frameInfo/4R3M_every50ns_frame_5.npy', '../../2-POVME_analysis/1UYF_every50ns/1UYF_every50ns_frameInfo/1UYF_every50ns_frame_5.npy', '../../2-POVME_analysis/1UYL_every50ns/1UYL_every50ns_frameInfo/1UYL_every50ns_frame_5.npy', '../../2-POVME_analysis/3D0B_every50ns/3D0B_every50ns_frameInfo/3D0B_every50ns_frame_5.npy', '../../2-POVME_analysis/1UYI_every50ns/1UYI_every50ns_frameInfo/1UYI_every50ns_frame_5.npy']The number of frames found was: 25 所有的五条轨迹25帧

Starting Tanimoto calculations

Overlap Matrix for Tanimoto calculation

[[1.        0.62900448 0.64765217 0.59501348 0.66082691 0.64882943

  0.62778152 0.50920051 0.64109882 0.74176218 0.64160972 0.66573329

  0.48633702 0.67280453 0.65585088 0.60796869 0.67383146 0.50098945

  0.64149636 0.7079489  0.56489576 0.62705055 0.56037358 0.67741935

  0.64851826]

[0.62900448 1.        0.65356004 0.695747  0.74812874 0.55058573

  0.69299182 0.49346926 0.64285714 0.73632742 0.5311263  0.69069935

  0.47880381 0.69395924 0.67008124 0.56946403 0.74684001 0.47969627

  0.67925207 0.70893372 0.54429101 0.70376956 0.5378712  0.66357474

  0.66725601]

[0.64765217 0.65356004 1.        0.62482615 0.6469553  0.5600713

  0.65291262 0.56429047 0.63038793 0.69622042 0.5225    0.65073788

  0.51325948 0.70431287 0.6398892  0.5645614  0.68756531 0.51535606

  0.67670251 0.68152866 0.54075813 0.64857437 0.58823529 0.66642883

  0.67958748]

[0.59501348 0.695747  0.62482615 1.        0.70594673 0.53978873

  0.6518724  0.50803342 0.6281407  0.668637  0.50583245 0.6364587

  0.47801837 0.65541741 0.68194741 0.54454073 0.6650619  0.47211286

  0.66370107 0.67572336 0.52339861 0.6363016  0.5421123  0.63732024

  0.66021808]

[0.66082691 0.74812874 0.6469553  0.70594673 1.        0.58947368

  0.69208939 0.51531764 0.64684554 0.72531418 0.54733406 0.65937169

  0.50768717 0.67300108 0.71021292 0.59129815 0.70105263 0.49666889

  0.67178571 0.70430108 0.57721345 0.65586207 0.55398587 0.67406877

  0.66819626]

[0.64882943 0.55058573 0.5600713  0.53978873 0.58947368 1.

  0.55807464 0.47963649 0.59841629 0.60843158 0.66512605 0.58087707

  0.48221205 0.62072808 0.57082153 0.6120155  0.5830721  0.47951977

  0.6001462  0.58773181 0.56564551 0.53864569 0.50365726 0.59525547

  0.5790789 ]

[0.62778152 0.69299182 0.65291262 0.6518724  0.69208939 0.55807464

  1.        0.49734126 0.64242639 0.71203639 0.53094233 0.74272198

  0.46046364 0.66456583 0.66735751 0.54761905 0.71091854 0.46638924

  0.66167247 0.71070293 0.5357377  0.72730484 0.52411784 0.66445299

  0.71499822]

[0.50920051 0.49346926 0.56429047 0.50803342 0.51531764 0.47963649

  0.49734126 1.        0.4659164  0.49798074 0.44208754 0.4935106

  0.67952522 0.50482005 0.46926166 0.49799331 0.50386399 0.64959766

  0.49713193 0.51492063 0.54539363 0.50265045 0.67279942 0.4849636

  0.50380469]

[0.64109882 0.64285714 0.63038793 0.6281407  0.64684554 0.59841629

  0.64242639 0.4659164  1.        0.70007289 0.56882129 0.6453824

  0.46138415 0.7118003  0.66148844 0.54910394 0.6953547  0.4711051

  0.69124767 0.6898572  0.52081911 0.6340694  0.50841751 0.71223565

  0.70293353]

[0.74176218 0.73632742 0.69622042 0.668637  0.72531418 0.60843158

  0.71203639 0.49798074 0.70007289 1.        0.596      0.72039943

  0.46971139 0.77501853 0.71136924 0.58980583 0.78556034 0.48138639

  0.72433597 0.76931447 0.54114958 0.69243986 0.54143646 0.74563319

  0.73240443]

[0.64160972 0.5311263  0.5225    0.50583245 0.54733406 0.66512605

  0.53094233 0.44208754 0.56882129 0.596      1.        0.58644946

  0.421875  0.59056886 0.54448017 0.58232303 0.56996103 0.42802102

  0.57206045 0.55297065 0.49874507 0.5276907  0.46175243 0.59356287

  0.5565186 ]

[0.66573329 0.69069935 0.65073788 0.6364587  0.65937169 0.58087707

  0.74272198 0.4935106  0.6453824  0.72039943 0.58644946 1.

  0.45285439 0.6780086  0.63786575 0.57559775 0.73573466 0.4778906

  0.66430595 0.705      0.53833333 0.72548318 0.53355372 0.6665483

  0.69828203]

[0.48633702 0.47880381 0.51325948 0.47801837 0.50768717 0.48221205

  0.46046364 0.67952522 0.46138415 0.46971139 0.421875  0.45285439

  1.        0.47039581 0.45989134 0.49044807 0.46247655 0.72191235

  0.47049234 0.4763285  0.56914894 0.46643445 0.67067217 0.46011673

  0.46793426]

[0.67280453 0.69395924 0.70431287 0.65541741 0.67300108 0.62072808

  0.66456583 0.50482005 0.7118003  0.77501853 0.59056886 0.6780086

  0.47039581 1.        0.66302817 0.58502674 0.75453885 0.48441645

  0.75573956 0.72327273 0.53126045 0.66527488 0.53825593 0.75906344

  0.71335031]

[0.65585088 0.67008124 0.6398892  0.68194741 0.71021292 0.57082153

  0.66735751 0.46926166 0.66148844 0.71136924 0.54448017 0.63786575

  0.45989134 0.66302817 1.        0.59231853 0.68267581 0.4729686

  0.65780266 0.67899512 0.5376735  0.63218005 0.51506317 0.67706487

  0.67120894]

[0.60796869 0.56946403 0.5645614  0.54454073 0.59129815 0.6120155

  0.54761905 0.49799331 0.54910394 0.58980583 0.58232303 0.57559775

  0.49044807 0.58502674 0.59231853 1.        0.58384668 0.4998245

  0.55924502 0.57137907 0.65694813 0.55111713 0.50736554 0.55897795

  0.54632807]

[0.67383146 0.74684001 0.68756531 0.6650619  0.70105263 0.5830721

  0.71091854 0.50386399 0.6953547  0.78556034 0.56996103 0.73573466

  0.46247655 0.75453885 0.68267581 0.58384668 1.        0.47717185

  0.73107143 0.74689386 0.5461039  0.7037037  0.54287556 0.72058301

  0.73598297]

[0.50098945 0.47969627 0.51535606 0.47211286 0.49666889 0.47951977

  0.46638924 0.64959766 0.4711051  0.48138639 0.42802102 0.4778906

  0.72191235 0.48441645 0.4729686  0.4998245  0.47717185 1.

  0.46941446 0.49557812 0.58810811 0.49870045 0.72211426 0.4748438

  0.48074413]

[0.64149636 0.67925207 0.67670251 0.66370107 0.67178571 0.6001462

  0.66167247 0.49713193 0.69124767 0.72433597 0.57206045 0.66430595

  0.47049234 0.75573956 0.65780266 0.55924502 0.73107143 0.46941446

  1.        0.7113031  0.5105194  0.65894955 0.53710483 0.72945966

  0.71630948]

[0.7079489  0.70893372 0.68152866 0.67572336 0.70430108 0.58773181

  0.71070293 0.51492063 0.6898572  0.76931447 0.55297065 0.705

  0.4763285  0.72327273 0.67899512 0.57137907 0.74689386 0.49557812

  0.7113031  1.        0.55322688 0.6764099  0.55140496 0.72869629

  0.72192321]

[0.56489576 0.54429101 0.54075813 0.52339861 0.57721345 0.56564551

  0.5357377  0.54539363 0.52081911 0.54114958 0.49874507 0.53833333

  0.56914894 0.53126045 0.5376735  0.65694813 0.5461039  0.58810811

  0.5105194  0.55322688 1.        0.53866232 0.56885813 0.51020408

  0.52481104]

[0.62705055 0.70376956 0.64857437 0.6363016  0.65586207 0.53864569

  0.72730484 0.50265045 0.6340694  0.69243986 0.5276907  0.72548318

  0.46643445 0.66527488 0.63218005 0.55111713 0.7037037  0.49870045

  0.65894955 0.6764099  0.53866232 1.        0.55409836 0.65424431

  0.66815835]

[0.56037358 0.5378712  0.58823529 0.5421123  0.55398587 0.50365726

  0.52411784 0.67279942 0.50841751 0.54143646 0.46175243 0.53355372

  0.67067217 0.53825593 0.51506317 0.50736554 0.54287556 0.72211426

  0.53710483 0.55140496 0.56885813 0.55409836 1.        0.53528628

  0.53522054]

[0.67741935 0.66357474 0.66642883 0.63732024 0.67406877 0.59525547

  0.66445299 0.4849636  0.71223565 0.74563319 0.59356287 0.6665483

  0.46011673 0.75906344 0.67706487 0.55897795 0.72058301 0.4748438

  0.72945966 0.72869629 0.51020408 0.65424431 0.53528628 1.

  0.697733  ]

[0.64851826 0.66725601 0.67958748 0.66021808 0.66819626 0.5790789

  0.71499822 0.50380469 0.70293353 0.73240443 0.5565186  0.69828203

  0.46793426 0.71335031 0.67120894 0.54632807 0.73598297 0.48074413

  0.71630948 0.72192321 0.52481104 0.66815835 0.53522054 0.697733

  1.        ]]

Starting Tversky calculations

Overlap Matrix for Tversky calculation

[[1.        0.76115048 0.77615673 0.73614006 0.7861609  0.72780325

  0.77615673 0.66902876 0.74906211 0.86327637 0.70446019 0.79283035

  0.62317632 0.79199667 0.79199667 0.71238016 0.82325969 0.63318049

  0.77198833 0.8315965  0.70029179 0.78074198 0.70029179 0.79658191

  0.78449354]

[0.78369099 1.        0.79184549 0.82145923 0.85793991 0.66566524

  0.8360515  0.66480687 0.76094421 0.87253219 0.63347639 0.8223176

  0.62532189 0.81845494 0.81416309 0.69313305 0.88755365 0.62360515

  0.81072961 0.84463519 0.69356223 0.84935622 0.69184549 0.79828326

  0.80987124]

[0.79640719 0.78913601 1.        0.76860565 0.786142  0.67194183

  0.80538922 0.7245509  0.75064157 0.84302823 0.6257485  0.79213003

  0.65397776 0.82420873 0.79041916 0.68819504 0.84431138 0.65312233

  0.8075278  0.82378101 0.68947819 0.8075278  0.73139435 0.79897348

  0.81736527]

[0.75631692 0.81970021 0.76959315 1.        0.82869379 0.65653105

  0.80513919 0.67708779 0.74946467 0.8235546  0.61284797 0.78201285

  0.62398287 0.79014989 0.82184154 0.67280514 0.82826552 0.61627409

  0.7987152  0.82012848 0.67537473 0.7987152  0.69464668 0.77815846

  0.80385439]

[0.80563862 0.85390859 0.78513456 0.82656984 1.        0.69372063

  0.83340453 0.68261427 0.76206749 0.86287911 0.64459633 0.79794959

  0.64886801 0.80179411 0.84066638 0.70824434 0.85348142 0.6369073

  0.80350278 0.83938488 0.71849637 0.8124733  0.70354549 0.80392994

  0.80862879]

[0.85672228 0.76104024 0.77085378 0.75220805 0.79685967 1.

  0.78508342 0.69921492 0.77870461 0.83562316 0.7767419  0.79293425

  0.671737  0.81992149 0.79097154 0.7747792  0.82139352 0.66633955

  0.80569185 0.80863592 0.76104024 0.77281649 0.70951914 0.80029441

  0.79587831]

[0.76657061 0.80197612 0.77521614 0.77398106 0.8032112  0.65870729

  1.        0.65459037 0.74557431 0.83779333 0.621655  0.84026348

  0.59695348 0.78139152 0.79538905 0.66282421 0.8443804  0.59983532

  0.78180321 0.82832441 0.67270482 0.84767394 0.66652944 0.7826266

  0.82626595]

[0.68066158 0.65691264 0.71840543 0.67048346 0.67769296 0.6043257

  0.67430025 1.        0.61450382 0.6798134  0.55682782 0.66115352

  0.7769296  0.66624258 0.64418999 0.63146735 0.69126378 0.75318066

  0.66157761 0.68787108 0.69041561 0.6836302  0.79092451 0.64970314

  0.67387617]

[0.81644707 0.80554294 0.79736483 0.79509314 0.81054066 0.72103589

  0.82280781 0.65833712 1.        0.8727851  0.67969105 0.81281236

  0.62698773 0.85506588 0.83189459 0.69604725 0.87732849 0.63334848

  0.84325307 0.85597456 0.69332122 0.82189914 0.68605179 0.85688323

  0.86006361]

[0.84050325 0.82508117 0.79991883 0.78043831 0.81980519 0.6911526

  0.82589286 0.65056818 0.77962662 1.        0.66517857 0.81980519

  0.60105519 0.84862013 0.82021104 0.69034091 0.88758117 0.60876623

  0.81899351 0.86079545 0.67248377 0.81777597 0.67613636 0.83157468

  0.83198052]

[0.87792208 0.76675325 0.76      0.74337662 0.7838961  0.82233766

  0.78441558 0.68207792 0.77714286 0.85142857 1.        0.82285714

  0.63116883 0.81974026 0.79168831 0.77350649 0.83584416 0.63480519

  0.80623377 0.8025974  0.7225974  0.78701299 0.68987013 0.8238961

  0.80051948]

[0.8059322  0.81186441 0.78474576 0.77372881 0.79152542 0.68474576

  0.86483051 0.66059322 0.75805085 0.8559322  0.67118644 1.

  0.59830508 0.80211864 0.78516949 0.69364407 0.87415254 0.61822034

  0.79491525 0.83644068 0.68432203 0.85889831 0.68389831 0.79533898

  0.82669492]

[0.68894009 0.67142857 0.70460829 0.67142857 0.7        0.63087558

  0.66820276 0.84423963 0.6359447  0.68248848 0.55990783 0.65069124

  1.        0.66267281 0.66313364 0.65069124 0.68156682 0.83502304

  0.66497696 0.68156682 0.73963134 0.67880184 0.82304147 0.65391705

  0.66912442]

[0.8172043  0.82021505 0.8288172  0.79354839 0.80731183 0.71870968

  0.81634409 0.67569892 0.80946237 0.89935484 0.67870968 0.81419355

  0.61849462 1.        0.80989247 0.70580645 0.89376344 0.6283871

  0.86365591 0.85548387 0.68344086 0.82236559 0.69290323 0.86451613

  0.84344086]

[0.79232694 0.7910759  0.7706422  0.80025021 0.8206839  0.67222686

  0.80567139 0.63344454 0.76355296 0.84278565 0.63552961 0.77272727

  0.6000834  0.7852377  1.        0.70100083 0.82985822 0.60925771

  0.78398666 0.81150959 0.67848207 0.78482068 0.66305254 0.79649708

  0.80108424]

[0.805752  0.76143329 0.75860443 0.74068835 0.78170674 0.74446016

  0.75907591 0.70202735 0.7223008  0.8019802  0.70202735 0.77180575

  0.66572372 0.77369165 0.79255068 1.        0.80433758 0.67138142

  0.75436115 0.77746346 0.81801037 0.76756247 0.69825554 0.75294672

  0.75058934]

[0.78779418 0.82489031 0.78739529 0.77143997 0.79696849 0.66773036

  0.81810929 0.6501795  0.77024332 0.8723574  0.64180295 0.82289589

  0.58994815 0.82887914 0.79377742 0.68049462 1.        0.60031911

  0.81651376 0.8392501  0.67092142 0.81850818 0.67171919 0.8085361

  0.82728361]

[0.70585502 0.67518587 0.70957249 0.6686803  0.69284387 0.63104089

  0.67704461 0.82527881 0.64776952 0.69702602 0.56784387 0.67797398

  0.84200743 0.67890335 0.67890335 0.66171004 0.69934944 1.

  0.66682156 0.70306691 0.75836431 0.71328996 0.8633829  0.67100372

  0.68447955]

[0.79145299 0.80726496 0.80683761 0.79700855 0.80384615 0.7017094

  0.81153846 0.66666667 0.79316239 0.86239316 0.66324786 0.8017094

  0.61666667 0.85811966 0.8034188  0.68376068 0.87478632 0.61324786

  1.        0.84444444 0.66367521 0.81495726 0.68974359 0.84230769

  0.84273504]

[0.82642916 0.81524441 0.7978459  0.79328915 0.81400166 0.68268434

  0.83347142 0.67191384 0.78044739 0.87862469 0.64001657 0.81772991

  0.61267606 0.82394366 0.8061309  0.68309859 0.87158244 0.62676056

  0.81855841 1.        0.68889809 0.81483016 0.69096935 0.82891466

  0.83347142]

[0.74501109 0.71662971 0.71485588 0.69933481 0.745898  0.68780488

  0.72461197 0.72195122 0.6767184  0.73481153 0.61685144 0.71618625

  0.71175166 0.70465632 0.72150776 0.76940133 0.745898  0.72372506

  0.6886918  0.73747228 1.        0.73215078 0.72904656 0.68736142

  0.70820399]

[0.76107273 0.80414466 0.76716782 0.75782202 0.77285656 0.63998375

  0.83665177 0.65501829 0.73506705 0.81877286 0.61560341 0.82364892

  0.59853718 0.77691995 0.76472978 0.66151971 0.8338074  0.62373019

  0.77488826 0.79926859 0.6708655  1.        0.68671272 0.77041853

  0.78870378]

[0.73716542 0.70732778 0.75032909 0.71171566 0.72268539 0.63448881

  0.7103993  0.81834138 0.6625713  0.73102238 0.58271172 0.70820535

  0.78367705 0.70688899 0.69767442 0.64984642 0.73892058 0.81526986

  0.70820535 0.73189996 0.72136902 0.74155331 1.        0.70557262

  0.71347082]

[0.81911702 0.79725675 0.80068581 0.77882555 0.80668667 0.69909987

  0.81483069 0.65666524 0.8084012  0.87826832 0.6798114  0.80454351

  0.60822975 0.86155165 0.81868838 0.68452636 0.86883841 0.61894556

  0.84483498 0.85769396 0.66438063 0.81268753 0.68924132 1.

  0.83111873]

[0.78909853 0.79119497 0.80125786 0.7870021  0.79371069 0.68008386

  0.84150943 0.66624738 0.79371069 0.85953878 0.64612159 0.81802935

  0.60880503 0.82222222 0.80545073 0.66750524 0.86960168 0.61761006

  0.82683438 0.84360587 0.66960168 0.81383648 0.68176101 0.8129979

  1.        ]]

Given index file (-i) and prefix-to-trajectory mapping (-t or -T). Clustering will return prefix and frame numbers of cluster members, and will extract representatives./home/amax/Downloads/miniconda2/lib/python2.7/site-packages/scipy/cluster/hierarchy.py:490:

ClusterWarning: scipy.cluster: The symmetric non-negative hollow observation matrix looks suspiciously like an uncondensed distance matrix  return linkage(y, method='average', metric='euclidean')

Matrix-index-to-trajectory-frame mapping given. Writing out trajectory frames to cluster_reps.csv and cluster_members.csv.Extracting trajectory frames

Generating difference maps

[16, 4, 2, 2, 1]

5 2 3




[4]+ Stopped python ../analyzeClusterMembershipInGroups.py

Traceback (most recent call last):

  File "/home/amax/Downloads/miniconda2/bin/pocketPointsPca.py", line 9, in <module>

    from sklearn.preprocessing import StandardScaler

ImportError: No module named sklearn.preprocessing

[('1UYI_every50ns', 5), ('3D0B_every50ns', 5), ('1UYL_every50ns', 5), ('4R3M_every50ns', 5),

('1UYF_every50ns', 5)]

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 202,607评论 5 476
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 85,047评论 2 379
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 149,496评论 0 335
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 54,405评论 1 273
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 63,400评论 5 364
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 48,479评论 1 281
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 37,883评论 3 395
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 36,535评论 0 256
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 40,743评论 1 295
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 35,544评论 2 319
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 37,612评论 1 329
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 33,309评论 4 318
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 38,881评论 3 306
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 29,891评论 0 19
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 31,136评论 1 259
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 42,783评论 2 349
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 42,316评论 2 342

推荐阅读更多精彩内容