Replicated maxent model for Culiseta_annulata


This page summarizes the results of 10 bootstrap models for Culiseta_annulata, created Tue Jun 19 15:26:58 CDT 2012 using Maxent version 3.3.3k. The individual models are here: [0] [1] [2] [3] [4] [5] [6] [7] [8] [9]


Analysis of omission/commission

The following picture shows the training omission rate and predicted area as a function of the cumulative threshold, averaged over the replicate runs.


The next picture is the receiver operating characteristic (ROC) curve for the same data, again averaged over the replicate runs. Note that the specificity is defined using predicted area, rather than true commission (see the paper by Phillips, Anderson and Schapire cited on the help page for discussion of what this means). The average training AUC for the replicate runs is 0.991, and the standard deviation is 0.003.



Pictures of the model

The following two pictures show the point-wise mean and standard deviation of the 10 output grids. Other available summary grids are min, max and median.




Analysis of variable contributions


The following table gives estimates of relative contributions of the environmental variables to the Maxent model. To determine the first estimate, in each iteration of the training algorithm, the increase in regularized gain is added to the contribution of the corresponding variable, or subtracted from it if the change to the absolute value of lambda is negative. For the second estimate, for each environmental variable in turn, the values of that variable on training presence and background data are randomly permuted. The model is reevaluated on the permuted data, and the resulting drop in training AUC is shown in the table, normalized to percentages. As with the variable jackknife, variable contributions should be interpreted with caution when the predictor variables are correlated. Values shown are averages over replicate runs.

VariablePercent contributionPermutation importance
bio_435.472.2
bio_1931.61.7
flowdir6.69.6
bio_156.20.6
bio_25.92.1
bio_124.81.8
soils_so2.30.2
dem2.22.2
bio_131.98.1
bio_61.20.8
slope1.20
topoind0.60.3
bio_1400.4
bio_900
flowacc00
bio_500
bio_100


The following picture shows the results of the jackknife test of variable importance. The environmental variable with highest gain when used in isolation is bio_4, which therefore appears to have the most useful information by itself. The environmental variable that decreases the gain the most when it is omitted is bio_4, which therefore appears to have the most information that isn't present in the other variables. Values shown are averages over replicate runs.





Command line to repeat this species model: java density.MaxEnt nowarnings noprefixes -E "" -E Culiseta_annulata jackknife outputdirectory=G:\VectorMap\Models\Mosquito\Cu_annulata samplesfile=G:\VectorMap\OccPts\Mosquito\Cu_annulata\Cu_annulata_vm.csv environmentallayers=G:\VectorMap\Models\Mosquito\Cu_annulata\asc randomseed replicates=10 replicatetype=bootstrap noextrapolate nodoclamp maximumiterations=1000