PC-ORD: What’s New

	-	Hilltop Plot A hilltop plot is a way of showing more than one nonlinear response surface at a time. More...

	-	Contour Overlay Overlay a quantitative (Q) variable as a contoured surface on any ordination. More...

	-	Convex Hulls Filled Polygons A convex hull is an overlay that uses a polygon to enclose all of the points in a group. More...

	-	NMS Stress by Iteration
A real-time display of how stress is changing with each iteration of NMS. More...

New Matrix Format
Categorical values can now be either numerical or text with a maximum of 20 characters

New Traits Window and Analyses
Provides ways to relate data on species traits (trait matrix) to community samples (main matrix) and environmental data (second matrix).

New Analyses

Graph Enhancements

Software Management

Additions to Existing Analyses

Data Management

Step-by-Step v.7

New Graphs

Hilltop Plot

A hilltop plot is a way of showing more than one nonlinear response surface at a time on a single graph. The first use that we are aware of was in Nelson et al. (2015). In PC-ORD, response surfaces are superimposed on an ordination as an overlay. This enables simultaneous measurement and display of one- and two-dimensional, non-linear community–trait–environment associations.

For each selected overlay variable, we trace a particular contour that is specified as a percentage of that variable's range. Each contained area is a "hilltop", and multiple partially transparent hilltops are superimposed on one ordination. The resulting diagram shows the maxima of many nonlinear overlay variables (e.g. traits, species abundances, or environmental variables) in a single figure.

Because the hilltops are based on a contoured response surface, you can better understand the basis for hilltop plots by reading about contour overlays. The chief difference in use between the hilltops and the contour plots is that contour plots can be shown for only one variable at a time, while one can graph many hilltops on the same ordination. This comes at a loss of information in that most of the contour plot is discarded when converting to a hilltop, but has the advantage of representing multiple nonlinear relationships at once..
Contour Overlay

When you select a Q variable as an overlay, you are given a table of how the fit varies with the flexibility (smoothing parameter) used in the nonparametric regression of the overlay variable against the axis scores. If you choose "Optimize" flexibility, PC-ORD will select the flexibility with the highest cross-validated fit (xR²). If you wish to choose a specific value for flexibility, select "Specify" flexibility. Use the table presented under "Optimize" to help you choose a specific flexibility, if desired. Finally, you get a contour overlay onto the ordination space that you can customize to include color shading and/or contour interval labeling.

Convex Hulls Filled Polygons

A convex hull is an overlay that uses a polygon to enclose all of the points in a group. The purpose of this overlay is to show the outline of a group by using the outermost points of that group in an ordination or a scatterplot. This can help the viewer to discern if and how groups are separated or overlap in the scatterplot.

The basic rule for forming the convex hulls is that the outer points in a group are connected in a closed polygon such that adjoining segments always make an interior angle less than or equal to 180 degrees. At least three points are needed to draw a convex hull. A convex hull for just three points must, however, have each of those points as a vertex (corner) of the polygon.

NMS Stress by Iteration

Medium 4D Solution		Slow 6D Solution
	-
Click graphs for larger image

A real-time display of how stress is changing with each iteration of NMS. The display is updated with every single step that attempts to improve the configuration of points in the ordination space.

This graph is meant to be both entertaining and informative. The entertainment is that you get a dynamic, colorful, interesting window into the progress of NMS. For every run and dimensionality you can see how stress is reduced as the iterations increase. Each dimensionality is color coded, and separate panels are displayed for the real and randomized runs.

Informativeness of these graphs comes from the insights into stability and consistency of alternative NMS solutions. Unstable configurations show up as vertical zig-zags. Consistency shows up as minimum stress plateaus at a given level. The importance of dimensionality shows up as a decreasing series of final stress as dimensionality increases.

Traits

PC-ORD 7 provides ways to relate data on species traits (trait matrix) to community samples (main matrix) and environmental data (second matrix). While many of these operations can be done in the other PC-ORD menu items, the Traits menu provides several operations specific to this kind of data.

Functional Diversity
Functional diversity analyzes the combination of the sample unit x species matrix with a species x trait matrix. Functional diversity measures attempt to describe the diversity of species functional traits represented in a sample unit, rather that simply species diversity. For example, a plot containing three species, all having the same traits, are considered no more diverse than a single species. Similarly, two species with very different functional traits contribute more functional diversity than two species that are similar in their functional traits. For example, if we have two species in a plot and one is a weedy sun-loving pioneer plant species, and the other is a shade tolerant species with poor colonizing ability, that plot would have more functional diversity than two different species that were both shade tolerant poor colonizers.
Fourth Corner Analysis
The methodological question of linking species traits to environmental variables, via the sample unit x species matrix, is called the fourth corner problem because of the arrangement of four basic matrices (see the traits x environment positions in Dray and Legendre (2008, Fig. 1a) and McCune and Grace (2002, Fig. 2.1). Fourth Corner Analysis provides statistical tests of the strength of the links between these matrices. For a detailed explanation of the theory and mathematics of fourth corner analysis see Legendre et al. (1997), Dray and Legendre (2008), Ter Braak et al, (2012), and Dray et al. (2014).
Categorical to Binary
If, for a given variable, there are n unique categories (i.e., class levels with unique value labels), then n new binary (0/1) variables will be generated. Each new variable will be designated as a Q variable with value 0 or 1.
Create Trait Combinations
Create one new categorical variable by combining categories from two existing variables. Each combination of categories from the two selected variables is taken as a new category in the new variable. The resulting new variable is always categorical. The existing variables are left intact, but you can easily remove them with Modify | Delete Columns.

For example, say you had two categorical variables, one coding for native vs. non-native species, and one coding for annuals vs. perennials. That might work well in the analyses, but what if species having a combinations of those, for example the non-native annuals, is particularly different ecologically from all remaining species? You might, therefore, wish to create a new categorical variable with all four combinations of those trait categories: (1) native annuals, (2) native perennials, (3) non-native annuals, (4) non-native perennials.
Calculate SU x Traits Matrix
Calculating a sample unit x trait matrix provides a flexible first step in analyzing the relationships between species traits and explanatory variables. This matrix is obtained by multiplying a sample unit x species matrix by a species x trait matrix, but the content of the resulting matrix depends on whether and how traits are standardized and whether or not the multiplication is followed by a weighted averaging step (McCune 2015). To maximize versatility of the SU x trait matrix, including comparability among traits, and usability with a wide range of distance measures, we recommend first standardizing traits by min-to-max, then calculating abundance-weighted trait averages in each sample unit.
Species Distances in Trait Space
Species can be compared in their traits by calculating a distance matrix among species, starting with a species x trait matrix. Mathematically this is the same as calculating a distance matrix among sample units in species space, except that in this case the objects are species and their attributes are traits, rather than objects being sample units and the attributes species. The same distance measures are offered from the traits menu as for distances between sample units in species space.

New Matrix File Format

MatrixFormat.jpg (98652 bytes)

Interface

Row and column indentifiers maximum length increased from 8 characters to 12 characters.
Categorical values can now be either numerical (ShadeTolerQ in above example) or text (Dispersal, Leaf, Leaf, LearPersist, and ShadeTolerC in above example) with a maximum of 20 characters.
Matrix size increased from 32,000 rows x 32,000 columns to
2,000,000 rows x 2,000,000 columns with a maximum of 536,848,900 elements with some limitations.
Note: You can still import Excel old .wk1 files as well as .xls and .xlsx files.

New Analyses

Fourth Corner Analysis
The methodological question of linking species traits to environmental variables, via the sample unit x species matrix, is called the fourth corner problem because of the arrangement of four basic matrices (see the traits x environment positions in Dray and Legendre (2008, Fig. 1a) and McCune and Grace (2002, Fig. 2.1). Fourth Corner Analysis provides statistical tests of the strength of the links between these matrices. For a detailed explanation of the theory and mathematics of fourth corner analysis see Legendre et al. (1997), Dray and Legendre (2008), Ter Braak et al, (2012), and Dray et al. (2014).
Fuzzy Set (FSO)
Fuzzy set ordination applies fuzzy set theory to direct gradient analysis in ecological ordination. This ordination method requires the user to hypothesize the relationship between species communities and environmental variables or other predictors. The predictors are most commonly environmental variables, but they can also be a secondary set of species communities, or any other quantitative data set with the same number of rows as the community matrix. The community data are placed in the main matrix, and the secondary set is in the second matrix. The resulting ordination is an ordination of sample units in species space. Species can be superimposed on the ordination by a single weighted averaging step
Distance-based Redundancy Analysis (dbRDA)
Distance-based redundancy analysis (dbRDA) is similar to redundancy analysis (RDA), except that the main matrix is replaced by its principal coordinates, using a distance measure of your choice. The purpose of this variant is to allow you to choose non-Euclidean distance measures such as Sorensen (Bray-Curtis), that have proven effective in community ecology.
Categorical Counts
Categorical Counts provides a way to track the number of cases (rows, usually sample units), with a given categorical value. By default this is done for all categorical variables in the selected matrix. This provides a quick assessment of the frequency of categories, which is useful for such issues as balance in experimental designs,or sampling effectiveness in different categories.
Functional Diversity
Functional diversity analyzes the combination of the sample unit x species matrix with a species x trait matrix. The rationale and use of functional diversity measures in PC-ORD are described in the following topics.

Additions to Existing Analyses

Gower Distance
Gower's (1971a) coefficient is highly unusual among measures of similarity or dissimilarity, because it can be calculated based on qualitative (categorical) data, quantitative data, or mixtures of the two. Categorical data are handled as a matching problem: items that share a qualitative attribute receive a unit of similarity from that attribute. See Legendre and Legendre (1998) for a detailed description of the method.
Gower Distance, ignore 0,0
Gower (1971) and Legendre and Legendre (1998) suggested an intriguing but largely untested variant of Gower's coefficient of similarity that ignore 0,0 data pairs. If these double zeros are considered ambiguous information, you can exclude them from the computation of the coefficient. Sensitivity to double zeros is notorious in community ecology in producing unwanted effects on an analysis (Legendre & Legendre 1989, p. 253; McCune & Grace 2002, p. 38, 51).

Legendre and Legendre (1998) presented a modified version of the Gower coefficient of similarity (S19), calling it " asymmetrical" because matching zeros are handled differently than nonzeros. It is the same as the Gower coefficient, except that for quantitative variables (0,0) pairs are excluded and the sum of partial similarities is thus divided not by p variables, but by p* the number of non (0,0) pairs.

Note that this sense of "asymmetric" is different than in matrix symmetry. If Legendre's asymmetric version of Gower's similarity is converted to a distance (or dissimilarity) and used to build a distance matrix, this is still a symmetric matrix. In other words the distance between items A and B is the same as the distance between B and A, even if using the asymmetric version of Gower's similarity. To avoid that confusion PC-ORD uses the terminology "Gower, Ignore 0,0" in the menu system and output files.
Morisita-Horn Distance
Horn (1966) modified Morisita's (1959) similarity measure, resulting in what is now known as Morista-Horn similarity or distance. The chief appeal of this distance measure is that it is relatively insensitive to sampling effort (Wolda 1981). It is thus most useful for cases where it is impossible to control the time, area, or volume of sampling.

The Morisita-Horn index is only weakly influenced by minor species, which is largely the reason that the method resistant to undersampling. At the same time, it makes the method insensitive to pattern that is carried in minor species.
Increased maximum number of blocks or groups in perMANOVA and Indicator Species Analysis to 1000
New Summary | Write distance matrix options:
Bootstrapped confidence intervals for species area curve.
PCA option to write prediction equations into text file.
PCA option to write eigenvalues from randomizations into spreadsheet.
Added R²perm adjusted variance represented in CCA using method of Peres-Neto et al. (2006).
Added Ezekiel and R²perm adjusted variance represented in RDA, using methods of Peres-Neto et al. (2006).
Added measures of fit, including randomization-based methods, to NMS.
Dust bunny index as measure of departure from multivariate normality toward the dust bunny distribution.
New Summary | Write distance matrix options:
- write subdiagonal distances only (for sequential samples)
- write every other subdiagonal distances (for paired samples)

Graph Enhancements

Apply button to instantly preview changes without exiting Preferences dialog
Save scores for group centroids to spreadsheet or text file
Save graphs as TIFF
Species Frequency Label Cutoff
Overlay from third axis
Control over number of tick marks
Reorder legend
Customization of toolbar

Improved Data Format and Management

New traits window for simultaneous display of species traits matrix.
Text allowed in the categorical variable type. For example, instead of coding a C variable for experimental treatments as 1, 2, and 3, you can code them as "burned", "mowed", and control". The consequence of this is that categorical variables can no longer be used in arithmetic operations, only to define groups of items.
Conversion between variable types:

Categorical to Quantitative

Quantitative to Categorical

Categorical to Binary

Row names and column names (acronyms) increased to 12 character maximum. See Summary of Limitations.
Spreadsheet design allows up to 2 million rows or columns. See Summary of Limitations and memory use.
Streamlined import/export to Excel
New Project redesigned for drag and drop setup of matrices and easier file management
Save All to save groups of related new files with common base name.
Export Project is a convenient way to move all the files and settings for a given project from one machine to another.
Modifed File | Import simple spreadsheet options to include defining a specific range of cells for the matrix
Compact format: Code numbers increased from 4-digit max to 8-digit max. Code names (species acronyms) increased from 8 character max to 12-character max.
Click and drag to switch matrices.
Convert main or second matrix to graph file. This provides an easy way to use ordination coordinates produced outside PC-ORD. In version 6 this was available only as the add-in, "wk1togph".
Recode categorical variables (numbers to text or text to text)

Software Management

Built-in free update to latest version
Distribution by download from web with license key

Multivariate Analysis for Ecologists: Step-by-Step

Image of book cover Step-by-Step using PC-ORD

Second edition, custom-made for the beginning data analyst, this book also includes techniques and tips useful to advanced users. A 10-step analysis process using tools available in versions 5-7 guides both users who can, and cannot, participate in PC-ORD training.