Build a shared nearest-neighbor graph with cells as nodes. More...

#include <BuildSnnGraph.hpp>

Classes
struct	Defaults
	Default parameter settings. More...

struct	Results
	Results of SNN graph construction. More...

Public Types
enum	Scheme { RANKED , NUMBER , JACCARD }

Public Member Functions
BuildSnnGraph &	set_neighbors (int k=Defaults::neighbors)

BuildSnnGraph &	set_approximate (bool a=Defaults::approximate)

BuildSnnGraph &	set_weighting_scheme (Scheme w=Defaults::weighting_scheme)

BuildSnnGraph &	set_num_threads (int n=Defaults::num_threads)

Results	run (size_t ndims, size_t ncells, const double *mat) const

template<class Algorithm >
Results	run (const Algorithm *search) const

template<typename Index_ , typename Distance_ >
Results	run (const knncolle::NeighborList< Index_, Distance_ > &neighbors) const

template<class Indices_ >
Results	run (const std::vector< Indices_ > &indices) const

Detailed Description

Build a shared nearest-neighbor graph with cells as nodes.

In a shared nearest neighbor graph, pairs of cells are connected to each other by an edge with weight determined from their shared nearest neighbors. If two cells are close together but have distinct sets of neighbors, the corresponding edge is downweighted as the two cells are unlikely to be part of the same neighborhood. In this manner, highly weighted edges will form within highly interconnected neighborhoods where many cells share the same neighbors. This provides a more sophisticated definition of similarity between cells compared to a simpler (unweighted) nearest neighbor graph that just focuses on immediate proximity.

A key parameter in the construction of the graph is the number of nearest neighbors $k$ to consider. Larger values increase the connectivity of the graph and reduce the granularity of any subsequent community detection steps (see scran::ClusterSnnGraph) at the cost of speed. The nearest neighbor search can be performed using either vantage point trees (exact) or with the Annoy algorithm (approximate) - see the knncolle library for details.

For the edges, a variety of weighting schemes are possible:

RANKED defines the weight between two nodes as $k - r/2$ where $r$ is the smallest sum of ranks for any shared neighboring node (Xu and Su, 2015). For the purposes of this ranking, each node has a rank of zero in its own nearest-neighbor set. More shared neighbors, or shared neighbors that are close to both observations, will generally yield larger weights.
NUMBER defines the weight between two nodes as the number of shared nearest neighbors between them. The weight can range from zero to $k + 1$, as the node itself is included in its own nearest-neighbor set. This is a simpler scheme that is also slightly faster but does not account for the ranking of neighbors within each set.
JACCARD defines the weight between two nodes as the Jaccard index of their neighbor sets. This weight can range from zero to 1, and is a monotonic transformation of the weight used by NUMBER.

See the ClusterSNNGraph class to perform community detection on the graph returned by run().

See also: Xu C and Su Z (2015). Identification of cell types from single-cell transcriptomes using a novel clustering method. Bioinformatics 31, 1974-80

Member Enumeration Documentation

◆ Scheme

enum scran::BuildSnnGraph::Scheme

Choices for the edge weighting scheme during graph construction.

Member Function Documentation

◆ set_neighbors()

BuildSnnGraph & scran::BuildSnnGraph::set_neighbors ( int k = Defaults::neighbors )

inline

Parameters

k	Number of neighbors to use in the nearest neighbor search.

Returns: A reference to this BuildSnnGraph object.

◆ set_approximate()

BuildSnnGraph & scran::BuildSnnGraph::set_approximate ( bool a = Defaults::approximate )

inline

Parameters

a	Whether to perform an approximate nearest neighbor search.

Returns: A reference to this BuildSnnGraph object.

◆ set_weighting_scheme()

BuildSnnGraph & scran::BuildSnnGraph::set_weighting_scheme ( Scheme w = Defaults::weighting_scheme )

inline

Parameters

w	The edge weighting scheme to use.

Returns: A reference to this BuildSnnGraph object.

◆ set_num_threads()

BuildSnnGraph & scran::BuildSnnGraph::set_num_threads ( int n = Defaults::num_threads )

inline

Parameters

n	Number of threads to use.

Returns: A reference to this BuildSnnGraph object.

◆ run() [1/4]

Results scran::BuildSnnGraph::run	(	size_t	ndims,
		size_t	ncells,
		const double *	mat
	)		const

inline

Parameters

ndims	Number of dimensions.
ncells	Number of cells.
mat	Pointer to an array of expression values or a low-dimensional representation thereof. Rows should be dimensions while columns should be cells. Data should be stored in column-major format.

Returns: The edges and weights of the constructed SNN graph.

◆ run() [2/4]

template<class Algorithm >

Results scran::BuildSnnGraph::run ( const Algorithm * search ) const

inline

Template Parameters

Algorithm Any instance of a knncolle::Base subclass.

Parameters

search Pointer to a knncolle::Base instance to use for the nearest-neighbor search.

Returns: The edges and weights of the constructed SNN graph.

◆ run() [3/4]

template<typename Index_ , typename Distance_ >

Results scran::BuildSnnGraph::run ( const knncolle::NeighborList< Index_, Distance_ > & neighbors ) const

inline

Template Parameters

Index_	Integer type for the indices.
Distance_	Floating-point type for the distances.

Parameters

neighbors Vector of indices and distances of the neighbors for each cell, sorted by increasing distance.

Returns: The edges and weights of the constructed SNN graph.

Distances are ignored here; this overload is only provided to enable convenient usage with pre-computed neighbors from knncolle.

◆ run() [4/4]

template<class Indices_ >

Results scran::BuildSnnGraph::run ( const std::vector< Indices_ > & indices ) const

inline

Template Parameters

Indices_ Vector-like class containing integer indices. This should provide the [ and size() methods.

Parameters

indices Vector of indices of the neighbors for each cell, sorted by increasing distance.

Returns: The edges and weights of the constructed SNN graph.

The documentation for this class was generated from the following file:

scran/clustering/BuildSnnGraph.hpp

Classes

Public Types

Public Member Functions

Detailed Description

Member Enumeration Documentation

◆ Scheme

Member Function Documentation

◆ set_neighbors()

◆ set_approximate()

◆ set_weighting_scheme()

◆ set_num_threads()

◆ run() [1/4]

◆ run() [2/4]

◆ run() [3/4]

◆ run() [4/4]