Layer

HeteroEmbedLayer

Embedding layer for a featureless heterograph.

GeneralLinear

A general linear layer, combined with activation, normalization (batch and L2), dropout, and so on.

HeteroLinearLayer

Transform features with nn.Linear.

HeteroMLPLayer

HeteroMLPLayer contains multiple GeneralLinear layers, unlike HeteroLinearLayer.

HeteroFeature

A feature preprocessing component that handles various heterogeneous feature situations.

MetapathConv

MetapathConv is a meta-path-based aggregation function, similar to dgl.nn.pytorch.HeteroGraphConv.

HeteroGraphConv

A generic module for computing convolution on heterogeneous graphs.

ATTConv

The macro-level layer of the HetGNN model.

MacroConv

Macro-level convolution using multi-head attention.

SemanticAttention

CompConv

Composition-based convolution, introduced in Composition-based Multi-Relational Graph Convolutional Networks.

AttConv

Attention-based convolution, introduced in Hybrid Micro/Macro Level Convolution for Heterogeneous Graph Learning.

LSTMConv

Aggregates the neighbors with an LSTM.

class HeteroEmbedLayer(n_nodes_dict, embed_size, embed_name='embed', activation=None, dropout=0.0)[source]

Embedding layer for a featureless heterograph.

Parameters
  • n_nodes_dict (dict[str, int]) – Key of dict is the node type; value of dict is the number of nodes.

  • embed_size (int) – Dimension of embedding.

  • activation (callable activation function/layer or None, optional) – If not None, applies an activation function to the updated node features. Default: None.

  • dropout (float, optional) – Dropout rate. Default: 0.0

forward()[source]

Returns

The output embeddings.

forward_nodes(nodes_dict)[source]

Parameters

nodes_dict (dict[str, th.Tensor]) – Key of dict is the node type; value of dict is the indices of nodes.

Returns

out_feature – Output feature.

Return type

dict[str, th.Tensor]
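A minimal usage sketch (the node counts and indices are hypothetical; forward() takes no arguments, as documented above):

>>> import torch as th
>>> layer = HeteroEmbedLayer({'author': 10, 'paper': 5}, embed_size=64)
>>> h_dict = layer()                       # embeddings for all nodes of every type
>>> h_sub = layer.forward_nodes({'author': th.tensor([0, 2])})  # selected nodes only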

class GeneralLinear(in_features, out_features, act=None, dropout=0.0, has_l2norm=True, has_bn=True, **kwargs)[source]

A general linear layer, combined with activation, normalization (batch and L2), dropout, and so on.

Parameters
  • in_features (int) – Size of each input sample, which is fed into nn.Linear.

  • out_features (int) – Size of each output sample, which is fed into nn.Linear.

  • act (callable activation function/layer or None, optional) – If not None, applies an activation function to the updated node features. Default: None.

  • dropout (float, optional) – Dropout rate. Default: 0.0

  • has_l2norm (bool) – If True, applies torch.nn.functional.normalize to the node features at the end of forward(). Default: True

  • has_bn (bool) – If True, applies torch.nn.BatchNorm1d to the node features after applying nn.Linear. Default: True

forward(batch_h: Tensor) → Tensor[source]

Applies Linear, BatchNorm1d, Dropout, and L2 normalization (if needed).
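A minimal sketch of the pipeline described above. The ordering Linear → BatchNorm1d → Dropout → activation → L2 normalization is assumed from the docstring; the real layer may differ in detail:

    import torch.nn as nn
    import torch.nn.functional as F

    class GeneralLinearSketch(nn.Module):
        def __init__(self, in_features, out_features, act=None,
                     dropout=0.0, has_l2norm=True, has_bn=True):
            super().__init__()
            self.linear = nn.Linear(in_features, out_features)
            self.bn = nn.BatchNorm1d(out_features) if has_bn else None
            self.dropout = nn.Dropout(dropout)
            self.act = act
            self.has_l2norm = has_l2norm

        def forward(self, batch_h):
            h = self.linear(batch_h)
            if self.bn is not None:
                h = self.bn(h)
            h = self.dropout(h)
            if self.act is not None:
                h = self.act(h)
            if self.has_l2norm:
                # row-wise L2 normalization of the output features
                h = F.normalize(h, p=2, dim=1)
            return h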

class HeteroLinearLayer(linear_dict, act=None, dropout=0.0, has_l2norm=True, has_bn=True, **kwargs)[source]

Transform features with nn.Linear. In general, heterogeneous features have different dimensions as input. Even when they have the same dimension, each dimension may carry different semantics. So we use a separate linear layer for each node type to map all node features to a shared feature space.

Parameters

linear_dict (dict) – Key of dict is the node type (node name); value of dict is a list containing the input and output dimensions.

Examples

>>> import torch as th
>>> linear_dict = {}
>>> linear_dict['author'] = [110, 64]
>>> linear_dict['paper'] = [128, 64]
>>> h_dict = {}
>>> h_dict['author'] = th.randn(10, 110)
>>> h_dict['paper'] = th.randn(5, 128)
>>> layer = HeteroLinearLayer(linear_dict)
>>> out_dict = layer(h_dict)

forward(dict_h: dict) → dict[source]

Parameters

dict_h (dict) – A dict of heterogeneous features.

Returns

dict_h – The transformed features.

class HeteroMLPLayer(linear_dict, act=None, dropout=0.0, has_l2norm=True, has_bn=True, final_act=False, **kwargs)[source]

HeteroMLPLayer contains multiple GeneralLinear layers, unlike HeteroLinearLayer, which contains only one.

Parameters

linear_dict (dict) – Key of dict is the node type (node name); value of dict is a list containing the input, hidden, and output dimensions.
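For example, mirroring the HeteroLinearLayer example above (the dimensions are hypothetical):

>>> linear_dict = {}
>>> linear_dict['author'] = [110, 64, 32]   # input, hidden, output
>>> linear_dict['paper'] = [128, 64, 32]
>>> layer = HeteroMLPLayer(linear_dict)
>>> out_dict = layer(h_dict)   # h_dict as in the HeteroLinearLayer example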

forward(dict_h)[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

class HeteroFeature(h_dict, n_nodes_dict, embed_size, act=None, need_trans=True, all_feats=True)[source]

A feature preprocessing component that handles various heterogeneous feature situations.

In general, we face the following three situations.

  1. The dataset has no features at all.

  2. The dataset has features for every node type.

  3. The dataset has features for only some node types.

To deal with these, we implement HeteroFeature. In the respective situations:

  1. We build embeddings for all node types.

  2. We build a linear layer for all node types.

  3. We build embeddings for the node types without features, and a linear layer for the node types that have original features.

Parameters
  • h_dict (dict) – Input heterogeneous feature dict; key of dict is the node type, value of dict is the corresponding feature of that node type. It can be None if the dataset has no features.

  • n_nodes_dict (dict) – Key of dict is the node type; value of dict is the number of nodes.

  • embed_size (int) – Dimension of embedding; also used as the output dimension of the linear layers that transform the original features.

  • need_trans (bool, optional) – A flag to control whether to transform the original features linearly. Default: True.

  • act (callable activation function/layer or None, optional) – If not None, applies an activation function to the updated node features. Default: None.

embed_dict

Stores the embeddings.

Type

nn.ParameterDict

hetero_linear

A heterogeneous linear layer to transform the original features.

Type

HeteroLinearLayer

forward()[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
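A minimal usage sketch for situation 3 above (the dimensions and node counts are hypothetical; forward() is assumed to take no arguments, as documented):

>>> import torch as th
>>> h_dict = {'paper': th.randn(5, 128)}         # only 'paper' has original features
>>> n_nodes_dict = {'paper': 5, 'author': 10}    # 'author' is featureless
>>> feature = HeteroFeature(h_dict, n_nodes_dict, embed_size=64)
>>> out_dict = feature()                         # 64-dim features for both node types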

class MetapathConv(meta_paths_dict, mods, macro_func, **kargs)[source]

MetapathConv is a meta-path-based aggregation function, similar to dgl.nn.pytorch.HeteroGraphConv. We can choose attention, APPNP, or any graph convolution layer to aggregate node features. After that, we obtain embeddings based on the different meta-paths and fuse them.

\[\mathbf{Z}=\mathcal{F}(Z^{\Phi_1},Z^{\Phi_2},...,Z^{\Phi_p})=\mathcal{F}(f(H,\Phi_1),f(H,\Phi_2),...,f(H,\Phi_p))\]

where \(\mathcal{F}\) denotes the semantic fusion function, such as semantic attention, \(\Phi_i\) denotes a meta-path, and \(f\) denotes the aggregation function, such as GAT or APPNP.

Parameters
  • meta_paths_dict (dict[str, list[tuple(meta-path)]]) – Contains multiple meta-paths.

  • mods (nn.ModuleDict) – The aggregation function for each meta-path.

  • macro_func (callable aggregation func) – The semantic fusion function, e.g. 'mean', 'max', 'sum', or 'attention'.

forward(g_dict, h_dict)[source]

Parameters
  • g_dict (dict[str: dgl.DGLGraph]) – A dict of DGLGraphs (full batch) or DGLBlocks (mini batch) extracted by meta-paths.

  • h_dict (dict[str: torch.Tensor]) – The input features.

Returns

h – The output feature dict.

Return type

dict[str: torch.Tensor]
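A construction sketch. The meta-path name, the GATConv aggregator, and the mean-fusion macro_func are hypothetical choices, and the macro_func is assumed to take a list of per-meta-path tensors:

    import torch
    import torch.nn as nn
    import dgl.nn.pytorch as dglnn

    # one meta-path 'APA' on an academic graph (author-paper-author)
    meta_paths_dict = {'APA': [('author', 'ap', 'paper'), ('paper', 'pa', 'author')]}
    # one aggregation module per meta-path
    mods = nn.ModuleDict({'APA': dglnn.GATConv(64, 64, num_heads=1)})
    # fuse the per-meta-path embeddings by averaging
    macro_func = lambda tensors: torch.mean(torch.stack(tensors), dim=0)
    layer = MetapathConv(meta_paths_dict, mods, macro_func)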

class HeteroGraphConv(mods)[source]

A generic module for computing convolution on heterogeneous graphs.

The heterograph convolution applies sub-modules to their associated relation graphs, reading features from the source nodes and writing the updated ones to the destination nodes. If multiple relations have the same destination node type, their results are aggregated by the specified method.

If a relation graph has no edges, the corresponding module will not be called.

Parameters
  • mods (dict[str, nn.Module]) – Modules associated with every edge type. The forward function of each module must have a DGLHeteroGraph object as the first argument, and its second argument is either a tensor object representing the node features or a pair of tensor objects representing the source and destination node features.

  • aggregate (str, callable, optional) –

    Method for aggregating node features generated by different relations. Allowed string values are 'sum', 'max', 'min', 'mean', 'stack'. The 'stack' aggregation is performed along the second dimension, whose order is deterministic. Users can also customize the aggregator by providing a callable instance. For example, aggregation by summation is equivalent to the following:

    def my_agg_func(tensors, dsttype):
        # tensors: a list of tensors to aggregate
        # dsttype: string name of the destination node type for which the
        #          aggregation is performed
        stacked = torch.stack(tensors, dim=0)
        return torch.sum(stacked, dim=0)
    

mods

Modules associated with every edge type.

Type

dict[str, nn.Module]

forward(g, inputs, mod_args=None, mod_kwargs=None)[source]

Forward computation.

Invoke the forward function of each module and aggregate their results.

Parameters
  • g (DGLHeteroGraph) – Graph data.

  • inputs (dict[str, Tensor] or pair of dict[str, Tensor]) – Input node features.

  • mod_args (dict[str, tuple[any]], optional) – Extra positional arguments for the sub-modules.

  • mod_kwargs (dict[str, dict[str, any]], optional) – Extra key-word arguments for the sub-modules.

Returns

Output representations for every type of node.

Return type

dict[str, Tensor]
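A usage sketch in the style of dgl.nn.pytorch.HeteroGraphConv (the toy graph and feature sizes are hypothetical):

>>> import dgl
>>> import dgl.nn.pytorch as dglnn
>>> import torch as th
>>> g = dgl.heterograph({
...     ('user', 'follows', 'user'): ([0, 1, 2], [1, 2, 0]),
...     ('user', 'plays', 'game'): ([0, 1], [0, 1])})
>>> conv = HeteroGraphConv({
...     'follows': dglnn.GraphConv(16, 32),
...     'plays': dglnn.GraphConv(16, 32)})
>>> inputs = {'user': th.randn(3, 16), 'game': th.randn(2, 16)}
>>> outputs = conv(g, inputs)   # dict with updated 'user' and 'game' features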

class ATTConv(ntypes, dim)[source]

It is the macro-level layer of the HetGNN model, presented in Section 3.3.2 "Types Combination" of the paper.

In this framework, to keep the embedding dimensions consistent and make model tuning easy, we use the same dimension d for the content embedding in Section 3.2, the aggregated content embedding in Section 3.3, and the output node embedding in Section 3.3.

So only one dim parameter is given.

Parameters
  • dim (int) – Input feature dimension.

  • ntypes (list) – Node types.

Note

We don't implement a multi-head version.

atten_w is specific to the center node type and agnostic to the neighbor node type.

forward(hg, h_neigh, h_center)[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.

class MacroConv(in_feats: int, out_feats: int, num_heads: int, dropout: float = 0.0, negative_slope: float = 0.2)[source]

Parameters
  • in_feats (int) – Input feature size.

  • out_feats (int) – Output feature size.

  • num_heads (int) – Number of heads in Multi-Head Attention.

  • dropout (float, optional) – Dropout rate. Default: 0.0

forward(graph, input_dst: dict, relation_features: dict, edge_type_transformation_weight: ParameterDict, central_node_transformation_weight: ParameterDict, edge_types_attention_weight: Parameter)[source]

Parameters
  • graph – dgl.DGLHeteroGraph

  • input_dst – dict: {ntype: features}

  • relation_features – dict: {(stype, etype, dtype): features}

  • edge_type_transformation_weight – ParameterDict {etype: (n_heads * hidden_dim, n_heads * hidden_dim)}

  • central_node_transformation_weight – ParameterDict {ntype: (input_central_node_dim, n_heads * hidden_dim)}

  • edge_types_attention_weight – Parameter (n_heads, 2 * hidden_dim)

Returns

output_features – dict, {ntype: features}

class SemanticAttention(in_size, hidden_size=128)[source]

forward(z, nty=None)[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
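SemanticAttention carries no docstring here; a minimal sketch of the HAN-style semantic attention it presumably implements (a projection scores each meta-path, scores are averaged over nodes, softmax-normalized, and used to fuse the per-meta-path embeddings):

    import torch
    import torch.nn as nn

    class SemanticAttentionSketch(nn.Module):
        def __init__(self, in_size, hidden_size=128):
            super().__init__()
            self.project = nn.Sequential(
                nn.Linear(in_size, hidden_size),
                nn.Tanh(),
                nn.Linear(hidden_size, 1, bias=False),
            )

        def forward(self, z):
            # z: (N, M, D) node embeddings stacked over M meta-paths
            w = self.project(z).mean(0)                  # (M, 1) importance per meta-path
            beta = torch.softmax(w, dim=0)               # (M, 1)
            beta = beta.expand(z.shape[0], *beta.shape)  # (N, M, 1)
            return (beta * z).sum(1)                     # (N, D) fused embeddings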

class CompConv(comp_fn, norm='right', linear=False, in_feats=None, out_feats=None, bias=False, activation=None, _allow_zero_in_degree=False)[source]

Composition-based convolution was introduced in Composition-based Multi-Relational Graph Convolutional Networks and is mathematically defined as follows:

\[h_v^{new} = f\left(\sum_{(u,r)\in\mathcal{N}(v)} W_{\lambda(r)}\,\phi(h_u, h_r)\right)\]

where \(\phi\) is the composition operation selected by comp_fn, \(h_u\) and \(h_r\) are the neighbor and relation embeddings, and \(W_{\lambda(r)}\) is a direction-specific weight matrix.

Parameters

comp_fn (str, one of 'sub', 'mul', 'ccorr') – The composition function used to combine node and relation features.

forward(graph, feat, h_e, Linear=None)[source]

Compute composition-based convolution.

Parameters
  • graph (DGLGraph) – The graph.

  • feat (torch.Tensor or pair of torch.Tensor) – If a torch.Tensor is given, it represents the input feature of shape \((N, D_{in})\), where \(D_{in}\) is the size of the input feature and \(N\) is the number of nodes. If a pair of torch.Tensor is given, which is the case for bipartite graphs, the pair must contain two tensors of shape \((N_{in}, D_{in_{src}})\) and \((N_{out}, D_{in_{dst}})\).

  • Linear (a Linear nn.Module, optional) – An optional external linear layer serving as the weight.

  • h_e (torch.Tensor) – The edge-type feature, of shape \((1, D_{in})\).

Returns

The output feature.

Return type

torch.Tensor

Raises

DGLError – Case 1: If there are 0-in-degree nodes in the input graph, it will raise DGLError since no message will be passed to those nodes. This will cause invalid output. The error can be ignored by setting the allow_zero_in_degree parameter to True. Case 2: An external weight is provided while at the same time the module has defined its own weight parameter.

Note

The h_e is a tensor of shape \((1, D_{in})\).

  • Input shape: \((N, *, \text{in_feats})\) where * means any number of additional dimensions, \(N\) is the number of nodes.

  • Output shape: \((N, *, \text{out_feats})\) where all but the last dimension are the same shape as the input.

  • Linear shape: \((\text{in_feats}, \text{out_feats})\).
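A sketch of the three composition operations named by comp_fn (the helper name is hypothetical; ccorr is circular correlation, computed here via FFT):

    import torch

    def composition(h_u, h_r, comp_fn='sub'):
        if comp_fn == 'sub':
            return h_u - h_r
        if comp_fn == 'mul':
            return h_u * h_r
        if comp_fn == 'ccorr':
            # circular correlation: IFFT(conj(FFT(a)) * FFT(b))
            return torch.fft.irfft(
                torch.conj(torch.fft.rfft(h_u)) * torch.fft.rfft(h_r),
                n=h_u.shape[-1])
        raise ValueError(comp_fn)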

class AttConv(in_feats: tuple, out_feats: int, num_heads: int, dropout: float = 0.0, negative_slope: float = 0.2)[source]

Attention-based convolution was introduced in Hybrid Micro/Macro Level Convolution for Heterogeneous Graph Learning.

forward(graph: DGLHeteroGraph, feat: tuple, dst_node_transformation_weight: Parameter, src_node_transformation_weight: Parameter, src_nodes_attention_weight: Parameter)[source]

Compute a graph attention network layer.

Parameters
  • graph – The specific relational DGLHeteroGraph.

  • feat (pair of torch.Tensor) – The pair contains two tensors of shape \((N_{in}, D_{in_{src}})\) and \((N_{out}, D_{in_{dst}})\).

  • dst_node_transformation_weight – Parameter (input_dst_dim, n_heads * hidden_dim)

  • src_node_transformation_weight – Parameter (input_src_dim, n_heads * hidden_dim)

  • src_nodes_attention_weight – Parameter (n_heads, 2 * hidden_dim)

Return type

torch.Tensor of shape \((N, H, D_{out})\), where \(H\) is the number of heads and \(D_{out}\) is the size of the output feature.

class LSTMConv(dim)[source]

Aggregates the neighbors with an LSTM.

reset_parameters()[source]

Reinitialize learnable parameters.

Note

The LSTM module uses the Xavier initialization method for its weights.

forward(g, inputs)[source]

Defines the computation performed at every call.

Should be overridden by all subclasses.

Note

Although the recipe for forward pass needs to be defined within this function, one should call the Module instance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
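A minimal sketch of LSTM-based neighbor aggregation in the spirit of GraphSAGE's LSTM aggregator (the dimensions and the use of the final hidden state are assumptions):

    import torch
    import torch.nn as nn

    dim = 16
    lstm = nn.LSTM(dim, dim, batch_first=True)
    neigh = torch.randn(3, 5, dim)      # 3 nodes, each with 5 sampled neighbors
    out, (h_n, c_n) = lstm(neigh)       # run the LSTM over each neighbor sequence
    agg = h_n.squeeze(0)                # (3, dim): one aggregated vector per node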