Instructions
Upload Sequence File
Add RefMet Information
Clean Lipid Names
Remove Unannotated Features
Edit Data Columns
Group Nicknames
Only use letters and numbers.
Download Sequence File
Median Across Samples
Median Across QCs
Class Plot
Select which column plot to display:
Violin Plot
Median Across Groups
Principal Component Analysis
- Check log-transfomed checkbox if data is already log-transformed.
PCA 1
Scree plot 1
PCA 2
Scree plot 2
Select Feature
Settings
K-means Analysis Information
- Evaluate cluster validity using methods silhouette, WSS, or gap statistic.
- Optionally, restrict clustering to selected groups for enhanced analysis.
- Results are visualized with plots and an outlier table displaying potential anomalies.
Data & Evaluation Selection
K-means Parameters
K-means Results
Hierarchical Clustering Analysis Information
- Hierarchical clustering builds a dendrogram based on the distance between samples.
- Users can specify the clustering method, number of clusters, and dendrogram threshold.
- It supports group selection to focus on subsets of the data.
- Dendrogram and cluster plots along with outlier tables help identify unusual samples.
Data Selection
Clustering Parameters
Hierarchical Results
DBSCAN Analysis Information
- DBSCAN is a density-based algorithm that groups samples based on local point density.
- Requires setting epsilon(eps) and minimum points (minPts) parameters to define cluster boundaries.
- Filters out samples with low density as potential outliers.
Data Selection
DBSCAN Parameters
DBSCAN Results
HDBSCAN Analysis Information
- HDBSCAN extends DBSCAN by converting it into a hierarchical clustering approach.
- Uses a minimum points parameter and an outlier threshold to classify data points.
- Allows selection of groups to refine the clustering process.
- Plots and outlier tables provide a detailed view of anomalous samples.
Data Selection
HDBSCAN Parameters
HDBSCAN Results
OPTICS Analysis Information
- OPTICS orders samples based on reachability distances to reveal cluster structure.
- Generates reachability and threshold plots for visualizing cluster boundaries.
- Parameters such as eps, minPts, and eps cl can be adjusted to fine-tune analysis.
- The results help identify outliers and are displayed through multiple visualizations.
Data Selection
OPTICS Parameters
OPTICS Results
LOF Analysis Information
- LOF measures the local density deviation of a sample relative to its neighbors.
- Uses a threshold and a k parameter to compute an outlier score for each sample.
- Highlights samples with significantly lower local density as outliers.
- Interactive LOF plots and a summary table help in interpreting and identifying anomalies.
Data Selection
LOF Parameters
LOF Results
Heatmap Analysis
- Heatmaps provide a visual representation of high-dimensional data and clustering patterns.
- Select the desired dataset and, optionally, specific groups for comparison.
- Customize clustering, labeling, and display options to best reveal data patterns.
Heatmap Controls
Heatmap Customizations
Heatmap Plot
Heatmap Results
Circular Barplot Information
- Circular barplots display ranked features in a visually engaging, circular format.
- Select the top features and define groupings to compare differences effectively.
- Customize colors and labels to enhance the visual interpretation of the data.
Circular Barplot Settings
Circular Barplot
User guide
- Select the data frame to use.
- Click 'Run Data Processing' to start the analysis.
- After data processing select the Group of Interest and Reference Groups.
- Go to the 'Lipid Visualization' tab set thresholds and see the results
- Have fun!
Data Frame and group selection:
Group of Interest Table
Reference Group Table
Plot Settings
Settings
Volcano Plot Information
- Volcano plots highlight statistically significant changes by plotting fold change versus significance.
- Select a dataset, define two distinct groups, and set thresholds for p-values and fold changes.
- Customize colors to easily distinguish between upregulated, downregulated, and non-significant features.
Data & Group Selection
Volcano Plot Customization
Individual Feature Selection
Group Selection
Volcano Plot & Results
Odds Ratio plot Information
- Odds ratio analysis quantifies the association between features and group differences.
- Transform the data and apply logistic regression to compute odds ratios with confidence intervals.
- Visualize the results with plots and tables to interpret significant associations clearly.
Data & Group Selection
OR Plot
OR Results Table
Pathway Enrichment Information
- Pathway enrichment analysis identifies over-represented biological pathways in your data.
- Set appropriate thresholds and select the correct groups to reveal significant pathways.
Data & Identifier Selection
Enrichment Settings
Network Graph Customizations
Data summary
Enrichment Barplot and Dotplot
Enrichment Network Graph
Enrichment Table
Guide
Start by selecting the test type:
- 2 groups (unpaired): compare the means of two independent or unrelated groups to determine if there is a statistically significant difference between them.
- 2 groups (paired): compare the means of two related groups to see if their average difference is significantly different from zero. Used when the same subjects are tested under two different conditions (e.g., before and after a treatment).
- 2 groups with time (paired): used to analyze the changes within the same group over different times or conditions, assessing if there is a consistent effect across these points.
- Compare to reference group: compare the mean of all groups against a reference group. Can be used to determine if the groups significantly differ from the expected performance or baseline.
PolySTest
Usage of PolySTest is recommended for data with few replicates and high amounts of missing values.
Local test
Export to PolySTest
Results
.csv and .xlsx
Export to other apps
Statistical testing
Clustering
.csv for MetaboAnalyst
WELCOME TO MetaboLink
An interactive open source software tool for rapid data correction of untargeted metabolomics data.
MetaboLink is a graphical user interface which is allowing the usage of different complex functions to an uploaded dataset. The application is designed to be able to accept datasets with columns containing information not related to this part of the workflow and the functions can be run independently of each other and in any order as long as the requirements of the functions are met.
WHAT YOU CAN DO WITH MetaboLink:
- Remove features based on intensity between blank and QC samples.
- Remove features based on missing values.
- Normalize features based on internal standards.
- Impute missing values with different functions .
- Correcting signal drift using QC samples.
- Merge datasets with different ionmode and remove duplicates.
- Perform statistical analysis.
To start, upload a dataset or press "Load example" for a sample dataset to be loaded into the application.
Remember to save any changes you apply to the datasets.
Input
Data fileComma-separated values (CSV) file with samples in columns and features in rows. If samples are in rows, select file format "Samples in rows".
> If you get an error "Invalid multibyte string at ..." after uploading, your file might have invalid characters. We recommend you only include English letters, underscore, and numbers for naming.Sequence file (metafile)
After uploading the datafile, your dashboard will update and open the sequence panel. Here you will be able to upload a CSV file which works as an ID for the main table. You can read the requirements in the user manual.
Detailed user manual: user manual
Source code available here: GitHub - anitamnd/MetaboLink
This version might still have the occasional crash, if you are experiencing trouble please reach out: send email
Molecular Metabolism and Metabolomics | SDU