# Basic Module

# Area

  • Introduction

    The area chart displays graphically quantitative data. It is based on the line chart. The area between axis and line are commonly emphasized with colors, textures and hatchings.

  • Analysis of case data

    The loaded data are xaxis.value and yaxis.value.

  • Interpretation of case statistics graphics

    Different colors represent different groups of area charts.

# Barcode Plot

Introduction

Suitable for displaying the distribution of large amounts of data.

  • Case Data Analysis

    The case data represents the sales revenue of a certain product in 500 stores in three regions.

    Through the barcode plot, we can observe that the number of stores with sales revenue around 50 is relatively high in Guangdong and Fujian regions. Additionally, the sales revenue among stores in Guangdong shows less variation, indicating a more concentrated distribution.

  • Special Parameters

    title: Title

    x_label: Name of the x-axis

    y_label: Name of the y-axis

    width: Width of the bars

    height: Height of the bars

    color: Color of the bars

# 3D Barplot

  • Introduction

    3D bar charts are used to provide a 3D look and feel for the data. The third dimension is often used for aesthetic reasons, but it does not improve data reading. Still intended to show comparisons between discrete categories.

  • Analysis of case data

    The loaded data are efficacy data of different doses of drugs in different treatment regimens.

  • Interpretation of case statistics graphics

    The 3D histogram shows the different therapeutic effects of different treatment groups (groups 1 to 4) with low, medium and high doses. Group 1 had the best effect with medium dose treatment, group 2 had the best effect with high dose treatment, group 3 had no significant difference with dose treatment, and group 4 had the best effect with high dose treatment.

  • Extra Parameters

    Phi:The larger the positive value is, the more it tends to observe from the top down. The greater the negative value, the more likely it is to be observed from the bottom of the three-dimensional figure upwards.

    Theta:The Angle at which a three-dimensional figure rotates horizontally.

# Barplot Color Group

The color group barplot can be used to display data values in groups, and to label different colors in sequence, which is often used to visualize the results of GO/KEGG pathway enrichment analysis.

  • Data table (three columns):

    Term | Entry name, such as GO/KEGG channel name

    Count | The numerical size of the entry, such as the number of genes enriched in a pathway

    Type | Category to which this channel belongs: such as BP/MF/CC/KEGG

  • Extra parameters:

    Display number | Used to control the maximum number of entries displayed in each group in the histogram. If set to 4, only the first 4 records of each group in Type will be taken

# Barplot Errorbar2

  • # Function Introduction

    Bar plot with error-lines and groups.

# Barplot Errorbar

  • # Function Introduction

    Bar plot with error-lines and groups.

  • # Data Structure

    Data frame.

    <1st-col>: (Numeric) values as Y-axis.

    <2nd-col>: (Numeric or String) classes as X-axis.

    <3rd-col>: (String) groups as colors and legend.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Errorbar width: error bar width

    Bar border color: error bar border color

    Other notes

    P-values for two columns of data are calculated by comparison with all samples

    P-values for three columns of data are whether there is a difference within the group

# Gradient Barplot

  • Introduction

    It is similar to the bubble chart, but on the basis of the histogram, a color gradient rectangle is used to simultaneously display the visualization of two variables.

  • Case data analysis

    The first column is Go Term (Go language code), the second column is the number of genes, and the third column is pvalue.

  • Graphic interpretation of case statistics

    As shown in the figure, blue is a low pvalue color, and red is a high pvalue color. As the pvalue increases, the color changes from blue to red. The abscissa indicates the number of genes.

  • Extra parameters

    Number of impressions: show the number of Go IDs in Go Term

    Horizontal: When it is turned on, the ordinate is Go id, the abscissa is a value, and the graph is a rectangle arranged horizontally. When it is closed, the horizontal and vertical coordinates are reversed, and the graph is a rectangle arranged vertically.

    Low value color: indicates the color displayed by low pvalue

    High value color: indicates the color displayed by high pvalue

# barplot-line-multiple

  • # Function Introduction

    Displaying multiple bar or line plot in one diagram.

  • # Data Structure

    Data frame with multiple columns data (Numeric).

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Plot type: the type of image (line or bar chart)

    Line size: the width of the line corresponding to the line graph

    Point size: the size of the data point

# Barplot

  • Introduction

    The bar chart is a statistical chart showing the size of data values in terms of the length of the rectangle.

  • Analysis of case data

    The loaded data are efficacy data of different doses of drugs in different treatment regimens.

  • Interpretation of case statistics graphics

    Bar charts are used to display category data with rectangular bars whose height or length is proportional to the value they represent. Bar charts can be drawn vertically or horizontally. The bar chart shows the comparison between the discrete categories. One axis of the chart shows the specific categories to be compared, and the other axis represents the measurements. Some bar charts show bars that can also show the values of multiple measurement variables. The bar chart shows the different effects of low, medium, and high doses in different treatment groups (groups 1 to 4). Group 1 had the best effect with medium dose treatment, group 2 had the best effect with high dose treatment, group 3 had no significant difference with dose treatment, and group 4 had the best effect with high dose treatment.

  • Extra Parameters

    Bar Type:Dodge means horizontal listing and stack means top - down stacking

    Horizon:Swap the position of the category axis with that of the measurement axis

    Add Numeric Lable:Add numeric lable

# Beanplot

  • Introduction

    The beanplot is a method of visualizing the distribution characteristics.

  • Analysis of case data

    The loaded data is data set (data on treatment outcomes of different treatment regimens).

# Beeswarm

  • Introduction

    The beeswarm is a noninterference scatter plot which is similar to a bee colony.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    Different colors represent different groups, and dots represent data.

# Corrplot Big Data

  • Introduction

    The correlation heat map is a graph that analyzes the correlation between two or more variables.

  • Analysis of case data

    The loaded data are the gene names and the expression of each sample.

  • Interpretation of case statistics graphics

    Red indicates positive correlation between two genes, blue indicates negative correlation between two genes, and the number in each cell indicates correlation coefficient.

# Bivariate

  • Introduction

    Display multiple bi-variables relationship.

# Box plot

  • Introduction

    The box plot is a method of visualizing the distribution characteristics of a set of data by means of a quartile graph.

  • Analysis of case data

    The loaded data is data set (data on treatment outcomes of different treatment regimens).

  • Interpretation of case statistics graphics

    The abscissa represents several different sets of data, and the ordinate represents the quartile of each set of data respectively. The upper, middle and lower horizontal lines of the box represent the upper, median and lower quartile respectively; The values represented by the upper and lower line segments respectively exponential the maximum and minimum values of the data, and the points outside the box represent outliers. The above figure indicates the P value between two variables. It can be considered that in treatment plan 1, there is a significant difference in efficacy between the middle-dose group and the low-dose group, and so on.

  • Extra parameters

    P Value Form:Value is a numeric value, signif is the number of "*" and no is not showing the P value

    Add Points:Mark out the points represented by numerical value

# Bubble

  • Introduction

    The bubble chart is a statistical chart that shows the third variable by the size of the bubble on the basis of the scatter chart, so that the three variables can be compared and analyzed simultaneously.

  • Analysis of case data

    The loaded data are GO Term,Gene Ridio, Gene count and P-value.

  • Interpretation of case statistics graphics

    The x-axis represents Gene Ridio, and the y-axis is GO Term; The size of the dot represents the number of genes, and the color of the dot represents the high or low P value.

  • Extra Parameters

    Show Numbers:Displays the number of bubbles

    Color Theme:Displays the color theme for the P value

# Bumpchart

  • Introduction

Bump chart can be used to display the change of grouped values.

# Calibration Curve

  • # Function Introduction

    The calibration curve is used to evaluate the consistency / calibration, i.e. the difference between the predicted value and the real value

  • # Data Structure

    Data frame of multi columns data (Numeric allow NA). i.e the survival data (status with 0 and 1).

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Model: model calculation method (default is linear fitting model)

    Cmmethod: the method to check the model (km algorithm by default)

    Method: calibration method (boot method by default)

    Time (day): time, support time unit is day (default 365 days)

# Chi-square-fisher Test

  • Introduction

    Chi-square and Fisher test can be used to test the frequency difference of categorical variables. The tool will automatically select the statistical method of Chi-square and Fisher exact test.

  • Data Table

    The data table supports two formats: contingency table (example 1) and single-row record table (example 2)

  • Extra parameters

    Loop Method | The tool will generate all permutations and combinations that can be compared, and then test the differences between the two groups through traversal, optionally traversing by the rows and columns of the contingency table

    Add Mean | Add mean to test

    Add Median | Add median to test

# Chord

  • # Function Introduction

    The complex interaction is visualized in the form of chord graph.

  • # Data Structure

    Data frame or matrix of interaction of genes with pathways or gene ontologys.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Dist Name: distance between label and chord graph

    Circle width: the width of the circle border of a chord graph

    Label distance: the distance of the label

    Link visible: is the link shown in the chord diagram

    Scale: is the chord graph displayed according to the proportion of free scaling of elements

  • Circle Packing

    • Introduction

      Circle packing is a visualization method used to display the differences in quantity among different categories.

    • Special Parameters

      add_label: Whether to add labels inside the circles.

      alpha: Transparency of the circles' colors.

      min_text_size: Minimum font size for labels.

      max_text_size: Maximum font size for labels.

# Circular Barplot (opens new window)

  • Introduction

    The circular barplot is used to display the data values with circular bars across discrete categories for comparisons.

  • Analysis of case data

    The loaded data have three columns, with the first represent individual items, the second for different groups and the third for data values.

  • Interpretation of case statistics graphics

    As shown in the example figure, four different groups with every individual items are showed with circular bars. Each bars are colored by their groups, and the data values and item names were labeled on the end of the bars.

  • Extra Parameters

    Sort Value: whether the data values in each group need to bed sorted, default is not sorted.

    Label: whether the item names are labeled on the bars, default is labeled.

    Gapbar Num: set the number of gapped bars for separate each groups.

    Label Size: set the font size of labels.

# Circular Pie Chart

  • Introduction

    Another form of the pie chart.

  • Special Parameters

    show_label: Whether to display labels on the ring chart.

    width: The width of the strips.

    bottom_text: Bottom text.

    label_size: Font size of the labels. ComplexHeatmap

  • Introduction

    A multi-omics plugins to draw heatmap, meta annotation, and mutations.

  • Data parameter

    The first is the start of mutation. The second is the start of heatmap matrix.# Connected Scatterplot

  • Case Data Analysis

    The case data consists of the number of newborns named Alice and Anna from 1991 to 2017. The scatterplot is created based on the data, and the data points are connected in chronological order.

  • Special Parameters

    label_ratio: Proportion of labels displayed in the plot, where 0 means no labels are shown, and 1 means all labels are displayed.

    line_color: Color of the connecting lines.

    arrow_size: Size of the arrows. 0 means no arrows will be drawn.

    label_size: Size of the labels.

# Contour(Matrix)

  • Introduction

    The contour map (matrix) is a graph that displays three-dimensional data in a two-dimensional form

  • Analysis of case data

    The loaded data is a matrix.

  • Interpretation of case statistics graphics

    Yellow represents the highest, dark purple represents the lowest, the height scale range is 90-190.

# Contour (XY)

  • Introduction

    Contour plot (XY) is a data processing method that reflects data density through contour line.

  • Analysis of case data

    The loaded data are two variables.

  • Interpretation of case statistics graphics

    Just as contour lines in geography represent different heights, different contour lines in contour maps represent different densities. The closer to the center, the smaller contour loop is, and the higher the regional data density is. For example, the data density of yellow area is the highest, while that of blue area is the lowest.

# Corelation Heatmap

  • Introduction

    The correlation heat map is a graph that analyzes the correlation between two or more variables.

  • Analysis of case data

    The loaded data are the gene names and the expression of each sample.

  • Interpretation of case statistics graphics

    Red indicates positive correlation between two genes, blue indicates negative correlation between two genes, and the number in each cell indicates correlation coefficient.

  • Extra parameters

    Correlation : Row means correlation calculation by row, COL means correlation calculation by column.

    Half: Upper for the image on the upper left, Lower for the image on the lower right, and full for the panorama.

    Shape:Circle means circles and square means squares.

    Reorder:Sort by the resulting correlation coefficient.

# Corrplot

  • Introduction

    The correlation heat map is a graph that analyzes the correlation between two or more variables.

  • Analysis of case data

    The loaded data are the gene names and the expression of each sample.

  • Interpretation of case statistics graphics

    Red indicates positive correlation between two genes, blue indicates negative correlation between two genes, and the number in each cell indicates correlation coefficient.

# Custom Heatmap

Introduction

Custom Heatmap, directly plot a heatmap based on the given data.

  • Case Data Analysis

    The case data is a grayscale image of the official logo of hiplot.org.

  • Table Data Description

    The first row represents the column headers, and each subsequent row contains specific data.

    The title of the first column is name, please do not modify this column header (name), each subsequent row represents the row names.

  • Special Parameters

    low_color: Color for lower values.

    high_color: Color for higher values.

    text_size: Font size for row and column names.

    circle_size: Size of circles in the heatmap.

# Custom Icon Scatter Plot

Introduction

A scatter plot with customizable icons.

  • Parameter Description

    x: x-axis coordinates

    y: y-axis coordinates

    size: size of points (icons)

    label_name: legend name

    theme: plot theme

    symbol: optional preset symbols

    svg_path: custom SVG drawing path, e.g. M150 0 L75 200 L225 200 Z to draw a triangle. Note: When defining this parameter, make sure to leave 'symbol' empty.

# D3 Wordcloud

  • Introduction

    The word cloud is to visualize the "keywords" that appear frequently in the web text by forming a "keyword cloud layer" or "keyword rendering".

  • Case data analysis

    Load data nouns and noun frequencies.

  • Graphic interpretation of case statistics

    The icon shows a heart-shaped picture, showing its proportion in the word cloud according to the frequency of nouns.

  • Extra parameters

    Gradient mode: change the color to gradient

    Gradient dark: nouns with low word frequency are dark

    Gradient highlight color: nouns with high word frequency are light colors

# Dendrogram

  • Introduction

    The dendrogram is a diagram representing a tree. This diagrammatic representation is frequently used in different contexts:In hierarchical clustering, it illustrates the arrangement of the clusters produced by the corresponding analyses.

    In computational biology, it shows the clustering of genes or samples, sometimes in the margins of heatmaps.

  • Analysis of case data

    The loaded data are gene name and corresponding gene expression value.

  • Interpretation of case statistics graphics

    The figure is like a tree perpendicular to 90°. M1-M10 represents the root, and the branches grow and converge to the right to form the trunk of the branch.It can be seen that the figure generates 2 classes, 4 classes, and so on.

  • Extra Parameters

    Clade:

    Distance Measure:

    Type of tree:Change the shape of the tree

    Add Lable Color:After clustering, samples with similar names and colors are similar

# Mirror Density & Histogram

  • Introduction

    The mirror density & histogram is a graph used to observe the distribution of continuous variables in two side view: top and bottom## Density-Histogram

  • Introduction

    Displaying the data distribution based on density or histogram.

# Density

  • Introduction

    The kernel density map is a graph used to observe the distribution of continuous variables.

  • Analysis of case data

    The loaded data is the data set (data on treatment outcomes of different treatment regimens).

  • Interpretation of case statistics graphics

    Different colors represent different groups, with values on the horizontal axis and frequencies on the vertical axis.

  • Extra parameters

    Stack:

# Deviation plot

Introduction

Deviation plot provides a visual representation of the differences between data points.

  • Case Study Data Analysis

    The case study data is from the mtcars dataset, where the key is name (car model) and the value is mpg (miles per gallon) for plotting.

  • Interpretation of Case Study Statistical Graph

    Compute the z-score for mpg, sort the values, and categorize them based on the boundary of 0. The left side represents the low-efficiency group, while the right side represents the high-efficiency group.

  • Special Parameters

    sort_method: Specify whether to sort in ascending or descending order.## Diffusion Map

  • Introduction

    Diffusion Map is a nonlinear dimensionality reduction algorithm that can be used to visualize developmental trajectories.## Diverging scale

  • Introduction

    The diverging scale is a graph that maps a continuous, quantitative input to a continuous fixed interpolator.

  • Analysis of case data

    The first column is a list of model names, and the remaining columns enter the relevant indicators and corresponding values.

  • Interpretation of case statistics graphics

    Hp data is shown on the horizontal axis, model names (classification) are shown on the vertical axis, models above average are shown in red, and models below average are shown in blue. Data is assigned on a scale of 2 by size.

  • Extra parameters

    Type: Lollipop represents the drawing of sliding beads on both sides of the central axis;Bar represents a bar graph drawn on both sides of the central axis.

    Scale: When scale is turned on, the data is normalized and displayed on a scale of ±2; when scale is turned off, the vertical slide ball chart and histogram are displayed.

    Line size: As the right pull value increases, the red and blue lines extending from the vertical axis become thicker.

    Point size: As the value of the right pull increases, the point of the horizontal and vertical values at the end of the red and blue lines becomes larger.

    Color: indicates the line color, the default red is the positive line color, and blue is the negative line color.

# Donut

  • Introduction

    The donut is a variant of the pie chart, with a blank center allowing for additional information about the data as a whole to be included. Doughnut charts are similar to pie charts in that their aim is to illustrate proportions.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    Different colors represent different groups and different areas represent data and proportion.

# Dotchart

  • Introduction

    Sliding bead chart is a graph of beads sliding on a column. It is the superposition of bar chart and scatter chart.

  • Analysis of case data

    The loaded data are gene names and their corresponding gene expression values and groups.

  • Interpretation of case statistics graphics

    Each color represents a different grouping, so that the differences in gene expression values can be intuitively understood.

# Dual Y Axis Chart

  • Introduction

The dual Y-axis graph can put two groups of data with larger orders of magnitude in the same graph for display.

  • Analysis of case data

The loaded data is divided into three columns, the first column is the value of the x-axis, the second column is the value of the left Y-axis, and the third column is the value of the right Y-axis.

  • Interpretation of case statistics graphics

In the case data, the Y-axis scale on the left is in the range of 0-100, while the Y-axis scale on the right is 0-1000.

  • Extra Parameters

Coefficient: used for scaling of the second Y-axis scale

Color1: The color of the first polyline

Color2: The second discounted color

# Dumbbell

  • Introduction

Dumbbell chart can be used to display the change of values.

# Easy Pairs

  • Introduction

    Display a matrix of plots for viewing correlation relationship and distributions of multiple variables.

# SOM Model Vis

  • Introduction

    Establish the SOM model and conduct the visulization.

# Eulerr

  • Introduction

    The proportional Venn diagram is a Venn diagram that can display the relationship between sets in proportion.

  • Case data analysis

    The first column represents different variables and the intersection relationship between variables. The intersection relationship is connected with the "&" symbol, and the second column indicates the value of the single variable or the intersection between variables.

  • Case graphic analysis

    Consistent with the Venn diagram, the area of the single variable or the intersection area of the variable is proportional to its value.

# Extended Scatter Plot

  • Introduction

    An extended scatter plot adds marginal plots to the basic scatter plot to provide a more comprehensive view of the data distribution.

  • Special Parameters

    add_rug: Whether to add rug marks at the edges.

    rug_alpha: Transparency of the rug marks.

    rug_size: Size of the rug marks.

    rug_color: Color of the rug marks.

    marginal_type: Type of marginal plots, options include: histogram, boxplot, violinplot, qqplot, ppplot, densigram.

    marginal_fill: Fill color of the marginal plots.

    marginal_color: Border color of the marginal plots.

    marginal_size: Line width of the marginal plots.

    marginal_bins: Number of bins in the marginal plots.

# Cox Models Forest

  • Introduction

    Cox model forest is a visual representation of a COX model that constructs a risk forest map to facilitate variable screening

  • Analysis of case data

    The loaded data are time,status and multiple variable factor.

  • Interpretation of case statistics graphics

    The first column of the table shows the variables and sample numbers, the second column shows the forest plot, and the third column shows the CI 95% confidence interval range, its mean and P values.

    Forest map interpretation.

    The middle vertical line represents the invalid line, the horizontal line represents the 95% confidence interval of the variable factor, and the length represents the magnitude of the confidence interval. If the confidence interval of a variable factor intersects the invalid line, the variable factor is considered to have no statistical significance, and the position of the square is the point estimation of HR.

    Age is the control variable.

    The incidence of the ph.ecog factor is greater than that of the age factor, and the ph.ecog factor increases the occurrence of survival (P<0.001, statistically significant).

    The incidence of sex is less than that of age, and sex reduces the incidence of survival (P =0.002<0.05, statistically significant).

  • Extra parameters

    Merge Models:Integrate multiple variable factors into an icon.

    Drop Controls:Remove the row of the age variable in the diagram.

    Add Caption:Mark the title below the icon to the right.

# Fan plot

  • Introduction

    The pie chart is a statistical chart designed to clearly show the percentage of each data group by the size of the pie.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    Different colors represent different groups and different areas represent data and proportion.

  • Extra Parameters

    Show Ratio:Show the proportions next to the different groups.

# Fishplot

Detail: https://github.com/chrisamiller/fishplot

  • Data Table

    timepoints: specifying the timepoints for each column of the matrix

    parents: specifying parental relationships between clones

    samplename: samplename (support multiple samples)

    other: a numeric matrix containing tumor fraction estimates for all clones at all timepoints

  • General parameters

    Color Palette: control background colors

    Color Palette2: control clone colors

  • Extra parameters

    Shape: the type of shape to construct the plot out of

    Background Type: a string giving the background type

    Time Unit: time unit added to vline axis

    Border Width: a numeric width for the border line around this polygon

    Padding Pecent: the amount of "ramp-up" to the left of the first timepoint. Given as a fraction of the total plot width. Default 0.2

    Vline: add timepoints vline

    Clone Label: add clone name

    Fix Missing: whether to "correct" clones that have zero values at timepoints between non-zero values. (the clone must still have been present if it came back)

    Vline of Color: Color of timepoints vline

    Border of Color: Color of border

# Flowerplot

  • Introduction

    The flowerplot is used to display all possible logical relations between a finite collection of different sets .

  • Analysis of case data

    The loaded data is a multiple column data frame with different sets .

  • Extra Parameters

    angle: set the angle of rotation in degress.

    a: set the radii of the ellipses along the x-axes.

    b: set the radii of the ellipses along the y-axes.

    r: set the radius of the central circle.

    ellipse_col_pal: set the color palette for filling the ellipse.

    circle_col: set the color for filling the central circle.

    lab_size: set the label text cex.

# Funnel Plot (Metafor)

  • Introduction

    Can be used to show potential bias factors in Meta-analysis.

# Funnel Plot

  • Introduction

    Can be used to show potential bias factors in Meta-analysis.

# Gantt

  • Introduction

    The Gantt chart is a type of bar chart that illustrates a project schedule.

  • Analysis of case data

    Data were loaded for 4 samples (4 patients), 3 items (3 treatments), and the start and end times of each treatment.

  • Interpretation of case statistics graphics

    The horizontal axis represents the time schedule, the vertical axis represents 4 patients, and the 3 colors represent 3 treatments. The figure can observe the time schedule of different treatments for each patient.

# Gene Density (opens new window)

  • Introduction

    The gene density is used to display the distribution of gene density among different chromosomes with the colored heat map and area graph.

  • Analysis of case data

    The loaded data have two files, the first file contain the chromosome information with three columns: chr, start, end, the second file contain the gene information with three columns: chr, start, end.

  • Interpretation of case statistics graphics

    As shown in the example figure, gene density across every chromosomes are displayed with colored heat map and combined area graph.

  • Extra Parameters

    Windows: set the window size to calculate genomic density.

  • Reference Packages

    circlize: (Maintainer: Zuguang Gu z.gu@dkfz.de)

    ComplexHeatmap: (Maintainer: Zuguang Gu z.gu@dkfz.de)

    gtrellis: (Zuguang Gu z.gu@dkfz.de)

# Gene Ranking Dotplot (opens new window)

  • Introduction

    The gene ranking dotplot is used to display the differentially expressed genes with sorted ranking by the log2foldchange and pvalues with colored dot chart.

  • Analysis of case data

    The loaded data have three columns, with the first represent gene names, the second for log2FC and the third for pvalues.

  • Interpretation of case statistics graphics

    As shown in the example figure, the x-axis represents the ranking of every differentially expressed genes, the y-axis shows the log2foldchange, the dot is colored by the pvalues and the dot size represents the log2foldchange.

  • Extra Parameters

    log2FC: set the threshold of log2foldchage, default is 1.

    TopN: set the number of labeled top genes.

    Label Size: set the font size of labels.

# Gene Cluster Trend (opens new window)

  • Introduction

    The gene cluster trend is used to display different gene expression trend with multiple lines showing the similar expression patterns in each cluster.

  • Analysis of case data

    The loaded data are a gene expression matrix with each row represent a gene and each column represent a time-point sample.

  • Interpretation of case statistics graphics

    As shown in the example figure, the genes are clustered into different groups, with each group showing similar expression patterns across different time-points. The average expression trend is highlighted in each cluster.

  • Extra Parameters

    Cluster Num: set the number of desired cluster.

    Draw Cluster Centre Line: whether draw the centre line of the average expression in every cluster.

    Threshold: set the threshold for excluding genes. If the percentage of missing values (indicated by NA in the expression matrix) is larger than thres, the corresponding gene will be excluded.

    MinStd: set the threshold for minimum standard deviation. If the standard deviation of a gene's expression is smaller than min.std the corresponding gene will be excluded.

  • Reference Packages

    Mfuzz: (Maintainer: Matthias Futschik matthias.futschik@sysbiolab.eu)

# Dist plot

  • Introduction

    The dist plot is a visual diagram using a confidence distribution.

  • Analysis of case data

    The loaded data are five conditions and their corresponding values.

  • Interpretation of case statistics graphics

    The diagram shows the confidence distribution of the mean under the conditions, and the approximate distribution of the corresponding values under the five conditions can be seen.

# GGPIE

  • Introduction

    The pie chart is a statistical chart that shows the proportion of each part by dividing a circle into sections.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    In a circle graph, the arc length of each slice (the arc length of its center Angle and the region corresponding to its center Angle) is proportional to the number represented. The pie chart shows the number of samples for the 1 to 4 components and the corresponding proportions. The number of samples in one group is 13, accounting for 11.71%; the number of samples in two groups is 34, accounting for 30.63%; the number of samples in three groups is 21, accounting for 18.92%; and the number of samples in four groups is 43, accounting for 38.74%.

# ggpubr-boxplot

Try plotting and if you have any problem with the parameters, check https://rpkgs.datanovia.com/ggpubr/reference/ggboxplot.html (opens new window) or internet materials.## Seqlogo

  • Introduction

    The sequence LOGO is a graphic that describes a sequence pattern of binding sites.

  • Analysis of case data

    The loaded data is the Sequence of binding sites for multiple transcription factors on multiple genes.

  • Interpretation of case statistics graphics

    A sequence of binding sites was displayed in a column of the chart by means of BITS calculation, which could clearly observe the large proportion of bases of different sequences.

  • Extra parameters

    Seqtype: DNA represents DNA sequence, RNA represents RNA sequence, aa represents amino acid sequence.

    Color Scheme: Different colors of base

    Method: BITS represented the proportion of different bases in the vertical coordinate axis with the maximum value of 2 by using the BIT formula;Prob represents the proportion of bases in the total base of the sequence, which is displayed in the vertical axis with the maximum value of 1..

# ggplot2 Wordcloud

  • Introduction

    The word cloud is to visualize the "keywords" that appear frequently in the web text by forming a "keyword cloud layer" or "keyword rendering".

  • Case data analysis

    Load data nouns and noun frequencies.

  • Graphic interpretation of case statistics

    Display the proportion of nouns in the word cloud graph according to the frequency of nouns.

  • Extra parameters

    Minimum rotation: the minimum clockwise rotation angle of the entry

    Maximum rotation: the maximum clockwise rotation angle of the entry

    Font size range: limit the font size of entries

    Word spacing: font spacing of entries

# GOBar Plot (opens new window)

  • Introduction

    The gobar plot is used to display Z-score coloured barplot of terms ordered alternatively by z-score or the negative logarithm of the adjusted p-value.

  • Analysis of case data

    The loaded data are the results of GO enrichment with seven columns: category, GO id, GO term, gene count, gene name, logFC, adjust pvalue and zscore.

  • Interpretation of case statistics graphics

    As shown in the example figure, the x-axis represent each GO term, the y-axis represent the -log(adj_pvalue), each bar is colored by the z-score. If display is used to facet the plot the width of the panels will be proportional to the length of the x scale.

  • Extra Parameters

    Display: A character vector indicating whether a single plot ('single') or a facet plot with panels for each category should be drawn (default='single').

  • Reference Packages

    GOplot: (Maintainer: Wencke Walter wencke.walter@arcor.de)

# GOBubble Plot (opens new window)

  • Introduction

    The gobubble plot is used to display Z-score coloured bubble plot of terms ordered alternatively by z-score or the negative logarithm of the adjusted p-value.

  • Analysis of case data

    The loaded data are the results of GO enrichment with seven columns: category, GO id, GO term, gene count, gene name, logFC, adjust pvalue and zscore.

  • Interpretation of case statistics graphics

    As shown in the example figure, the x- axis of the plot represents the z-score. The negative logarithm of the adjusted p-value (corresponding to the significance of the term) is displayed on the y-axis. The area of the plotted circles is proportional to the number of genes assigned to the term. Each circle is coloured according to its category and labeled alternatively with the ID or term name.If static is set to FALSE the mouse hover effect will be enabled.

  • Extra Parameters

    Display: A character vector indicating whether a single plot ('single') or a facet plot with panels for each category should be drawn (default='single').

    Label ID: Defines whether label the GO term.

    Table Legend: Defines whether a table of GO ID and GO term should be displayed on the right side of the plot or not (default = TRUE).

    Label Threshold: Sets a threshold for the displayed labels. The threshold refers to the -log(adjusted p-value).

  • Reference Packages

    GOplot: (Maintainer: Wencke Walter wencke.walter@arcor.de)

# GOCircle Plot (opens new window)

  • Introduction

    The gocircle plot is used to display the circular plot combines gene expression and gene- annotation enrichment data. A subset of terms is displayed like the GOBar plot in combination with a scatter plot of the gene expression data. The whole plot is drawn on a specific coordinate system to achieve the circular layout. The segments are labeled with the term ID.

  • Analysis of case data

    The loaded data are the results of GO enrichment with seven columns: category, GO id, GO term, gene count, gene name, logFC, adjust pvalue and zscore.

  • Interpretation of case statistics graphics

    As shown in the example figure, the outer circle shows a scatter plot for each term of the logFC of the assigned genes. Red circles display up-regulation and blue ones down-regulation by default.

  • Extra Parameters

    Show Items: A numeric vector defines how many processes are displayed (starting from the first row of data).

    Label Size: Size of the segment labels (default=5).

    Rad1: The radius of the inner circle (default=2).

    Rad2: The radius of the outer circle (default=3).

    Table Legend: Shall a table be displayd or not? (default=TRUE).

  • Reference Packages

    GOplot: (Maintainer: Wencke Walter wencke.walter@arcor.de)

# Grouped Bubble Chart

  • Special Parameters:

    min_size: Minimum bubble size

    max_size: Maximum bubble size

    alpha: Bubble opacity

# Group-comparison Heatmap

This plugin provides a way to compare multiple variables across multiple (>2) groups and visualize the result with heatmap.

When the reference group is not set, let's say you have A,B,C 3 groups, then this plugin will

  • Compare A and B+C
  • Compare B and A+C
  • Compare C and A+B

If reference groups set to A, then

  • Compare A and B
  • Compare A and C

If you have discrete variable, only binary is valid, you can use TRUE/P for positive case and FALSE/N for negative case.

# Grouped Dumbbell Chart

  • Special Parameters:

    line_size: Line thickness

    dot_size: Dot size on both ends

# Grouped Line Chart

  • Input Data Analysis

    Use the names column to differentiate different lines and then specify the groups to which each line belongs using the groups column.

# Half Violin

  • Introduction

    The half violin plot is a statistical graph used to display the distribution and probability density of data by replacing the left part with the data frequency count graph on the basis of keeping the right part of violin graph.

  • Analysis of case data

    The loaded data is data set (gene names and expression levels in different tumors).

  • Interpretation of case statistics graphics

    The half violin plot can reflect the data distribution, which is similar to the box diagram. The black horizontal line in the box shows the median gene expression level in each tumor, and the upper and lower edges in the white box represent the upper and lower quartiles in the data set. The distribution of observable numerical points on the left half; The violin graph can also reflect the data density, and the more concentrated the data set, the fatter the graph. The gene expression distribution in the BLGG group is more concentrated, followed by BIC group and AML group.

# Heatmap

  • Introduction

    Heat map is an intuitive and visual method for analyzing the distribution of experimental data, which can be used for quality control of experimental data and visualization display of difference data, as well as clustering of data and samples to observe sample quality.

  • Analysis of case data

    The loaded data are Count (gene name and corresponding gene expression value), sampleInfo (sample name, group and other relevant information, such as age), and gene information (gene name and its pathway, such as tumor pathway and physiological pathway).

  • Interpretation of case statistics graphics

    In the example figure, each small grid represents each gene, and the shade of color represents the expression level of this gene. The larger the expression level is, the darker the color will be (red is up-regulated, green is down-regulated).Each row represents the expression of each gene in a different sample, and each column represents the expression of all genes in each sample.The upper tree represents the clustering analysis results of different samples of different groups and ages, and the left tree represents the clustering analysis results of different genes from different samples.

# HiC Heatmap (opens new window)

  • Introduction

    The HiC heatmap is used to display the genome-wide chromatin interaction with heatmap on different chromosomes.

  • Analysis of case data

    The loaded data have three columns, with the first for one locus bin index, the second for another locus bin index, and the third for the interaction frequency between this two locus.

  • Interpretation of case statistics graphics

    As shown in the example figure, a heat map represent the interaction frequency between any two locus. The color displays their intensity of interaction.

  • Extra Parameters

    Resolution: set the resolution of calculated interaction frequency.

    Interval: set the tick interval of x and y-axis.

# Histogram

  • Introduction

    Histogram refers to the distribution of continuous variable data by a series of vertical stripes or line segments with different heights.

  • Analysis of case data

    The loaded data is the data set (data on treatment outcomes of different treatment regimens).

  • Interpretation of case statistics graphics

    The width of the rectangle in the figure is proportional to and different from the spacing, and the vertical axis represents the frequency.

  • Extra parameters

    Bins: Change the width of the rectangle

# Interval Area Plot

  • Case Data Analysis

    The case data consists of the highest and lowest temperatures for each of the twelve months in a year, along with the corresponding abbreviated month names.

    Based on the case data, temperature intervals and average temperature lines have been plotted for each month.

  • Special Parameters

    need_mean_line: Whether to plot the mean line. If mean data is not provided, the mean will be calculated as (max value + min value) / 2 by default.

    title: Title of the plot.

    x_label: Label for the x-axis.

    y_label: Label for the y-axis.

    area_color: Color of the temperature interval area.

    mean_line_color: Color of the mean temperature line.

    min_line_color: Color of the line representing the minimum temperature.

    max_line_color: Color of the line representing the maximum temperature.

    line_size: Width of the lines.

    alpha: Transparency of the temperature interval area.

    max|min|mean_linetype: Line types for maximum, minimum, and mean lines.

    add_line_label: Whether to add labels to the lines.

# Interval Bar Chart

  • Case Data Analysis

    The case data consists of the highest and lowest temperatures for each of the twelve months along with their corresponding abbreviated month names.

    An interval bar chart has been created using the case data, showing the temperature ranges for each month and the mean temperature line.

  • Special Parameters

    need_mean_line: Whether to include the mean temperature line. If mean data is not provided, the mean will be calculated as (max temperature + min temperature) / 2 by default.

    title: Title of the chart.

    x_label: Name of the x-axis.

    y_label: Name of the y-axis.

    bar_color: Color of the bars in the bar chart.

    line_color: Color of the mean temperature line.

    line_size: Width of the mean temperature line.

    bar_width: Width of the bars. If the x-axis labels are discrete values, the width can be adjusted between 0 and 1.

    alpha: Transparency of the bars in the bar chart.

# Likert Plot

  • Introduction

    Descriptive statistical analysis of Likert scale data## Line (Color Dot)

  • Introduction

    Display the change of data points.

# Line(Errorbar)

  • Introduction

    The error line mainly indicates the error range of each data point and shows the potential error or uncertainty relative to each data in the series.

  • Analysis of case data

    The loaded data are two groups and their corresponding values.

  • Interpretation of case statistics graphics

    The red line in low type represents the maximum value of the low type in treat1, the red line represents the minimum value of the low type in treat1, and so on. Conduct rank-sum test on treat1 and treat2 according to low,mid and high types respectively, and display p-value at the top of the figure.

  • Extra parameters

    P Value Form:Value is a numeric value, signif is the number of "*" and no is not showing the P value

# Line

  • Introduction

    The line chart is a statistical chart that USES a linear or logarithmic scale to draw data in a two - or three-dimensional view to show the data set or track the characteristics of the data over time.

  • Analysis of case data

    The loaded data are the horizontal axis values and their corresponding vertical axis values and groups.

  • Interpretation of case statistics graphics

    The diagram shows that value1 is positively correlated with Value2 in treatment plan 1, while Value1 is negatively correlated with Value2 in treatment plan 2.

  • Extra Parameters

    Change Line Type:Change the shape of different grouping lines

    Add Points:Add the point corresponding to the value

# Line Regression

  • Introduction

    Linear regression is a regression method for linear modeling of the relationship between independent variables and dependent variables.If there is only one independent variable, it is called simple regression, and if there is more than one independent variable, it is called multiple regression.

  • Analysis of case data

    The loaded data are independent variables, dependent variables and groups.

  • Interpretation of case statistics graphics

    Different colors represent different groups, and linear regression equations can be added. The closer R squared is to 1, the closer the fitted curve is to the actual curve.

# [China Map 2](/basic/map-china 2)

  • Introduction

    A map of China is a statistical chart on a map of China showing the distribution of a set of variables in different provinces within China.

  • Analysis of case data

    The loaded data is the distribution of a variable in each province of China.

  • Interpretation of case statistics graphics

    The graph shows the distribution of variable values in different provinces in China. The redder the color, the higher the value, and the closer the color is to green, the lower the value.

# China Map

  • Introduction

    A map of China is a statistical chart on a map of China showing the distribution of a set of variables in different provinces within China.

  • Analysis of case data

    The loaded data is the distribution of a variable in each province of China.

  • Interpretation of case statistics graphics

    The graph shows the distribution of variable values in different provinces in China. The redder the color, the higher the value, and the closer the color is to green, the lower the value.

# World Map

  • Introduction

    A world map is a statistical chart on a world map showing the distribution of a set of variables in different countries of the world.

  • Analysis of case data

    The loaded data is the distribution of a variable in countries around the world.

  • Interpretation of case statistics graphics

    The graph illustrates the distribution of variable values in different countries of the world. The redder the color, the higher the value, and the closer the color is to green, the lower the value.

# Matrix Bubble

  • # Function Introduction

    The color matrix bubble is used to visualize the expression matrix data of multiple genes (rows) in various cells (columns).

  • # Data Structure

    <1st-col>: (String) cell sample name as X axis,

    <2nd-col>: (String) gene name as the Y axis,

    <3rd-col>: (Numeric) Gene expression,

    [4th-col]: (String) groups.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Panel space: the distance between groups in the image when the cells are divided into multiple groups (default 0)

# Metawho

  • Introduction

The goal of metawho is to provide simple R implementation of “Meta-analytical method to Identify Who Benefits Most from Treatments”.

metawho is powered by R package metafor and does not support dataset contains individuals for now. Please use stata package ipdmetan if you are more familar with stata code.

  • Usage

    • data: a table contains at least columns 'trial' (for trial id), 'hr' (for hazard ratio), 'ci.lb' (for lower boundary of hazard ratio), 'ci.ub' (for upper boundary of hazard ratio) and 'ni' (number of sample).
    • conf_level: set the confidence interval coresponding to the 'ci.lb' and 'ci.ub', typically is 0.95.## Moon Charts
  • Introduction

    The moon chart is a graph that uses the moon's waxing and waning to reflect the size of the data.

  • Analysis of case data

    The loaded data are the name of the restaurant and the number of types of food in the restaurant, the number of decorative style types, the number of tableware sets and the number of price types.

  • Interpretation of case statistics graphics

    The first row of the diagram shows the name of the restaurant, the first column shows different variables in the restaurant, the blank moon represents the number of 1 (the least number), the black moon represents the number of 5 (the most number) data, as the number of data increases, the black area of the moon gradually becomes larger, that is, gradually becomes full moon.

# Mosaic Ratio Plot

  • Introduction

    Can be used to display the ratio using mosaic

# Multiple Histograms

  • Introduction

    Multiple histograms are plotted on the same graph to compare differences between multiple sets of data.

  • Special Parameters

    alpha: Transparency of the histogram colors.

    binwidth: Width of each bar in the histogram.

    frame_color: Color of the histogram's borders.

# Network (igraph)

  • Introduction

    Network (igraph) can be used to visulize basic network based on igraph.

  • Data Table

    Two data tables need to be entered. Table 1 is node information, including node ID and other annotation information, which can be used to map color and size. Table 2 is the link information, including node ID and connection information between nodes

  • Data args

    Label Column: used to label node names

    Color Column: used to map node and connection colors

    Node Size Column: used to map node size. If not set, the degrees (#links) of nodes will be used

    Mark Group Column: convert the selected types of nodes by using irregular color graphics

    Line Width Column: used to map the line width

  • Extra parameters

    Layout: set the distribution style of the nodes and lines of the network diagram

    Line Type: set type of lines

    Node Shape: set shape of nodes

    Transform (size): used to scale and transform node size column data

    Transform (width): used to scale and transform the line width column data

    Label Distance: control the distance between label and graph

# Nomogram (Logistic)

  • Introduction

    Nomogram is often used to evaluate the prognosis of oncology and medicine, and can visualize the results of logistic regression or Cox regression.

# nomogram

  • # Function Introduction

    Nomogram is often used to evaluate the prognosis of oncology and medicine, and can visualize the results of logistic regression or Cox regression.

  • # Data Structure

    Survival data frame with time, which sex and status are presented by number such as 0,1 according to example data.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

# Pareto Chart

  • Case Data Analysis

    The case data represents the sales data of a product on multiple platforms.

    The plugin will automatically plot a bar chart with sales data in descending order and simultaneously calculate the cumulative sales to draw the cumulative line chart.

# Parliament

  • Introduction

    The parliamentary chart is a data processing method that looks like a parliamentary seat, with points representing a data set to show the share ratio of each group more flexibly.

  • Analysis of case data

    The loaded data are groups and their corresponding values.

  • Interpretation of case statistics graphics

    Different colors represent different groups, and each dot represents one data point. It's a semicircle of points, the smallest fraction of which can be represented by a single point in a parliamentary diagram.

# PCA

  • Introduction

    Principal component analysis (PCA) is a data processing method with "dimension reduction" as the core, replacing multi-index data with a few comprehensive indicators (PCA), and restoring the most essential characteristics of data.

  • Analysis of case data

    The loaded data are set (gene name and corresponding gene expression value) and sample information (sample name and grouping).

  • Interpretation of case statistics graphics

    Different colors represent different samples, which can explain the relationship between principal components and original variables. For example, M1 has a greater contribution to PC1, while M8 has a greater negative correlation with PC1.

# PCAtools

PCAtools can reduce the dimensionality of data through principal component analysis, and view principal component related features at a two-dimensional level

  • Data table 1 (numerical matrix):

    Each column is a sample, and each row is a feature (such as gene, chip probe)

  • Data sheet 2 (sample information):

    The first column is the sample, and the other columns are the phenotypic characteristics of the sample, which can be used to mark the color and shape of the point and perform correlation analysis with the principal component

  • Data table 2 parameter columns

    Bi-plot color column (single choice) | Used to map phenotypic features to Bi-plot and mark points in different colors

    Bi-plot shape column (single choice) | Used to map phenotypic features to Bi-plot and mark points as different shapes

    Eigencorplot phenotype column (multiple choice) | Used to draw correlation heat map of phenotype and principal component

  • Extra parameters:

    * Components | The number of principal components of screeplot, pairsplot, plotloadings, and eigencorplot

    Top Variance | Used to filter the included numerical matrix features

    Corelation Method | Method pass to cor function for corelation calulation

    Method for Missing Values | Method for handling missing values in corelation calulation

    Screeplot Bar, Loadings * | Control the colors of images elements

Detail introduction about PCAtools (opens new window)

# Perspective

  • Introduction

    The three-dimensional perspective is a three-dimensional figure that can connect the higher values contained in a matrix with surfaces.

  • Analysis of case data

    The loaded data is a matrix.

  • Interpretation of case statistics graphics

    A clear view of a mountain peak.

# 3D pie

  • Introduction

    The 3D pie chart is a pie chart that has a 3D appearance.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    The figure shows more aesthetically the number of samples in groups 1 to 4 and the proportion corresponding to the number of samples in the 3D form. The number of samples in one group is 13, accounting for 11.71%; the number of samples in two groups is 34, accounting for 30.63%; the number of samples in three groups is 21, accounting for 18.92%; and the number of samples in four groups is 43, accounting for 38.74%.

  • Extra Parameters

    Radius:Diameter of disk

    Circle Height:Thickness of disk

    Theta:The Angle of the horizontal rotation of a 3D pie chart

    Explode:The dispersion degree of each group pie chart

# Pie

  • Introduction

    The pie chart is a statistical chart that shows the proportion of each part by dividing a circle into sections.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    In a circle graph, the arc length of each slice (the arc length of its center Angle and the region corresponding to its center Angle) is proportional to the number represented. The pie chart shows the number of samples for the 1 to 4 components and the corresponding proportions. The number of samples in one group is 13, accounting for 11.71%; the number of samples in two groups is 34, accounting for 30.63%; the number of samples in three groups is 21, accounting for 18.92%; and the number of samples in four groups is 43, accounting for 38.74%.

# Point (SD)

  • Introduction

    Displaying the standard deviation (SD) of multi-group data.

# Pyramid Chart2

  • Introduction

    The pyramid chart is a pyramid-like figure that distributes data on both sides of a central axis.

  • Analysis of case data

    The loaded data are age, gender, and the number of people after the combination of the first two variables .

  • Interpretation of case statistics graphics

    The graph shows the age groups from bottom to top in order on the central axis, the left side represents the number of men, the right side represents the number of women, and the X-axis represents the number of people. The graph clearly shows the proportion of men and women in different age groups and the proportion of different age groups in the same gender.

# Pyramid Chart

  • Introduction

    The pyramid chart is a pyramid-like figure that distributes data on both sides of a central axis.

  • Analysis of case data

    The loaded data are age, gender, and the number of people after the combination of the first two variables .

  • Interpretation of case statistics graphics

    The graph shows the age groups from bottom to top in order on the central axis, the left side represents the number of men, the right side represents the number of women, and the X-axis represents the number of people. The graph clearly shows the proportion of men and women in different age groups and the proportion of different age groups in the same gender.

  • Extra parameters

    sort:No means the central axis is arranged in the order of age;Desc means descending order by total population;Asc stands for ascending order by total population.

# Pyramid Stack2

  • Introduction

    The pyramid stack is a pyramid-like figure that distributes data on both sides of a central axis.

# Pyramid Stack

  • Introduction

    The pyramid stack is a pyramid-like figure that distributes data on both sides of a central axis.

# QQplot

  • Introduction

    Verify whether a set of data comes from a certain distribution or whether two sets of data come from the same (family) distribution.

# Radar

  • Introduction

    Radar chart displays multivariable data in the form of two-dimensional charts representing three or more quantitative variables on the axis starting from the same point, so as to visually express the comparison of a research object in multiple parameters.

  • Analysis of case data

    The loaded data is data set (expression levels of 5 genes in 4 diseases).

  • Interpretation of case statistics graphics

    Each color of the radar map represents a disease, and the position of each point represents different gene expression. The higher the gene expression value, the farther away it is from the center of the circle, and vice versa.

# R Script Flow

  • Introduction

    R script flow can realize the visual window of if, else and other logic functions.

  • Analysis of case data

    The loaded data are if, else function code of R.

  • Interpretation of case statistics graphics

    The graph shows that if the variable x<10, a =1, if the variable x>10, a = 2;If a is equal to 2, then c is equal to d, if a is not equal to 2, then d is equal to a.

# Ribbon

  • Introduction

    The ribbon diagram is a pattern similar to a ribbon.

  • Analysis of case data

    The loaded data are the X-axis values and their corresponding Y-axis values and groups.

  • Interpretation of case statistics graphics

    Each color represents a different grouping, through which broken lines can be seen the change of each group of data over time.

# Ridge

  • Introduction

    The ridge map is a graph that connects points and forms a ridge.

  • Analysis of case data

    The loaded data are three groups and their corresponding values.

  • Interpretation of case statistics graphics

    Different colors represent different groups, and the approximate degree of data can be observed.

# ROC

  • Introduction

    Receiver operating characteristic curve (ROC curve) is used to describe the diagnostic ability of binary classifier system when its recognition threshold changes.

  • Analysis of case data

    The loaded data are the outcomes of one column of dichotomous variables and three columns of different variables (diagnostic indicators) and their values.

  • Interpretation of case statistics graphics

    There is no functional relationship between specificity on the horizontal axis and sensitivity on the vertical axis.The closer the curve is to the upper left corner, the better the predictive ability of the diagnostic index is.Each color represented a variable (diagnostic indicator), and the blue and red curves were significantly better predictors than the green curves.AUC is the area under ROC curve.AUC=1 indicates that there is at least one threshold on the curve that leads to a perfect prediction.0.5<AUC<1, better than random guess, appropriate selection of threshold value, can have predictive value. AUC=0.5, like random guesses, the model has no predictive value. If AUC<0.5, the possible reason is that the dichotomy variable such as (0,1) is reversed with the ending setting, and the result assignment can be reversed.In this diagram, it can be considered that Am variable has the best predictive ability as shown in value-Am(86.9792)>value-GG(84.3750)>value-EL(56.7708).

  • Extra parameters

    Add interval:Shape means to draw a confidence interval with color, bars means to draw a confidence interval with vertical bars, and no means not to draw a confidence interval.

    Smooth Curve:make a step diagram into a smooth curve.

    Evaluation Modle:

    Compare Modles:

# Rose Chart

  • Introduction

The rose chart is a column chart drawn in polar coordinates. The radius of the arc is used to indicate the size of the data.

  • Analysis of case data

The loaded data is divided into three columns, the first column is the sample name, the second column is the grouping, and the third column is the value corresponding to the sample.

  • Interpretation of case statistics graphics

The case data is the distribution of up- and down-regulated genes in different organs after using scRNA-Seq to sequence different human organs.

# Sankey

  • Introduction

    Sankey diagrams are a type of flow diagramin which the width of the arrows is proportional to the flow rate.

  • Analysis of case data

    The loaded data are the four variables and the frequency of combination of four variables.

  • Interpretation of case statistics graphics

    The color of female shunting is blue, the color of male shunting is red, and the width of blue shunting is equal to the total width of female.

# Scatter2

  • Introduction

    Two-dimensional spatial scatter to demonstrate multi-numerical variable relationships.

# 3D Scatter

  • Introduction

    3D scatter plot is to apply a number of quantitative variables to different coaxes in space and combine different variables into coordinates in space, so as to clearly explain the interaction between the three quantitative variables.

  • Analysis of case data

    The loaded data are three variables and grouping.

  • Interpretation of case statistics graphics

    In the figure, temperature, pressure and time are respectively placed on x (horizontal axis), Y (vertical axis) and Z (perspective axis) to generate a THREE-DIMENSIONAL scatter plot, and the correlation between variables and their correlation degree can be intuitively found.

  • Extra parameters

    Phi:The larger the positive value is, the more it tends to observe from the top down. The greater the negative value, the more likely it is to be observed from the bottom of the three-dimensional figure upwards.

    Theta:The Angle at which a three-dimensional figure rotates horizontally

# Gradient Scatter

  • Introduction

    Two-dimensional spatial scatter to demonstrate multi-numerical variable relationships.

# Scatter Pie

Scatter Pie can be used to visualize data fraction in different space coordinates.

Currently, this plugin provides a simple API for the Scatter Pie implementation in scatterpie (opens new window) package.

The input data table should contains at least 3 columns. The first two columns correspond to the space coordinates and other more columns represents data values in different categories. If you know the long/wide data format, you can also provide input data table with long format (default is wide format).## Scatter

  • Introduction

    Two groups of data are used to form multiple coordinate points. By observing the distribution of coordinate points, it can judge whether there is correlation between variables or summarize the data processing mode of coordinate point distribution.

  • Analysis of case data

    The loaded data are the horizontal axis values and their corresponding vertical axis values and groups.

  • Interpretation of case statistics graphics

    Value1 represents the horizontal axis and Value2 represents the vertical axis. The diagram shows that Value1 and Value2 in treatment plan 1 are positively correlated: that is, when Value1 becomes larger, Value2 will become larger; In treatment plan 2, two variables are negatively correlated: that is, when Value1 becomes larger, Value2 becomes smaller.

  • Extra parameters

    Change Point Shape:Different groups have different shapes.

# Simple Funnel Chart

  • Case Data Analysis

    A certain shopping website had a total of 60 visits, 30 clicks, and 20 purchases. Create a funnel chart based on the data.

  • Table Data Description

    The first row contains two parameter names, key and value, which should not be modified. They represent the data labels and their corresponding values, respectively.

    The following rows contain the specific data.

  • Special Parameter

    theme: The theme for the chart.

# slopegraph

  • Introduction

Sopegraph can be used to display the change of values.

# Stacked Percentage Bar Chart

  • Case Data Analysis

    The case data represents the sales percentage of three different products in a shopping mall over the course of one week.

  • Data Table

    The first column represents the y-axis labels, and the remaining columns contain the data for plotting.

  • Special Parameters

    title: Title of the chart.

    x_label: Name of the x-axis.

    y_label: Name of the y-axis.

    add_percent_label: Whether to add percentage labels on the chart.

# Stack Violin

  • # Function Introduction

    The expression of key genes in each cluster in single-cell transcriptomic (Single Cell RNA-Seq)analysis.

  • # Data Structure

    Gene expression matrix. Gene expression matrix in all cells and groups in single cell transcriptome analysis (Single Cell RNA-Seq).

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Y Axis Text Angle: Angle of y-axis text

# Survival Analysis

  • Introduction

    The survivorship curve is a graph showing the number or proportion of individuals surviving to each age for a given species or group (e.g. males or females).

  • Analysis of case data

    The loaded data are point-in-time, status and groups.

  • Interpretation of case statistics graphics

    The horizontal axis represents time and the vertical axis represents the probability of survival. The blue curve represents the survivolship curve of G1 group and the red curve represents the survivolship curve of G2 group. After logrank test, p value =0.0013<0.05 indicates that difference in survival status between the two groups could not be explained by sampling error, and the grouping factor is the reason for the difference in survival rate between the two curves. This graph shows that overall survival is better in G2 than in G1.

# Taylor Diagram

  • Introduction

    It can be used to display the standard deviation (SD), root mean square (RMS) error and correlation coefficient of the models simultaneously.

# time-roc

  • # Function Introduction

    Receiver Operating Characteristic (ROC) analysis with time records in survival analysis.

  • # Data Structure

    <Table1>: (Numeric) survival data (i.e survive, risk).

    <Table2>: (Numeric) time data.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Time unit: time unit, year, month, day optional

    Annotation text location (x): the position of annotation corresponding to the X axis

    Annotation text location (y): the position of annotation corresponding to the Y axis

# Heatmap Decision Tree

  • Introduction

    The heatmap decision tree is a visualization graph that combines two types of graphs: heatmap and decision tree visualization.

  • Case data analysis

    The first column is the name of the species, the second column is the island, and the rest are the species characteristics.

  • Graphic interpretation of case statistics

    The upper half of the figure is a decision diagram, and the lower half is a heat map.

    Decision tree: The topmost island has the greatest influence on the classification of species, and can be classified into different species according to different conditions.

    Heatmap: you can observe the changes in the amount of each species under each condition.

    Legend: For each categorical variable, different colors indicate different types; for continuous variables, the higher the value, the lighter the color, otherwise, the darker.

  • Extra parameters

    Display mode: heat-tree: draw a heat map decision tree diagram; heat-only: draw only a heat map; tree-only: draw a decision tree diagram only.

    The relative size of the heat map: 0 means no heat map; 1 means only heat map. That is, the proportion of the height of the heat map relative to the entire heat map decision tree.

    Graphic spacing: the spacing between the heat map and the decision tree diagram.

    Relative level weight: Relative weight of child node positions according to their levels, commonly ranges from 1 to 1.5. 1 for parent node perfectly in the middle of child nodes.

    Leaf node clustering: Logical. If TRUE, target/label is included in hierarchical clustering of samples within each leaf node and might yield a more interpretable heatmap.

    Sample clustering: Logical. If TRUE, hierarchical clustering would be performed among samples within each leaf node.

    Continuous variable legend: whether to display the color legend of the continuous variable of the heat map.

    Categorical variable legend: whether to display the color legend of the categorical variable of the heat map.

# Treemap

  • Introduction

    Tree map is a kind of tree structure diagram that graphical form to represent hierarchy structure.

  • Analysis of case data

    The loaded data are different groups and their data.

  • Interpretation of case statistics graphics

    Different colors represent different groups and different areas represent data and proportion.

  • Extra Parameters

    Corelated Labe Size with Box Size: Associate the label size with the square size ratio.

# Tricolored Histogram

  • Introduction

    The tricolored histogram divides the histogram into three regions: low-value zone, middle-value zone, and high-value zone, using three different colors.

  • Special Parameters

    left_color: Color for the low-value zone.

    middle_color: Color for the middle-value zone.

    right_color: Color for the high-value zone.

    alpha: Transparency.

    binwidth: Width of each bar in the histogram.

    low_value: Low value threshold.

    high_value: High value threshold.

# tSNE

  • Introduction

    T-sne is a nonlinear dimensionality reduction algorithm suitable for high-dimensional data reduction to two or three dimensions and visualization. The algorithm can make the t distribution of points with greater similarity closer in the lower dimensional space. For low similarity points, the t distribution is farther away in the low dimensional space.

  • Analysis of case data

    The loaded data are the data set (gene name and corresponding gene expression value) and sample information (sample name and grouping)

  • Interpretation of case statistics graphics

    Different colors represent different samples, which is the same as PCA (principal component analysis) graphic interpretation. The difference lies in the visualization effect. For dissimilar points in T-SNE, a small distance will generate a large gradient to repel them.

  • Extra parameters

    Perplexity:

    Theta:

# UMAP

  • Introduction

    UMAP is a nonlinear dimensionality reduction algorithm suitable for high-dimensional data reduction to two or three dimensions and visualization. The algorithm can make the t distribution of points with greater similarity closer in the lower dimensional space. For low similarity points, the t distribution is farther away in the low dimensional space.

  • Analysis of case data

    The loaded data are the data set (gene name and corresponding gene expression value) and sample information (sample name and grouping)

  • Interpretation of case statistics graphics

    Different colors represent different samples, which is the same as PCA (principal component analysis) graphic interpretation. The difference lies in the visualization effect. For dissimilar points in T-SNE, a small distance will generate a large gradient to repel them.

  • Extra parameters

    Perplexity:

    Theta:

# Upset

  • Data table

    There are two types of data tables: list and binary. The list format is that each column is a set and contains all the elements corresponding to the set. In the binary format, the first column is all the elements of all sets, and the subsequent columns are a numeric matrix composed of 0 and 1. 1 indicates that the corresponding row element exists in a certain set, and 0 indicates that it does not exist.

  • Extra parameters

    Sets Label | Label of set barplots (left)

    Comb Label | Label of combined barplots (top)

    Data Types | list and binary. The parameter description is the same as the data table.

    Mode: Refer to https://jokergoo.github.io/ComplexHeatmap-reference/book/upset-plot.html#upset-mode

    Reorder Sets | Sort the set histogram

    Reorder Comb | Sort the combination histogram

    Point Size | Set the size of the points

    Line Width | Set the width of the lines

    Sets Bar | set the color of the set barplots (left)

    Comb Bar | set the color of the combined barplots (top)

    Point Lines | set the dotted line color

    Background Rectangles | background rectangle color

    Unselected Points | Unselected point color

# Venn2

  • Introduction

    A Venn diagram is a diagramthat shows all possible logical relations between a finite collection of different sets. These diagrams depict elements as points in the plane, and sets as regions inside closed curves. A Venn diagram consists of multiple overlapping closed curves, usually circles, each representing a set. The points inside a curve labelled S represent elements of the set S, while points outside the boundary represent elements not in the set S. This lends to easily read visualizations; for example, the set of all elements that are members of both sets Sand T, S ∩ T, is represented visually by the area of overlap of the regions S and T. In Venn diagrams the curves are overlapped in every possible way, showing all possible relations between the sets.

  • Analysis of case data

    The loaded data is a collection of five gene names.

  • Interpretation of case statistics graphics

    The closed curve of 5 colors represents 5 sets, and the number represents the number of overlapping or non-overlapping genes in multiple sets. For example, 8 in the sample figure represents 8 identical gene names in 5 sample sets.

# Venn

  • Introduction

    A Venn diagram is a diagramthat shows all possible logical relations between a finite collection of different sets. These diagrams depict elements as points in the plane, and sets as regions inside closed curves. A Venn diagram consists of multiple overlapping closed curves, usually circles, each representing a set. The points inside a curve labelled S represent elements of the set S, while points outside the boundary represent elements not in the set S. This lends to easily read visualizations; for example, the set of all elements that are members of both sets Sand T, S ∩ T, is represented visually by the area of overlap of the regions S and T. In Venn diagrams the curves are overlapped in every possible way, showing all possible relations between the sets.

  • Analysis of case data

    The loaded data is a collection of five gene names.

  • Interpretation of case statistics graphics

    The closed curve of 5 colors represents 5 sets, and the number represents the number of overlapping or non-overlapping genes in multiple sets. For example, 8 in the sample figure represents 8 identical gene names in 5 sample sets.

# Violin-Group

  • # Function Introduction

    Violin and box plot of grouped data with T-test.

  • # Data Structure

    Data frame. <1st-col>: (Numeric) value data,

    <2nd-col>: (String) 1st-rank groups,

    <3rd-col>: (String) 2nd-rank groups.

  • # Parameter details

    Main parameters

    Title: the main title of the image (some images can replace the default title)

    Theme: image theme (provided by ggplot2)

    Color palette: image matching

    Font family: font (such as time new Roma specified by mainstream journals)

    Width: the width of the output image (the default is inches, such as 12 x 7 inch in the standard)

    Height: the height of the output image (the default is inches, such as 12 x 7 inch)

    Alpha: transparency of the element (0-1, 0 for transparency, 1 for opacity)

    Important parameters

    Legend position: the position of the legend in the image

    Legend direction: arrangement of multiple elements in legend (horizontal or vertical)

    Legend Title Size: Legend Main Title Size

    Legend text size: the size of the element text in the legend

    Axis title size: the size of the image axis title

    Axis font size: the size of the image axis scale font

    Axis text angle: the angle of the image axis text

    Axis adjust: image axis text distance (fine tuning)

    Special parameters

    Add geom: add box line graph to violin graph to show more abundant information

    Stat method: confidence statistics method (t-test by default)

# Violin

  • Introduction

    The violin plot, named for its resemblance to a violin, is a statistical diagram combining a box diagram with a kernel density diagram to show the distribution of data and the probability density.

  • Analysis of case data

    The loaded data is data set (gene names and expression levels in different tumors).

  • Interpretation of case statistics graphics

    The violin plot can reflect the data distribution, which is similar to the box diagram. The black horizontal line in the box shows the median gene expression level in each tumor, and the upper and lower edges in the white box represent the upper and lower quartiles in the data set. The violin graph can also reflect the data density, and the more concentrated the data set, the fatter the graph. The gene expression distribution in the BLGG group is more concentrated, followed by BIC group and AML group.

  • Extra parameters

    P Value Form:Value is a numeric value, signif is the number of "*" and no is not showing the P value

    Add Boxplot:Place a boxplot inside the violin

# Volcano

  • Introduction

    The volcanogram is a visual representation of the difference in gene expression between two samples.

  • Analysis of case data

    The loaded data is the gene name and its corresponding logFC and p.value.

  • Interpretation of case statistics graphics

    The horizontal axis is denoted by log2 (fold change), and the more different genes are distributed at both ends of the picture.The ordinate is denoted by -log10 (p.value) and is the negative log of the P value of T test significance.Blue dots represent down-regulated genes, red dots represent up-regulated genes, and gray dots represent genes that are not significantly different.

# Waterfall Chart 2

  • Introduction

    Used to visualize changes in data, with the difference from version 1 being the ability to customize the colors for upward and downward values.

  • Special Parameters

    calc_total: Whether to calculate the total of the data.

    width: Width of the bars.

    up_color: Color for upward values.

    down_color: Color for downward values.

    total_color: Color for the total value.

# Waterfalls (opens new window)

  • Introduction

    The waterfall chart is used to display the cumulative effect of sequentially introduced positive or negative values . These intermediate values can either be time based or category based.

  • Analysis of case data

    The loaded data have two columns, with the first for category based items and the second for their corresponding values.

  • Interpretation of case statistics graphics

    As shown in the example figure, the x-axis represent each category based items, the y-axis showing their cumulative values. Increments and decrements that are sufficiently extreme can cause the cumulative total to fall above and below the axis at various points.

  • Extra Parameters

    Rect Text Size: set the size of the text in the rectangles.

    Bar Width: set the width of the rectangle bar.

    Line Type: set the the linetype drawn between successive rectangles.

  • Reference Packages

    waterfalls: (Maintainer: Hugh Parsonage hugh.parsonage+waterfall@gmail.com)

Last Updated: 1/9/2023, 2:35:57 PM