Home AlphaFold 3 Unveiled: $250M Seed-Funded AI Breakthrough in Molecular Interaction Prediction

AlphaFold 3 Unveiled: $250M Seed-Funded AI Breakthrough in Molecular Interaction Prediction

Jun 11, 2025 09:03 CST Updated 09:03
DeepMind

Artificial Intelligence Enterprises

Isomorphic Labs

AI Drug Developer

May 8, 2024, GoogleDeepMindJointly published with Isomorphic Labs in the journal *Nature* the latest AI model AlphaFold 3 in the field of proteins! This model can accurately predict the structures of biomolecules such as proteins, DNA, RNA, and ligands, as well as their interaction patterns. This follows...AlphaFold 2Another Major Breakthrough Afterwards

图片

In predicting drug interactions, AlphaFold 3 has achieved unprecedented accuracy, including the binding of proteins with ligands and antibodies with their target proteins.PoseBustersIn the benchmarking of,AlphaFold 3 is 50% more accurate than the best existing traditional methods, and it requires no structural information input, making it the first artificial intelligence system to surpass traditional physics-based prediction tools.This ability to predict antibody-protein binding is crucial for understanding various aspects of the human immune response and new antibodies.The design is crucially importantYes.


With the development of high-throughput biotechnology, various omics technologies have been developed to characterize different but complementary biological information, includingGenomics, Epigenomics, Transcriptomics, Proteomics, and Metabolomics, etc.

Recent advances in artificial intelligence have evolved from "shallow" learning architectures to "deep" learning architectures. As an important branch of artificial intelligence, machine learning (ML) can automatically learn to capture complex patterns and make intelligent decisions based on data. ML has a very wide range of applications in cancer research and clinical oncology. In particular, driven by the rapid growth of multi-omics data, deep learning (DL)-based methods, which belong to the subfield of ML, have become a powerful tool for biomedical data analysis.

How hot are the research fields of artificial intelligence and omics, and why is it necessary to hold training sessions? The following content provides answers.

In the past two years, top research groups from MIT, Harvard University, UPenn, Tsinghua University, Fudan University, Westlake University, and others have been engaged in research on artificial intelligence and biomedicine. These research achievements have been repeatedly published in well-known international top journals such as Nature Reviews Genetics, Nature Methods, Science Advances, Cancer Cell, and Nature Biotechnology, laying the foundation for our publication in top journals.

Due to the limited research materials and learning platforms, non-disclosure of information technology, and the urgent need for training, we sincerely invite you to attend the "Artificial Intelligence and Omics" online training course. The number of participating members has reached over 2000! Publish in top journals! Hop on board quickly!



图片

Eight Major Courses to Help Publish in Top Journals

      
图片

01

AI Protein Design

02

CADD Computer-Aided Drug Design

03

AIDD Artificial Intelligence Drug Discovery and Design Top Journal Reproduction

04

Machine Learning Microbiome Multi-Omics Joint Analysis

05

CRISPR-Cas9 Gene Editing Technology

06

Deep Learning Genomics

07

Machine Learning Metabolomics

08

Deep Learning Analysis of Proteomics



图片

Introduction of the Lecturer

图片
图片

图片

AI Protein Design

The lecturer, Dr. Liu, holds a Ph.D. in Bioinformatics and has been engaged in bioinformatics and medical artificial intelligence research for 15 years. He has developed several bioinformatics tools, published over 20 SCI papers, including nearly 10 articles on artificial intelligence algorithms, and authored a practical textbook on medical data analysis. His research focuses on the application of medical artificial intelligence in the diagnosis and treatment of complex diseases.

CADD Computer-Aided Drug Design

The lecturer is from the Institute of Biophysics, Chinese Academy of Medical Sciences (PUMC). The lecturer specializes in deep learning, machine learning, virtual drug screening, computer-aided drug design, AI-driven drug discovery, molecular docking, and molecular dynamics. They have published several articles in CNS journals and possess extensive training experience, having trained over 5,000 students.

AIDD Artificial Intelligence Drug Discovery Top Journal Reproduction

The lecturer is from Tianjin University, with over a decade of experience in computer algorithm research and programming. Research areas include bioinformatics, deep learning, drug synthesis pathway design, and adverse drug reactions. The lecturer holds 5 invention patents, has participated in 4 key national scientific research projects, and has published 10 high-level SCI papers in well-known journals such as BMC Bioinformatics, Journal of Biomedical Informatics, and International Journal of Molecular Sciences.

Machine Learning Metabolomics

The lecturer is a Ph.D. in neuroscience from a 985 university, primarily utilizing technologies such as metabolomics, transcriptomics, and molecular biology to study the pathogenesis and biomarkers of chronic diseases in neurology. Skilled in conducting comprehensive research on untargeted and targeted metabolomics using liquid chromatography-mass spectrometry (LC-MS) technology, from sample preparation to data analysis, as well as bioinformatics integration analysis of multi-omics big data. In the past five years, they have published 10 SCI papers in journals such as J Clin Invest, EBioMedicine, Cell Death Dis, Cell Death Discov, and Nanotoxicology.

Machine Learning Microbiomics

The main speaker, Dr. Li, holds a Ph.D. in Bioinformatics and has over a decade of experience in sequencing data analysis. His research areas include machine learning, microarray data analysis, nucleic acid and protein sequence analysis, metagenomics, DNA, RNA, methylation sequencing data analysis, single-cell sequencing data analysis, miRNA and target gene analysis, survival analysis, and prognostic model construction. He has extensive training experience, having conducted more than 50 online and offline training sessions. The training content covers the application of machine learning in biomedicine, microbiology, and proteomics, single-cell multi-omics data mining, WGCNA co-expression network construction, ceRNA network construction, and R programming basics. He has published over 30 SCI papers, including 15 as the first author or co-first author, with an h-index of 20.

Deep Learning Genomics

The lecturer, Dr. Chen, is a Ph.D. candidate from the Netherlands. He has published several papers in domestic and international academic journals, including renowned journals such as Nature Communications and Cell Regeneration. His research focuses mainly on the three-dimensional structure of chromatin, bioinformatics, developmental biology, and genetics. By utilizing multi-omics data and deep learning algorithms, he conducts data analysis and mining, including ChIP-seq, ATAC-seq, RNA-seq, CNV, etc., to address and answer fundamental biological mechanisms within the field.

Deep Learning Analysis of Proteomics

The lecturer, Dr. Liu, holds a Ph.D. in Bioinformatics and has been engaged in bioinformatics and medical artificial intelligence research for 15 years. He has developed several bioinformatics tools, published over 20 SCI papers, including nearly 10 articles on artificial intelligence algorithms, and authored a practical textbook on medical data analysis. His research focuses on the application of medical artificial intelligence in the diagnosis and treatment of complex diseases.

CRISPR-Cas9 Gene Editing Technology

The lecturer, from the Chinese Academy of Agricultural Sciences, has over a decade of experience in gene editing research. They are familiar with the application of gene editing across various fields and have deep expertise in the development and optimization of gene editing systems. With dozens of SCI publications, they also possess extensive teaching experience!

   


图片
1
AI Protein DesignCurriculum Content
图片

Swipe to view


Day One

Overview of Protein Design and Tool Preparation

1.Why Do Protein Design?

The vast potential conformational space of proteins

2.Classification of Protein Design Methods

The current best solution:

Protein Structure Prediction:Alphafold2,Rosettafold2

Fixed Structure Sequence Prediction:ProteinMPNN

De Novo Design:RFDiffusion+ProteinMPNN+Alphafold2Iteration

3.VscodeThe use of,sshConnecting to Supercomputing Clusters (Hands-on)

VScodeInstallation

Remote sshPlugin Installation

~/.ssh/configIn China Configurationusername, ipInformation

4.LinuxConfiguration, CreationpythonEnvironment (Practical Operation)

Conda create -n env_name python=3.9

5.Supercomputing Job Submission (Hands-on)

SlurmTeaching of the Operation Management System,sbatch, salloc,scancelUsage

6.Overview of Generative Models, Special LectureDiffusion modelTheory

The Next Day

Deep Learning Methods for Protein Structure Prediction

1.Deep Learning-Based Model--Alphafold2Rosettafold

AF2Reasons for Success:

a.UtilizeMSAInformation

b.TransformerExtract RowMSAInformation

c.Recycling

d.Self-distillation Dataset (pLDDTThe introduction of)

2. AF2 Local Operation (Practical)

2.1Based onAlphafold2Reproduction Work—OpenfoldUnifold

3.Alphafold2Hands-on Operation

3.1MSABymmseqs2 apiGenerate, no need to download the dataset of structures and sequences (required3TBSpace)

4.Model Based on Language Models—ESMfold(Practical Operation)

4.1ESMfoldThe Logic: UseMasked LMReplaceAF2in ChinaMSAModule

4.2ESMfoldInstallation: (EnsurenvccInstallation)

5.Protein Multichain Structure Prediction—Alphafold multimer

6.Protein-Nucleic Acid Complex Prediction—RosetaffoldNA(Practical Operation)

Add nucleic acid representation

7.Protein-Nucleic Acid-Small Molecule Complex Prediction—Rosetaffold-all atom, Alphafold3

Day Three

Deep Learning Methods and Models for Protein Multi-Conformation Sampling

Protein Multiconformation Prediction (Simulation)

1. Traditional Physics-Based Methods—Molecular DynamicsMD

2. Based onMSAMethod of manipulation—MSA subsamplingAF cluster(Practical Operation)

MSA subsamplingMethod SubsamplingAF2TheMSAInput

MSA subsamplingMethod

2.1Environment Configuration andAF2The same

2.2AF_clusterMethod

2.3Environment Configuration andAF2Run the same

2.4GenerateMSA

2.5Model Prediction

3.Methods Based on Generative Models—AlphaflowUFConfDiGAlphaflow uses flow matching(Practical Operation)

3.1AlphaflowMethod

3.2pythonEnvironment Configuration Operation

3.3input_csvSequence Information of Representative Proteins

3.4msa_dirRepresentativeMSAThe Path

3.5weightsModel used by the representative

3.6.samplesRepresentative Sampling Number

4.UFConfUseDiffusion model(Practical Operation)

Day Four

Deep LearningProtein Dataset Mining Tool and Protein Pocket Search Tool

1.Protein Dataset Mining Tool

1.1Sequence Alignment and Clustering Tools

1.2BLASTSlow speed

2.Sequence Rapid Alignment ToolMMseqs2

2.1.Diagonalk-merShort Sequence Matching

22.tableFind CorrespondencetargetSequencek-merLocation of occurrence

2.3.targetSequence andquerySequence Matching

3.Structure Comparison and Clustering Tools

3.1TM-align

4.Structural Rapid Comparison ToolFoldseek

41FoldseekTeam &mmseqsSeries Comparison

5.FoldseekHands-on Operation

6Deep learning protein pocket search tool

6.1.Protein Pocket Search

6.2Alpha sphere

7.Structure-Based Protein Pocket Search Tool--FpocketCavityPlus(Practical Operation)

8.Trajectory-based (multi-conformation) protein pocket search—Mdpocket(Practical Operation)

9.A Deep Learning-Based Tool for Predicting Protein-Small Molecule Binding Sites—Diffdock

DiffdockHands-on Operation

Day Five

Application of Deep Learning in Protein Design

1.Overview of Deep Learning-Based Protein Design

2.Structural Generation Model--RFDiffusion

3.RFDiffusionIsconditionalStructural Generation Model (Practical Operation)

3.3.RFDiffusion: Based onRosettaFold

3.4.RFDiffusion-All-Atom: Based onRosettaFold-All-Atom

3.5.RFDiffusionHands-on Operation

3.6.RFDiffusionDesign Framework Structure

4.Inverse Folding Model--ProteinMPNN(Practical Operation)

4.1inverse foldingModel

4.2ProteinMPNNHands-on Operation of Reverse Folding Design Sequence

5.UtilizeAF2Improve the Success Rate of Protein Design (Practical Operations)

5.1Alphafold2Folding Design Sequence

5.2ScreeningAlphafold2In ChinapLDDTHigher Sequence

5.3Iterative Prediction Structure

6.Binder designDesign Process

6.1RFDiffusionDesignbinder

6.2ProteinMPNN-FastRelax Binder DesignDesign

6.3AF2 complex predictionDesign

7.Structural Sequence Generation Model--ProteinGenerator

7.1ProteinGeneratorIs the generation of structure and sequence

Day Six

Protein Design Based on Deep Learning Language Models

1.Deep Learning Enzyme Design (Hands-on)

11.Principles of Enzyme Design

1.2.Enzyme Property Prediction

1.3.RFDiffusionAAModel

1.4RFDiffusionAAAndRFDiffusionComparison

2.Protein Design Based on Language Models

2.1ProgenModel (Practical Operation)

2.2ProgenTraining

2.3conditional tagTraining of the following language models

3.ESM2ESM3(Practical Operation)

3.1MultimodalESM3Language Model

3.2Training in three modules: sequence, structure, and function

Day Seven

Deep Learning-Assisted Enzyme Design

1. Basic Knowledge Explanation

Enzyme Transition State Theory, Theozyme, Fitness Landscape, Epistasis

2. The Development of Directed Evolution Methods for Enzymes as Seen from the Work of Frances H. Arnold (who won the 2018 Nobel Prize in Chemistry for her contributions to the directed evolution of enzymes)

1. Traditional Directed Evolution Experimental Workflow

2. MLDE (Machine Learning Directed Evolution), learns the mapping relationship between sequences and enzyme performance, recommending new mutation combinations (PNAS article)

3. ftMLDE (focused training MLDE), active learning process, constructing informative training data (Cell Systems article)

3. De novo design of enzymes

1.De novo design of Diels-Alder catalytic enzymes

a) Rosetta-based Inside-out Strategy (Science Article)

b) Improving structural issues through the Foldit protein folding game (Nat. Biotechnol. article);

c) The Practice of Foldit Protein Folding Game*

2. De novo design of luciferase, Family-wide hallucination, generating new structures based on the structural hallucination of the enzyme family (reproducing Nature article)

3. RFdiffusion+PLACER De Novo Design of Serine Hydrolase (Reproducing Science Article)

4. Using the similarity of predicted structures to mine new enzyme functions from sequences (Reproducing Cell article)

1.Download data from the InterPro database

2. TM-score Calculates Structural Distance

3.UPGMA Structural Clustering, Draw the Phylogenetic Tree

4. Sequence Selection



Case Study Images:

图片
图片
2
CADD Computer-Aided Drug DesignCurriculum Content
图片

Swipe to view

The morning of the first day

Background and Theoretical Knowledge as Well as Tool Preparation

1. Introduction and Use of the PDB Database

1.1 Introduction to the Database

1.2 Query and Selection of Target Protein Structures

1.3 Download of Target Protein Structure Sequence

1.4 Download and Preprocessing of Target Proteins

1.5 Batch Download Protein Crystal Structures

2. Introduction and Usage of Pymol

2.1 Introduction to Basic Software Operations and Fundamental Knowledge

2.2 Protein-Ligand Interaction Diagram

2.3 Protein-Ligand Small Molecule Surface Diagram, Electrostatic Potential Representation

2.4 Protein-Ligand Structure Superposition and Alignment

2.5 Plotting Interaction Forces

3.Introduction and Use of Notepad

3.1 Introduction to Advantages and Main Functions

3.2 Interface and Basic Operations Introduction

3.3 Plugin Installation and Usage

Afternoon

General Protein

-Ligand Molecular Docking Explanation

1. Introduction to Relevant Theories of Docking

1.1 The Concept and Basic Principles of Molecular Docking

1.2 Basic Methods of Molecular Docking

1.3 Commonly Used Software for Molecular Docking

1.4 General Process of Molecular Docking

2. Conventional Protein-Ligand Docking

2.1 Collection of Receptors and Ligand Molecules

2.2 Processing of Complex Pre-conformations

2.3 Preparation of Receptor and Ligand Molecules

2.4 Protein-Ligand Docking

2.5 Analysis of Docking Results

Taking the main protease of the新冠病毒 protein and related inhibitors as an example

The Next Day

Virtual Screening

1. Introduction and Download of Small Molecule Database

2. Introduction to Related Programs

2.1 Introduction and Usage of OpenBabel

2.2 Introduction and Use of ChemDraw

3. Preprocessing for Virtual Screening

4. The Process and Practical Demonstration of Virtual Screening

Case: Screening for Main Protease Inhibitors of SARS-CoV-2

5. Result Analysis and Plotting

6. Drug ADME Prediction

6.1 Introduction to ADME Concepts

6.2 Introduction to Relevant Websites and Software for Prediction

6.3 Analysis of Prediction Results

Day Three

Expansion of docking methods

1.Protein-Protein Docking

1.1 Application Scenarios of Protein-Protein Docking

1.2 Introduction to Related Procedures

1.3 Collection and Preprocessing of Target Protein

1.4 Calculation Using Examples

1.5 Preset of Key Residues

1.6 Acquisition of Results and File Types

1.7 Analysis of Results

With the current popular target

PD-1/PD-L1, etc.

2. Involving the docking of metalloenzyme proteins

2.1 Background Introduction of Metalloenzyme Protein-Ligand

2.2 Collection and Preprocessing of Proteins and Ligand Molecules

2.3 Treatment of Metal Ions

2.4 Docking of Metal Cofactor Protein-Ligand

2.5 Result Analysis

Taking human farnesyltransferase and its inhibitors as examples

3. Protein-Polysaccharide Molecular Docking

4.1 Protein-Polysaccharide Interactions

4.2 Key Points of Docking Processing

4.3 The Process of Protein-Polysaccharide Molecular Docking

4.4 Protein-Polysaccharide Molecular Docking

4.5 Analysis of Related Results

Inα-Glycosyltransferase and Polysaccharide Molecular Docking as Examples

5. Nucleic Acid-Small Molecule Docking

5.1 Application Status of Nucleic Acid-Small Molecules

5.2 Introduction to Related Procedures

5.3 Types of Nucleic Acid-Small Molecule Binding

5.4 Nucleic Acid-Small Molecule Docking

5.5 Analysis of Related Results

Human Telomere

g - Quadruple chain and ligand molecular docking as an example.

Introduction to Operating Procedures and Practical Demonstration

Day Four

Methods for Expanding Docking Usage

1.Flexible Docking

1.1 Introduction to the Use Cases of Flexible Docking

1.2 Advantages of Flexible Docking

1.3 Protein-Ligand Flexible Docking

Focus: Method for setting flexible residues

1.4 Analysis of Related Results

Cyclin-dependent kinase

2 (CDK2) with ligand 1CK as an example

2. Covalent Docking

2.1 Introduction to Two Covalent Docking Methods

2.1.1 Flexible Side Chain Method

2.1.2 Two-Point Attractor Method

2.2 Collection and Preprocessing of Proteins and Ligands

2.3 Covalent Docking of Covalent Drug Molecules with Target Proteins

2.4 Comparison of Results

Taking the currently popular covalent drugs for COVID-19 as an example.

3. Protein-Hydration Docking

3.1 The Significance and Methods of Hydration in Protein-Ligand Interactions

3.2 Collection and Preprocessing of Proteins and Ligands

3.3 Preparation of Relevant Parameters for Docking

Focus: The Addition and Treatment of Water Molecules

3.4 Protein-Water-Ligand Docking

3.5 Result Analysis

Acetylcholine-binding protein

(AChBP) with nicotine complex as an example

Day Five

Molecular Dynamics Simulation (Linux and GROMACS Installation and Usage)

1. Introduction and Simple Usage of Linux System

1.1 Common Linux Command Lines

1.2 Common Program Installation on Linux

1.3 Experience: How to Perform Virtual Screening on Linux

2. Introduction to Molecular Dynamics Theory

2.1 Principles of Molecular Dynamics Simulation

2.2 Methods and Related Programs of Molecular Dynamics Simulation

2.3 Introduction to Related Force Fields

3. Introduction and Usage of Gromacs

Focus: Introduction to Main Commands and Parameters

4. Introduction and Use of Origin

Day Six

Execution of Solvated Molecular Dynamics Simulations

1. General Process for Handling Solvated Proteins

2. Preparation of Protein Crystals

3. Energy Minimization of Structures

4. Pre-equilibration of the system

5. Unrestricted Molecular Dynamics Simulation

6. Molecular Dynamics Results Presentation and Interpretation

Taking lysozyme in water as an example

Day Seven

Execution of Protein-Ligand Molecular Dynamics Simulations

1. Protein-Ligand Processing Workflow in Molecular Dynamics Simulations

2. Preparation of Protein Crystals

3. Preparation of Initial Conformations for Protein-Ligand Docking

4. Preparation of Ligand Molecular Force Field Topology Files

4.1 Brief Introduction to Gauss

4.2 A Brief Introduction to Ambertool

4.3 Generating Force Field Parameter Files for Small Molecules

5. Pre-equilibration with separate restraints on temperature and pressure for the complex system

6. Unrestricted Molecular Dynamics Simulation

7. Presentation and Interpretation of Molecular Dynamics Results

8. Trajectory Post-processing and Analysis

Taking the main protease of the coronavirus protein target and related inhibitors as examples                                                           

Case Practice Images:

图片


3
AIDD Artificial Intelligence Drug Discovery Top Journal ReproductionCurriculum Content
图片

Swipe to view



FirstDay

Environment Setup andDepthBasic Knowledge Learning and Explanation

1.AIDDOverview: FromCADDToAIDD

2.Software Installation and Environment Setup

(1)anaconda

(2)vscode

(3)Configuration of Environmental Variables

(4)SwitchpipAndcondaMirror Source

(5)Creation of Virtual Environment

3.RDKITUse of the Toolkit

(1)Based onRDKitMolecular Reading and Writing

(2)Based onRDKitMolecular Drawing

(3)Based onRDKitMolecular Fingerprint and Molecular Descriptors

(4)Based onRDKitCompound Similarity and Substructures

4.Methods for Acquiring Comprehensive Drug Databases

(1)Based onrequestsBasic Crawling Operations

(2)Small Molecule DatabasePubChemData Acquisitionpubchempy / requests

(3)Protein DatabasePDBUniProtData Acquisition

5.Deep Learning-Assisted Drug Design

(1)Basic Concepts of Neural Networks andsklearnIntroduction to the Toolkit

(2)Basic Knowledge of Graph Neural Networks and Message Passing Mechanisms

(3)TransformerBasic Model Knowledge: Tokenization, Positional Encoding, Attention Mechanism, Encoder, Decoder, Pre-training-Fine-tuning Framework,huggingface Ecosystem Introduction

(4)Model Evaluation and Validation: Accuracy, Precision, Recall,F1Score,ROCCurve,AUCCalculation, Mean Absolute Error, Mean Squared Error,R2Scores, Explained Variance Scores, Cross-Validation, etc.

SecondDay

Top Journal Reproduction Series1——Representation Learning and Property Prediction of Molecules and Biochemical Reactions for Drug Discovery

Training Background:In artificial intelligence-assisted drug discovery (AIDD) in,Representation Learning and Property Prediction of Molecules and Biochemical ReactionsIs the cornerstone of the entire research process. The structure of a molecule determines its function, and how to effectively represent complex molecular structures and biochemical reaction processes in a form that computational models can understand is a precondition for achieving efficient prediction and optimization. By constructing reasonable molecular representations (such as graph neural networks,SMILESEncoding, fingerprints, etc.), we can letAIThe model captures key chemical features, which are then used to predict the physicochemical properties, biological activity, and toxicity of molecules, providing a reliable foundation for subsequent virtual screening, molecular generation, and reaction design. Therefore, this topic not only establishesAIDDThe core competency framework for modeling and predictive capabilities in China also lays a solid foundation for intelligent decision-making throughout the drug discovery process.

Training Content1:

Nature Machine Intelligence|Application of Attention-Based Neural Networks in Chemical Reaction Space Mapping《Mapping the space of chemical reactions using attention-based neural networks

1.Dataset

1.1.PistachioDataset: Includes260Tens of thousands of chemical reactions, from patent data, covering792A category of reactions. The data were deduplicated and filtered for validity (usingRDKit)。

1.2.USPTO 1k TPLDataset: Based onUSPTOPatent data, including44.5Ten Thousand Reactions, Generated through Atom Mapping and Template Extraction1,000A reaction template category.

1.3.Schneider 50kDataset: Public dataset, containing5Myriad Reactions,50A category used for comparison with traditional fingerprint methods.

2.Model.The study compared twoTransformerArchitecture:

2.1.BERTClassifier: An encoder-based model, after being pre-trained through masked language modeling, is fine-tuned on classification tasks, using[CLS]Tagged embeddings as reaction fingerprints (rxnfp)。

2.2.Seq2SeqModel: Encoder-Decoder structure that breaks down the classification task into hierarchical predictions of superclasses, categories, and specific reactions. Both adopt a simplified version.BERT(Hidden Layer256Dimension), Input is UnlabeledSMILESSequence, no reagents required-Reagent differentiation or atomic mapping.

3.Training. Model training is divided into two steps:

3.1.Pre-training:BERTThrough MaskSMILESToken prediction tasks are used for self-supervised learning to learn general representations.

3.2.Fine-tuning: Optimizing the model for classification tasks using cross-entropy loss, learning rate2×10⁻⁵, Sequence Length512. The evaluation adopts confusion entropy (CEN) and Matthews correlation coefficient (MCC) to address data imbalance.


Training Content2:

TOPJournal | Prediction of Biochemical Reaction Yield Based on Deep LearningPrediction of chemical reaction yields using deep learning》 

1.Data. The study used three types of data:  

1.1.Buchwald-Hartwig HTEDataset: Contains3955IndividualPdCatalysisC-NConjugation reaction, covering15Halide minerals,4Ligand,3Alkali and23A combination of additives, yield measured through standardized experiments, high data quality.

1.2.Suzuki-Miyaura HTEDataset: Contains5760A reaction, involving15Electrophilic/Nucleophile,12Ligand,8Alkaline and4A combination of solvents with uniformly distributed yields.

1.3.USPTOPatent Dataset: Extracted from publicly available patents, containing reaction yields of different scales (gram-scale and sub-gram-scale). The data is noisy and inconsistently distributed, requiring smoothing via neighboring reaction yields to improve model performance.

2.Model. The core model is based on pre-trainedrxnfp(Reaction Fingerprint)BERTArchitecture, with the addition of a regression layerYield-BERT. The input is a standardized reactionSMILES, capturing contextual information of the reaction center and key reagents through the self-attention mechanism. The model does not require handcrafted features (such asDFTCalculate descriptors), directly predict yield in an end-to-end manner. Experiments show that its performance is superior to traditional methods (such as random forests and molecular fingerprint concatenation), especially inHTEThe data is close to the prediction level of chemical descriptors, and the parameters are highly robust (little impact from hyperparameter tuning).

3.Training. Training is divided into two steps:  

3.1.Pre-training:BERTLearning through masked language tasksSMILESGeneral representation.

3.2.Fine-tuning: Adopt simpleTransformersLibrary andPyTorchFramework, toMSELoss Optimization Regression Layer, Learning Rate (2×10⁻⁵) anddropoutRate (0.1–0.8) as the main parameter tuning object.HTEData Adopted Randomly/Time Division Validation,USPTOData is smoothed by proximity reaction yield to mitigate noise impact. Small-sample experiments (5%Training data) shows that the model can quickly screen high-yield reactions and guide synthesis optimization.


Training Content3:

TOPJournal|Based onT5ChemRepresentation Learning and Property Prediction of Biochemical Reactions in ModelsUnified Deep Learning Model for Multitask Reaction Predictions with Explanation

1.Data Sources and Processing.Through self-supervised pre-training andPubChemThe molecular dataset is used for training to achieve excellent performance in four different types of chemical reaction prediction tasks. The model handles reaction type classification, forward reaction prediction, single-step retrosynthesis, and reaction yield prediction.

2.Model Architecture and PrinciplesT5ChemThe model is based on natural language processing.“Text-to-Text Transfer Transformer”(T5)Unified deep learning model for framework development, adapted byT5Framework for handling multiple chemical reaction prediction tasks.T5ChemThe model includes an encoder.-Decoder structure, and introduces task-specific prompts and different output layers according to the task type, such as molecular generation head, classification head, and regression head, to handle sequence-to-sequence tasks, reaction type classification, and product yield prediction.

3.Training Process and Details.

3.1.T5ChemThe model was first introduced inPubChemThe97 millionPerform self-supervised pre-training on the molecule, usingBERTSimilar“masked language modeling”Target.

3.2.During the pre-training phase, in the source sequencetokensRandomly masked, the model's goal is to predict the correct masked elements.tokens

3.3.After pre-training is completed, the model is fine-tuned in downstream supervised tasks using different task-specific prompts and output layers.

3.4.The model generates molecules during the testing phase.token by tokenin a predictive manner until generationEnd of sentence markerOr reach the maximum predicted length.


TheThreeDay

Top Journal Reproduction Series2——Representation Learning and Property Prediction of Proteins Facilitate Drug Discovery

Training BackgroundInAIDDIn China, proteins are the main targets of drug action, and the complexity of their structure and function determines the success or failure of drug design.Representation Learning and Property Prediction of ProteinsIsomorphic Labs-Target interactions and the discovery of candidate drugs are crucial steps. Proteins, especially enzymes, serve as the primary targets for drugs, and their functions, structures, and dynamic properties directly influence drug design and efficacy. This topic is explored through two cutting-edge research works:*Enzyme function prediction using contrastive learningDemonstrates how to use contrastive learning to extract high-quality functional representations from protein sequences, achieving precise prediction of enzyme functions;CatPred*An integrated deep learning framework was proposed for in vitro enzyme kinetics parameters, such asKmkcatetc.) are crucial for establishing efficacy models and optimizing lead compounds. These methods have significantly improved the accuracy and generalization ability of protein modeling, providingAIProvides strong support for target discovery, mechanism understanding, and candidate drug screening.

Training Content1: 

Nature Communication|A Comprehensive Framework for Deep Learning of Enzyme Kinetic Parameters In Vitro《CatPred: a comprehensive framework for deep learning in vitro enzyme kinetic parameters

CatPred A comprehensive deep learning framework for predicting in vitro enzyme kinetic parameters (kcatKmKi), to address the issues of high experimental measurement costs, sparse data, and poor generalization capabilities. This method not only provides accurate predictions but also introduces a quantification of prediction uncertainty, supporting out-of-distribution (out-of-distribution) Robust prediction of enzyme sequences. In addition, the authors constructed a new standardized dataset (CatPred-DB), and systematically compared various enzyme representation methods.

1.DataCatPred The dataset used comes fromBRENDA AndSABIO-RK Database, the author constructedCatPred-DB, including:23197 Articlekcat41174 ArticleKmAnd11929 ArticleKi Data, each record contains the amino acid sequence of the enzyme,AlphaFold OrESMFold Predicted structures, substratesSMILES Expression. The data was cleaned and standardized, with missing and duplicate values removed, and a logarithmic transformation applied to the parameters to conform to a normal distribution.

2.ModelCatPred Adopting a modular design, enzymes and substrates are represented through different neural network modules for representation learning, and probabilistic regression output (in the form of mean and variance of a Gaussian distribution) is used, allowing for uncertainty estimation.aleatoric + epistemic)。

3.Training

3.1.All models adopt the negative log-likelihood loss function (NLL) training, to simultaneously predict parameter mean and uncertainty.

3.2.Training Use-Verification-Test Trisection Method (80%-10%-10%), and establishOut-of-training setThe test subset is used for generalization ability evaluation.

3.3.In order to assess uncertainty,CatPred Use10An ensemble of models trained with different initial parameters to quantifyepistemic uncertainty

3.4.During model training, different similarities (sequencesidentity<99%80%60%40%) The test set demonstrates its robustness.


Training Content2:

Science|Based on Contrastive LearningProteinClassificationAttribute PredictionEnzyme function prediction using contrastive learning

1.Data Sources and Processing: CLEANThe model's training is based onUniProtHigh-quality data in the database, which contains approximately1.9Hundreds of millions of protein sequences.CLEANThe model takes amino acid sequences as input and outputs a list of enzyme functions sorted by probability (ECNumbering as an example). In order to verifyCLEANaccuracy and robustness, the authors conducted extensivein silicoExperiment, andCLEANApplied to an internally collected database of uncharacterized halogenases (total36Individual)ECNumbering annotations, followed by in vitro experimental validation through case studies.

2.Model Architecture and Principles: CLEANThe model adopts a contrastive learning framework, aiming to learn an embedding space for enzymes, where the Euclidean distance reflects functional similarity. Embedding refers to the numerical representation of protein sequences, which is machine-readable while preserving the important characteristics and information carried by the enzymes. InCLEANIn the task, with the sameECThe numbered amino acid sequences have smaller Euclidean distances, while those with differencesECThe sequence of numbers has a greater distance.

3.Training Process and Details:

3.1.During the training process,CLEANThe model is trained with a contrastive loss function, where positive samples are selected based on their proximity to the anchor (anchor) Embed negative sequences with small Euclidean distances to improve training efficiency.

3.2.Model uses language modelESM1bThe obtained protein representation serves as the input to a feedforward neural network, with the output layer generating refined, function-aware embeddings of the input proteins.

3.3.When predicting, by calculating the query sequence with allECThe pairwise distances between clustered centers are used to predict the input protein.ECNumbering.

3.4.CLEANTwo methods have also been developed to predict confidence from the output rankings.ECNumber: One is a greedy method, and the other is based onPThe method of value.


FourthDay

Top Journal Reproduction Series3——Deep Learning-Based Molecular Generation Aids Drug Discovery

Training BackgroundMolecular generation is a key technology in fields such as chemistry, biology, and materials science, and is of great significance for new drug development, new material design, and chemical reaction prediction. Traditional molecular generation methods rely on expert knowledge and trial-and-error experiments, which are time-consuming and costly. With the development of artificial intelligence technology, especially the application of natural language processing and diffusion models in molecular generation, we are now able to use computational models to accelerate this process. This course will introduce fromNLPThe design patterns of diffusion models, which are capable of understanding and generating molecular structures, thereby enhancing the efficiency and accuracy of molecular design. Through this course, participants will be able to master the latest technologies and methods in molecular generation, as well as how to apply these techniques to practical problems.

Training Content1

Nature CommunicationBased onEnd-to-End Graph Generation Framework for Molecular GenerationRetrosynthesis prediction using an end-to-end graph generative architecture for molecular graph editing

1.Data Sources and Processing:Graph2EditsThe model used a publicly available benchmark dataset.USPTO-50k, including50016A reaction, these reactions are correctly atom-mapped and classified as10Different types of reactions. The dataset is divided into40k5k5kThe reactions were used for the training, validation, and test sets.

2.Model Architecture and Principles:Graph2EditsThe model is an end-to-end graph generation architecture based on graph neural networks (GNN) Predict the editing sequence of the product graph and generate intermediates and final reactants based on the predicted editing sequence order. This model merges the two-stage process of the semi-template method (identifying the reaction center and completing the synthon) into a one-pot learning process, enhancing its applicability in complex reactions and making the prediction results more interpretable. The core of the model consists of a graph encoder and an autoregressive model used to generate the editing sequence and apply these edits to infer intermediates and reactants.

3.Training Process and Details:

3.1.Graph2EditsThe model uses a directed message-passing neural network (D-MPNN) as the graph encoder to obtain atomic representations and global graph features, and predict atoms/Key editing and termination symbols.

3.2.Model Training UseTeacher Mandatory Policy, that is, using real editing sequences as model input. At each editing step, the model calculates the probabilities of all possible edits and selects the highest-scoring one.kAn editor applies these edits to the input graph to obtainkAn intermediate.

3.3.During the generation process, the generation branch will stop if the maximum number of steps is reached or the graph representation indicates termination.

3.4.Finally, according to the possibilities for the precedingkA sequence and graph are ranked, and collected as the final prediction result.


Training Content2

Nature Computational Science|Molecular Generation Network Based on Equivariant Diffusion Model《Structure-based drug design with equivariant diffusion models

1.Brief introduction. This paper proposes a structure-based drug design method (SBDD), utilizingSE(3)-Equivariant Diffusion Model (DiffSBDD) Generate novel small molecule ligands that match the binding site conditions of proteins. This method involvesSBDDThe problem is modeled as a 3D conditional generation task, capable of generating all atomic positions at once, overcoming the limitations of traditional autoregressive methods that lose global context due to sequential generation.DiffSBDDNot only supports de novo molecular design, but also enables property optimization, negative design, and molecular fragment modification (inpainting) and other tasks are applied flexibly.

2.Data Summary. The study usedCrossDockedAndBinding MOADTwo datasets for training and evaluation.

2.1.CrossDockedThe dataset contains40,344Training Protein-Ligand Pair and130A test pair, validation set size is246Ensure that proteins from different sets come from different main enzyme classification categories to avoid overfitting.

2.2.Binding MOADThe dataset was filtered and used for testing, with analysis limited to samples that all methods could generate.78IndividualCrossDockedAnd119IndividualBinding MOADObjective. In addition, dataset processing involves removing corrupted entries and throughZenodoPublicly provide processed data and sampled molecules to ensure research reproducibility.

 3.Model Summary.DiffSBDDIsSE(3)-Equivariant diffusion models generate 3D molecular structures conditioned on protein binding sites, using3DThe graph representation (atomic coordinates and types) avoids the complex post-processing required to infer molecular structures from density maps in traditional methods. The model design respects rotational and translational symmetries in three-dimensional space.


FifthDay

Top Journal Reproduction Series4: Protein Combined with Molecular Dynamics-Dynamic Prediction of Ligand-Complex Interactions

Training Background:Protein-The prediction of ligand interactions is one of the core tasks in modern drug discovery and bioengineering, and its importance goes without saying. In the drug development process, accurately predicting the binding sites, three-dimensional structures, and affinities of proteins with small-molecule ligands not only helps to uncover the mechanisms of intermolecular interactions but also significantly accelerates the screening and optimization of candidate drugs, reducing research and development costs and time. Traditional experimental methods such asXAlthough X-ray crystallography and nuclear magnetic resonance are precise, they are time-consuming, costly, and struggle to meet the demands of large-scale screening. However, with the rapid development of deep learning and artificial intelligence technologies, computational methods in protein-Show great potential in ligand prediction.

Research Content1: 

Nature Communication|Interaction-Aware Proteins-Ligand Docking and Affinity Prediction Model《Interformer: an interaction-aware model for protein-ligand docking and affinity prediction

1.Brief Introduction: This study proposes a method namedInterformerBased onGraph-TransformerUnified Model Architecture for Protein-Ligand docking and affinity prediction. Addressing the limitation of existing deep learning models that neglect modeling non-covalent interactions between protein and ligand atoms,InterformerIntroduced the Interaction-Aware Mixture Density Network (MDN) to explicitly capture hydrogen bonds and hydrophobic interactions, combined with a negative sampling strategy and pseudoHuberLoss Function, Optimizing Interaction Distribution through Contrastive Learning to Enhance the Accuracy of Docking Poses and the Robustness of Affinity Prediction.

2.Dataset: The study usedPDBBindTime-split Test Set (333samples) to evaluate docking accuracy,PosebustersBenchmarking validates physical reasonableness, and internal real-world datasets test generalization ability. Training data is sourced fromPDBBindCrystal Structure Database

3.ModelInterformerBased onGraph-TransformerArchitecture, including:(1) The graph represents a module, with atoms as nodes and adjacency relationships as edges;(2) Masked Self-Attention (MSA) mechanism, throughIntra-BlocksAndInter-BlocksCapture ligands separately/Interactions within and between proteins;(3) Interactive PerceptionMDN, integrating four Gaussian distributions to simulate van der Waals forces, hydrophobic interactions, and hydrogen bonds;(4) The edge output layer integrates node and edge features to predict energy;(5) The Pose Scoring and Affinity modules predict the correct pose and experimental affinity values based on virtual nodes.

4.Training Details: The training is divided into two stages: first, the energy model is trained based on crystal structures to generate negative samples, and then the pose scoring and affinity models are trained jointly with both positive and negative samples. Optimization is performed using negative log-likelihood loss.MDNBinary cross-entropy loss optimization posture scoring, pseudoHuberLoss (σ=4) Optimize Affinity Prediction (UnitIC50KdKI, after negative log normalization). Monte Carlo sampling generates candidate poses,


Research Content2:

Nature CommunicationMolecular Dynamics-Driven Protein-Dynamic Prediction of Ligand Complex StructuresDynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model

1.Brief Introduction: This study proposes a method namedDynamicBindA deep learning method for predicting ligand-specific proteins-Ligand complex structure. Traditional molecular docking methods often treat proteins as rigid or only partially flexible, making it difficult to handle large-scale conformational changes in proteins. While molecular dynamics simulations can capture dynamic conformations, the computational cost is high.DynamicBindBy constructing smooth energy landscapes through equivariant geometric diffusion networks, protein simulations without ligands are efficiently modeled (apo) State to Ligand Binding (holo) Conformational transition of the state, without relying onholoStructure or extensive sampling.

2.Dataset: Research based onPDBbind2020Database (19,443A protein-Ligand complex crystal structure), divided by time:2019Data from previous years were used for training and validation,2019The data from the year was used for testing. Additionally,Major Drug Targets (MDT)Test set (599Yes), focusing on kinase,GPCRSuch as the main drug targets, requirementsAlphaFoldPredicted structure and crystal structurepocket RMSD>2Å, ensure the test difficulty. Achieve this during training.AlphaFoldSample of protein sections generated by interpolating predicted structures with crystal structures.

3.ModelDynamicBindIt is an equivariant generative model based on graph neural networks, using coarse-grained representations (proteins asNodes and side chain dihedral angles representation, ligands represented by heavy atom nodes), outputs include translation, rotation, torsion angle updates of proteins and ligands, as well as binding affinity andcLDDTConfidence Score. The model learns fromapoToholoThe“morph-like”Transform, optimize energy landscapes, including63.67Million parameters.

4.Training Details: Training in8BlockNvidia A100 80GB GPUCarry out on5Day, input is additionmorphTransformed ProteindecoyConformations and ligand conformations with added Gaussian noise, the goal is denoising operation. The loss function includes eight terms (ligand and protein translation, rotation, torsion, etc.), throughKabschAlgorithm AlignmentapoAndholoStructure, combined with diffusion noise to adjust conformational transitions. Iterative reasoning.20Update the initial structure.

                                                      



Case Study Images:

图片
图片




4
Deep Learning Genomics Curriculum Content
图片

Swipe to view


Day One

Theoretical Part

Introduction to Deep Learning Algorithms

1.Supervised Learning Neural Network Algorithms

1.1Fully Connected Deep Neural NetworkDNNExamples of Applications in Genomics

1.2Convolutional Neural NetworkCNNExamples of Applications in Genomics

1.3Recurrent Neural NetworkRNNExamples of Applications in Genomics

1.4Graph Convolutional NetworkGCNExamples of Applications in Genomics

2.Unsupervised Neural Network Algorithms

2.1AutoencoderAEExamples of Applications in Genomics

2.2Generative Adversarial NetworkGANExamples of Applications in Genomics

Practical Content

1.LinuxOperating System

1.1Commonly UsedLinuxCommand

1.2 VimEditor

1.3Genomic Data File ManagementModify File Permissions

1.4View and Explore Genomic Regions

2.PythonLanguage Foundation

2.1.PythonPackage Installation and Environment Setup

2.2.Common Data Structures and Data Types

The Next Day

Theoretical Part

Fundamentals of Genomics

1.Genomic Database

2.Epigenome

3.Transcriptome

4.Proteome

5.Functional Genomics

Practical Content

Common Deep Learning Frameworks in Genomics

1.Install and Introduce Deep Learning Toolkitstensorflow, keraspytorch

2.Identifying Deep Learning Model Elements in the Toolkit

2.1.Data Representation

2.2.Tensor Operation

2.3."Layer" in Neural Networks

2.4.Model Composed of Layers

2.5.Loss Function and Optimizer

2.6.Dataset Splitting

2.7.Overfitting and Underfitting

3.Genomic Data Processing

3.1Install and Usekeras_dnaProcessing various gene sequence data such asBED GFFGTFBIGWIGBEDGRAPHWIGetc.

3.2Usekeras_dnaDesign Deep Learning Models

3.3Usekeras_dnaSplit Training Set, Test Set

3.4Usekeras_dnaSelecting gene sequences of specific chromosomes, etc.

4.Deep Neural NetworkDNNApplication in Identifying Motif Features

4.1Implement Single-Layer Single FilterDNNRecognition Motif

4.2Implement Multi-Layer Single FilterDNNRecognition Motif

4.3Implement Multi-Layer Multi-FilterDNNRecognition Motif

Day Three

Theoretical Part

Convolutional Neural NetworkCNNApplications in Gene Regulation Prediction

1.Chip-SeqMotif Feature IdentificationG4, such asDeepG4

2.Chip-SeqPredicted in ChinaDNAMethylation,DeepSEA

3.Chip-SeqPredicted transcription factor binding inDeepSEA

4.DNase-seqPredicting Chromosome Affinity in China,Basset

5.DNase-seqPredicting Gene Expression in ChinaeQTLEnformer

Practical Content

Reproducing Convolutional Neural NetworksCNNIdentify Motif FeaturesDeepG4, Non-coding gene mutationsDeepSEA, Predict Chromosome AffinityBasset, Gene ExpressioneQTL

1.ReproductionDeepG4FromChip-SeqIdentify in ChinaG4Features

2.Installationselene_sdk, ReproduceDeepSEAFromChip-SeqPredict in ChinaDNAMethylation, Non-coding Gene Mutations

3.ReproductionBasset, fromChip-SeqPredicting Chromosome Affinity in China

4.ReproductionEnformer, fromChip-SeqPredicting Gene Expression in ChinaeQTL

Day Four

Theoretical Part

Deep Learning in Identifying Copy Number VariationsDeepCNV, Regulatory FactorDeepFactorApplication on

1.SNPPrediction of Copy Number Variations in MicroarraysCNVDeepCNV

2.RNA-SeqPredicted in ChinapremiRNAdnnMiRPre

3.Predicting regulatory factor proteins from protein sequences,DeepFactor

Practical Content

1.ReproductionDeepCNVUtilizeSNPMicroarray Combined with Image Analysis for the Identification of Copy Number Variations

2.Recurrent Neural Network ReproductionRNNTool dnnMiRPre, fromRNA-SeqPredict in ChinapremiRNA

3.ReproductionDeepFactor, Identify transcriptional regulatory factor proteins from protein sequences

Day Five

Theoretical Part

Application of Deep Learning in Identification and Disease Phenotypes and Biomarkers

1.Deep Learning Tool for Identifying Breast Cancer Subtypes from Gene Expression DataDeepType

2.Identifying disease phenotypes from high-dimensional multi-omics data,XOmiVAE

3.Deep Learning Tools for Identifying Key Genes in Gene Sequences and Protein Interaction NetworksDeepHE

Practical Content

1.ReproductionDeepType, fromMETABRICDistinguishing Breast Cancer Subtypes in Breast Cancer Data

2.ReproductionXOmiVAE, fromTCGAIdentification of Breast Cancer Subtypes in Multidimensional Databases

3.ReproductionDeepHEIdentification of Key Genes Using Gene Sequences and Protein Interaction Networks                                                         





Case Study Images:



图片
5
Machine Learning Metabolomics Curriculum Content
图片

Swipe to view


The First Day, Morning

A1 Metabolites and the Development and Application of Metabolomics

(1) Metabolism and Physiological Processes;

(2) Metabolism and Disease;

(3) Untargeted and Targeted Metabolomics;

(4) Spatial Metabolomics and Mass Spectrometry Imaging (MSI);

(5) Metabolomics and drugs and biomarkers;

(6) Metabolic Flux and Mechanism Research.

A2 Metabolic Pathway and Metabolism Database

(1)  A Brief Introduction to Several Classical Metabolic Pathways;

(2) Three major common metabolite databases: HMDB, METLIN, and KEGG;

(3) Metabolomics Raw Database: Metabolomics Workbench and Metabolights. A3 Recommended References

The First Day, Afternoon

A4 Metabolomics Experimental Workflow Overview

A5 Chromatography, Mass Spectrometry Hardware and Principle Analysis

(1) Principle and Structure of Chromatographic Analysis;

(2) Selection of Chromatograph and Chromatography Column;

(3) Chromatographic mobile phase: Gradient elution method;

(4) Analysis of Ion Source, Mass Analyzer, and Mass Detector;

(5) Principle of Mass Spectrometry and Animation Demonstration;

(6) Liquid Chromatography-Mass Spectrometry (LC-MS);


The next morning

B1 Metabolite Sample Processing and Extraction

(1) Extraction procedures and precautions for various tissues, blood, and body fluid samples;

(2) Metabolite Extraction Process and Precautions;

(3) Issues related to the transportation and storage of samples and metabolites;

B2 LC-MS Data Quality Control and Database Search

(1) The method for setting QC and Blank samples during the LC-MS experiment;

(2) Data quality control monitoring and analysis during the LC-MS operation process;

(3) Principles of Upstream Analysis in Metabolomics — Based on Compound Discoverer and Xcms Software;

(4) Data conversion, peak picking, peak alignment, and library searching using Xcms software;

The Next Afternoon

B3 R Software Foundation

(1) Installation of R and RStudio;

(2) Configuration of Rstudio Interface;

(3) Basic Operations and Statistical Calculations in R;

(4) Packages in R: The use of packages, functions, and parameters;

(5) R language syntax, data types and data structures;

(6) R Basic Plotting;

B4 R Language Graphing Tool —— ggplot2 Package

(1) Introduction to ggplot2

(2) The philosophy of ggplot2 plotting;

(3) The color system of ggplot2;

(4) ggplot2 Data Mining and Plotting in Practice;


The Morning of the Third Day

Machine Learning

C1 Supervised Machine Learning in Metabolomics Data Processing

(1) The relationship among artificial intelligence, machine learning, and deep learning;

(2) Regression Algorithms: Starting from Linear Regression, Logistic Regression, and Cox Regression;

(3) PLS-DA Algorithm: Is There Hope for Data Without Differences After PCA Dimensionality Reduction?

(4) The Meaning and Selection of VIP Score;

(5) Classification algorithms: Decision Trees, Random Forests, and Bayesian Networks;

C2 An R Walkthrough for Implementing a Classification Algorithm on a Set of Metabolomics Data

(1) Data Interpretation;

(2) Drills and Operations;

The Third Day, Afternoon

C3 Unsupervised Machine Learning in Metabolomics Data Processing

(1) Dimensionality reduction in big data processing;

(2) PCA analysis plotting;

(3) Three Common Types of Cluster Analysis: K-means, Hierarchical Analysis, and SOM

(4) R language implementation of heatmaps and hcluster plots;

C4 A R Walkthrough for Dimensionality Reduction and Clustering Analysis of a Set of Metabolomics Data

(1) Data parsing;

(2) Drills and Operations;


The Fourth Day, Morning

D1 Online Metabolomics Analysis Webpage Metaboanalyst Operation

(1) Use R to clean the data into the format required by the webpage;

(2) Data format issues for independent groups, paired groups, and multiple groups;

(3) Upstream Analysis in Metaboanalyst (Raw Data Peak Extraction, Peak Alignment, and Library Search)

(4) The pipeline of Metaboanalyst, parameter settings, and considerations;

(5) Viewing and exporting results in Metaboanalyst;

(6) Data editing in Metaboanalyst;

(7) Full-process Drills and Operations.

(8) Metabolic Joint Multi-Omics Analysis Web Operation.

The Fourth Day, Afternoon

D2 Metabolomics Data Cleaning and Advanced R Programming

(1) t, fold-change, and response values in metabolomics;

(2) Data Cleaning Process;

(3) R language tidyverse;

(4) Data preprocessing: Data filtering and data normalization (Normalization of samples and Scaling of metabolites);

(5) Metabolomics Data Cleaning Exercise;


The Fifth Day, Morning

E1 Reproduction of Literature Data Analysis (1 paper)

(1) In-depth interpretation of literature;

(2) Practical operation: From raw data download to image reproduction;

(3) Trainee hands-on practice.

The Fifth Day, Afternoon

E2 Machine Learning and Metabolomics Top Journal Interpretation (3 articles);

(1) Signal Transduction and Targeted Therapy: An article on the effects of starvation on metabolomics in different brain regions

Metabolic atlas of mouse brain tissue with chemical alterations; (Database type)

(2) A metabolomics analysis of maternal blood throughout pregnancy in Cell identifies metabolic biomarkers for predicting gestational age and delivery.

Literature; (Biomarker Type)

(3) A metabolomic analysis of the gut microbiota in pancreatic cancer patients published in Nature identifies metabolites that can enhance chemotherapy efficacy.

Literature. (Mechanism Research Type)

                                                           





Case Practice Images:



图片
图片






6
Deep Learning Analysis of Macro Proteomics Course Content
图片

Swipe to view


Day One

Proteomics Sequencing Technology and Databases

MorningTheoretical Explanation

1.Proteomics Sequencing Mass Spectrometry Technology

2.Introduction to Proteomics Databases

3.Introduction to Deep Learning Analysis of Proteomics Models

AfternoonGPUServer Practical Operation

1.LinuxOperating System

1.1Commonly UsedLinuxCommand

1.2 VimEditor

1.3Genomic Data File ManagementModify File Permissions

1.4View and Explore Genomic Regions

2.PythonLanguage Foundation

2.1.PythonPackage Installation and Environment Setup

2.2.Common Data Structures and Data Types

The Next Day

Deep Learning Identifies Physicochemical Properties of Protein Peptides in Mass Spectrometry Sequencing

MorningTheoretical Explanation

1.Deep Learning Model Predicts Chromatography Retention Time and Fragment Ion ConcentrationProsit

2.Deep Learning Predicts Cross-Sectional Collisions in Mass Spectrometry SequencingCCSToolDeepCollisionalCrossSection

3.Deep Learning Predicts Single-Cell Proteomics CoverageDeepSCPModel

AfternoonDeep Learning ModelPythonCode Parsing andGPUServer Practical Operation

1.Reproducing Deep Learning Model Predictions for Chromatographic Retention Times and Fragment Ion ConcentrationsPrositModel

2.Reproducing Deep Learning Predictions in Mass Spectrometry Sequencing with Cross-Sectional Collision ToolsDeepCollisionalCrossSection

3.Reproducing Deep Learning Prediction of Single-Cell Proteomics CoverageDeepSCPModel

Day Three

Deep Learning for Peptide Identification and Peptide Assembly

MorningTheoretical Explanation

1.Deep Learning Identifies Peptides from Metaproteomics DeepFilterModel

2.Deep Learning Identifies Peptides from Protein DatabasesDeepDIAModel

3.Deep Learning for Peptide AssemblyDeepNovo AndDeepNovo-DIAModel

AfternoonDeep Learning ModelPythonCode Parsing andGPUServer Hands-on Operation

1.Reproducing Deep Learning for Peptide Identification from Metaproteomics DeepFilterModel

2.Reproducing Deep Learning for Peptide Identification from Protein DatabasesDeepDIAModel

3.Reproducing Deep Learning for Peptide AssemblyDeepNovoAndDeepNovo-DIAModel

Day Four

Deep Learning Identifies Post-Translational Modification Binding Sites for Disease and Drug Target Recognition

MorningTheoretical Explanation

1.Capsule Network Deep Learning Model for Predicting Post-Translational Modification Binding SitesCapsNet_PTM

2.Attention Mechanism Deep Learning PredictionMHC I Binding SiteACMEModel

3.Deep Learning ModelPUFFINQuantificationPeptide-MHCEnhancing High-Affinity Peptide Screening in Drug Design by Incorporating Uncertainty

4.Deep Learning Model Predicts Cancer AntigensACP-MHCNN Model

AfternoonDeep Learning ModelPythonCode Parsing andGPUServer Hands-on Operation

1.Recurrent Capsule Network Deep Learning Model for Predicting Post-Translational Modification Binding SitesCapsNet_PTM

2.Recurrent Attention Mechanism Deep Learning Predictionpan-specific MHC I Binding SiteACMEModel

3.Reproduce Deep Learning ModelsPUFFINQuantificationPeptide-MHCEnhancing High-Affinity Peptide Screening in Drug Design by Incorporating Uncertainty

4.Reproducing Deep Learning Model Predictions for Cancer AntigensACP-MHCNNModel

Day Five

Deep Learning Identifies Protein Functions

MorningTheoretical Explanation

1.Deep Learning Model3DConvolutional Networks Predict Proteins-Protein InteractionDeepRank

2.Deep Learning Model for Quantifying Protein ExpressionDLNetworkForProteinAbundance

3.Predicting Protein Function Using Deep Learning Models Based on Natural Language Attention MechanismSPROF-GO

4.Deep Learning ModelPCfun Predicting Protein ComplexesGene OntologyFunction

AfternoonDeep Learning ModelPythonCode Analysis andGPUServer Practical Operation

1.Reproduce Deep Learning Models3DConvolutional Networks Predict Proteins-Protein InteractionsDeepRank

2.Reproducing Deep Learning Models for Quantitative Protein ExpressionDLNetworkForProteinAbundance

3.Reproducing Deep Learning Model Predicting Protein Function Based on Natural Language Attention MechanismSPROF-GO

Reproduce Deep Learning ModelsPCfun Predicting Protein ComplexesGene OntologyFunction


                                                           





Case Practice Images:



图片
图片

7
Machine Learning Microbiome Multi-Omics Joint AnalysisCurriculum Content
图片

Swipe to view


Day One

Introduction to Microbial Multi-omics

1.Basic Concepts of Microbiology

2.Introduction to Commonly Used Analyses in Microbiology

3.Basic Concepts and Detection Methods of Metabolomics

4.Basic Concepts and Detection Methods of Transcriptomics

5.Introduction to Basic Concepts of Machine Learning

RIntroduction and Practical Operation of Language

1.RLanguage Overview

2.RSoftware andRPackage Installation

3.RLanguage Grammar and Data Types

4.Conditional Statement

5.Cycle

6.Function

7.Commonly Used Machine Learning and Microbial Multi-Omics Data Analysis RelatedRPackage Introduction

The Next Day

Introduction to Microbial Multi-omics Related Databases and Data Retrieval

1.Gut Microbiota+Metabolic Database

2.Curated Metagenomic Data

3.IBDMDBDatabase

4.GEODatabase

Case Studies on the Application of Microbial Multi-Omics

1.Using Machine Learning Based on Microbiomics+Metabolomics Data Predict Sample Type

2.Integrating Microbiome and Metabolomics Data to Identify Disease-Related Modules

3.Microbiomics in Tumor Research+Host Transcriptomics+Immunoassay Combination Analysis

4.Based on Microbiome Data+Transcriptome Array+Longitudinal Integrative Analysis of Metabolomics Data

Day Three (Practical Operation)

Introduction and Use of Zero-Code Microbial Multi-Omics Integration and Network Visualization Analysis Tools

1.Data Upload (Supported8Various types of data, including microbiological, metabolic, genetic, proteomic, etc.)

2.Select the appropriate database based on the data type

3.Building Networks

4.Visualization

Zero-Code Microbiome-Introduction and Use of Metabolomics Network Analysis Tools

1.Building Microbiota and Metabolism Models

2.Calculating Microbial Contribution to Metabolites Using Metabolic Models

3.Calculate the metabolic potential scores at the community level and use regression models to evaluate the differences in potential scores across different samples.

4.Visualize the impact of characteristic microorganisms on specific metabolites and identify key microorganisms

Introduction and Use of Zero-Code Microbiomics and Metabolomics Correlation Analysis Tools

  1. 1. Correlation Analysis within Omics
  2. 2. Inter-omics Correlation Analysis
  3. 3. Multi-omics Integrative Analysis
  4. 4. Multi-omics Network Analysis
  5. 5. Result Visualization

Day Four (Practical Operations)+Reproduction)

Using Machine Learning Based on Microbiomics+Metabolomics Data Predict Sample Type

1.α-diversity,β-diversityAnalysis

2.Dynamic Correlation Analysis of Diet and Metabolites

3.Microbiome Differences and Disease-Specific Analysis

4.Multi-omics Factor Analysis

5.Association Analysis of Microbiota Function and Metabolic Phenotypes

6.Integrating Microbiome and Metabolomics Data to Predict Sample Types

Integrating Microbiomics and Metabolomics Data to Identify Disease-Associated Modules

1.Integration of Microbiomics and Metabolomics Data

2.Identification of Disease-Related Multi-Omics Modules

3.Module Intersection Analysis

4.Predicting Disease Status Based on Modules Using Machine Learning

5.Important Module Analysis

Day Five (Practical Operation)+Reproduction)

Integrated Analysis of Microbiomics, Host Transcriptomics, and Immunity in Tumor Research

1.Microbiome Analysis

2.Transcriptomics Analysis, Differential Expression Gene Identification

3.ThroughCCAMethod for Correlating Microbiome Data with Host Transcriptomics Data

4.Microbial Immune Correlation Analysis

Longitudinal Integrated Analysis Based on Microbiome Data, Transcriptome Arrays, and Metabolome Data

1.Analysis of Gut Microbiota Composition

2.Combined Analysis of Microbiome and Metabolome

3.Integrated Analysis of Metabolomics and Transcriptomics

4.Microbiome-Host Interaction Analysis

Course Objectives

1.Understanding the Concepts Related to Microbial Multi-Omics

2.Understand machine learning related concepts and commonly used machine learning models

3.UnderstandRLanguage

4.Master commonly used microbiological multi-omics data analysis and machine learning related techniquesRUse of the Package

5.Mastering Microbiomics/Metabolomics/Host Transcriptomics Joint Analysis Ideas and Methods

6.ReproductionSCIArticle


                                                          




Case Practice Images:



图片
图片






图片

8
CRISPR-Cas9 Gene Editing TechnologyCurriculum Content
图片

Swipe to view


Day One

1.Introduction to Gene Editing Tools

1.Getting to the Root: Gene Editing and GMOs

a)Analyze the Essential Differences Between Gene Editing and Genetically Modified Organisms, and Discuss the Regulatory Differences of the Two Technologies

2.Pioneer of Gene Editing Tools-ZFNsAndTALENs

a)Design Principles, Advantages, Disadvantages, and Historical Contributions of Early Gene Editing Tools

3.CRISPRSystem Family Introduction

a)The Evolution from Bacterial Immune Systems to Gene Editing Tools, Various TypesCasClassification of Proteins

4.CRISPR-Cas9Working Principle

a)sgRNAAndDNACombining Mechanism,PAMIdentification, Double-Strand Break Repair Pathways

5.CRISPR-Cas9Mediated Gene Knockout and Knockin

a)ThroughNHEJAndHDRDifferent Editing Effects Achieved by Two Repair Pathways

6.CRISPR-Cas1213Working Principle

a)RNAPotential Applications of Targeted Editing and Diagnosis

7.New typeCRISPRSystem

a)CasΦ, Small-sizedCasProteinCasMINIetc.

b)High FidelityCas9VariantSpCas9-HFeSpCas9etc.

8.Introduction to Gene Cloning Related Technologies

a)Plasmid Design,PCR, Restriction enzyme digestion, ligation and other basic techniques

9.SnapgeneSoftware Practical Operation

a)Plasmid Map Design, Primer Design, Virtual Cloning and Sequencing Analysis


The Next Day

1.CRISPR-Cas9System Knockout Vector Construction Practice

a)sgRNADesign-related Precautions

i.     PAMSite Selection, Off-Target Prediction,GCContent Consideration, Secondary Structure Avoidance

b)Recommended auxiliary tools,CRISPickCHOPCHOP. Based on deep learningsgRNAPrediction tools, etc.

c)Conventional Construction Scheme

i.     FromoligoDetailed Process of Synthesizing into a Complete Vector and Common Problem Solving

d)Introduction to Sequencing Principles

i.     SangerSequencing and High-Throughput Sequencing Technology: Principles and Application Choices

e)Sequencing Data Analysis

2.Principle of Multiplex Gene Editing

a)Strategies for Simultaneously Editing Multiple Genes and Methods to Improve Editing Efficiency

b)Multi-Target Design and Mutual Interference Avoidance Strategy

3.Practical Operation of Multigene Editing Vector Construction

a)MoresgRNATandem Strategy

b)Multi-promoter Design Strategy and Expression Balance Considerations

4.CRISPRa/CRISPRi(Gene Activation and Gene Suppression)

a)dCas9-PVPRSystem Introduction, Detailed Explanation of Working Principles

b)dCas9-VP64/GI/SAMIntroduction to Gene Activation Systems

c)Introduction to Gene Editing Recruitment System (Suntag/Moontag

5.CRISPRSystem's 'Alternative' Applications

Day Three

1.CBEPrinciples and Applications of the System

a)CBESummary of the system evolution process, fromBE1To the latestCBEThe Evolution and Performance Improvement of the System

b)GenomeCBEEditor (Plant Breeding)/Gene Function Research/Clinical Treatment)

c)OrganelleCBEIntroduction to Editing Tools, Mitochondria/Special Challenges and Solutions in Chloroplast Editing

d)CBESystematic off-target effects,RNAOff-target &DNAOff-target Detection and Avoidance Strategies

e)New TypeCBESystem, A Guide to Comparing and Choosing Various Improved Versions

2.ABEThe Principle and Application of the System

a)PACEAndPANCEIntroduction to Artificial Directed Protein Evolution Systems and Other Conventional Protein Evolution Techniques

b)Escherichia coli Orthogonal Evolution System

c)ABESummary of the System's Evolution Process,ABE1.xToABE8.xPerformance Parameter Comparison

d)ABEThe "Alternative" Application of the System,ABEHow the System WorksCUnconventional functions such as editing and splicing regulation

3.Dual-base editing system

a)SWISS/STEME/A&C-BEmax/SPACE/ACBE, The design principles and application scenarios of various dual-base editing systems

b)Summary of Dual-Base Editing System Modifications

4.Other Types of Base Editing Systems

a)Glycosylase-Mediated Base Editing: A Novel Editing Mechanism and Application Potential

b)CGBEAYBEgGBETSBE

Day Four

1.Reporting System

a)Cell Experiment Combined with Flow Cytometry Analysis

b)Plant Stable Transformation Herbicide Resistance, Colorimetric and Other Reporter Systems

2.Practical Operation of Protoplast Preparation and Application

3.Practical Optimization of Cells and Gene Editing Tools

4.RNAEditing System

5.PESystem Principle

a)Prime EditingWorking Mechanism: Reverse Transcription, Strand Displacement, Repair

b)Detailed Analysis of Factors Affecting Editing Efficiency: Comparison of the Impact of Various Parameters on Editing Efficiency

c)DoublepegRNAPrinciples and Applications (Large Genomic Fragment Insertion)

d)Large Genomic Deletion


Day Five

1.PESystem Optimization Case

a)All FieldsPEOptimized Success Case Analysis: Plants, Human Cells

b)System Transformation Strategy for Specific Application Scenarios

2.PESystem Construction in Practice

a)Application of Primer Design Tools

b)Hands-on Practice of Vector Construction: Experimental Workflow from Basic Vectors to Mature Systems

3.Lentivirus Packaging and Delivery

4.Other Delivery Systems

a)Nanoparticle Delivery: Liposomes, Polymers, etc.

b)Physical Methods: Electroporation, Microinjection, Biolistic Technology

5.Ethics and Safety



                                                          




Case Practice Images:



图片
图片


图片

图片

Training Objectives

图片



图片
图片

图片


01.AI Protein Design: The course will provide a detailed explanation of various protein structure prediction models, including Alphafold2, Rosettafold2, ESMfold, RosettafoldNA, Rosettafold All Atom, and AlphaFold3, enabling students to master the use of multiple protein structure prediction models and compare different protein sampling methods. Students will also learn protein multi-conformation sampling methods and model usage tools, deep learning protein dataset mining tools, and protein pocket search tools with hands-on demonstrations. Students will understand the theoretical foundations of these two tools and, through practical operation, learn how to identify and analyze protein pockets. Deep learning-based protein RFDiffusion (structure generation model), ProteinMPNN (inverse folding model), and ProteinGenerator (structure and sequence generation model) utilize Alphafold2 to improve the success rate of protein design; enabling students to master David Baker's core technologies.

02.CADD Computer-Aided Drug Design: This training mainly covers 10 docking methods, including metalloenzyme-protein docking, protein-polysaccharide docking, nucleic acid-small molecule docking, flexible docking, covalent docking, protein-hydrated docking, protein-water-ligand docking, antibody docking, macromolecule docking (protein-peptide docking), macromolecule protein-protein docking, as well as virtual screening and molecular dynamics simulation.

03.AIDD Artificial Intelligence Drug Discovery Top Journal Reproduction: This training focuses on mastering the application of deep learning in chemical reaction prediction, establishing a systematic understanding from protein modeling to downstream tasks (such as drug screening, mechanism of action analysis) for real drug development scenarios, enhancing the ability to apply AI methods to practical biomedicine problems, the application of Natural Language Processing (NLP) in molecular generation, the application of diffusion models in molecular generation, and through case studies (e.g., Interformer screening for high-affinity small molecules), learning how to apply these predictive technologies to enzyme engineering and drug discovery to accelerate the screening and optimization of candidate molecules.

04. Deep Learning Genomics: Deeply study and understand the basic frameworks and logic of deep learning while mastering the use of fundamental bioinformatics software (Linux, R, Python, etc.), enabling learners to better handle genomic data and uncover new knowledge beyond existing understanding. Constructing robust deep learning models to explore new research ideas and identify potential biological mechanisms, thereby better serving one’s own scientific research and exploration.

05. Machine Learning Metabolomics: 1. Familiar with the background knowledge of metabolomics and machine learning, as well as hardware and software; 2. Introduction to R language and machine learning theory and common usage; 3. Master the entire process of metabolomics from sample processing to upstream and downstream data analysis and graphing; 4. Able to reproduce figures from metabolomics-related articles in CNS and its subsidiary journals; 5. Able to analyze one's own metabolomics data flexibly and proficiently.

06.Deep Learning for Proteomics Analysis: The course provides in-depth explanations and hands-on practice on the application of deep learning in proteomics, enabling students to master the process of analyzing proteomics data using deep learning. It offers systematic learning of deep learning and proteomics theory, familiarizes students with software coding operations, and ensures proficiency in using these cutting-edge analytical tools as well as innovating deep learning algorithms to address biological and clinical disease-related issues and demands.

07. Machine Learning Microbiome Multi-omics Joint Analysis: AIDD Artificial Intelligence Drug Discovery and Design: This course allows students to understand the cutting-edge background of drug discovery, learn various common algorithms in the field of artificial intelligence, become familiar with the installation and use of toolkits, acquire certain algorithm programming skills, and be able to apply computational methods to study drug-related issues. Through extensive case studies and hands-on practice, students will develop some capacity for AIDD model construction and data analysis.

08.Application of CRISPR-Cas9 Gene Editing Technology: This course starts from a global perspective, covering the basic principles of cutting-edge tools like CRISPR-Cas9 to their practical applications in fields such as medicine and agriculture. It progresses from simple to complex, starting with fundamental principle explanations and ending with hands-on application practices. Upon completing this course, you will master the relevant principles and applications of gene editing technology. Additionally, you will learn optimization strategies for gene editing systems and how to operate commonly used biology software. Whether you are a biology student or a researcher interested in gene editing, this course will provide you with valuable knowledge and skills to help you make breakthroughs in this revolutionary field!


图片

图片

Teaching Time

图片



图片
图片

图片

01.AI Protein Design

2025.07.12-2025.07.13  (09:00-11:30--13:30-17:00)

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26-2025.07.27  (09:00-11:  30--13:30-17:00)

2025.08.02                    (09:00-11:  30--13:30-17:00


02.CADD Computer-Aided Drug Design

2025.07.12-2025.07.13  (09:00-11:30--13:30-17:00)

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26-2025.07.27  (09:00-11:  30--13:30-17:00)

2025.08.02                    (09:00-11:  30--13:30-17:00


03.AIDD Artificial Intelligence Drug Discovery Top Journal Reproduction

2025.07.15-06.07.18       (19:00--22:00)

2025.07.20-06.07.23       (19:00--22:00)

2025.07.28-06.07.29       (19:00--22:00)


04.Deep Learning Analysis of Proteomics

2025.07.12-2025.07.13  (09:00-11:30--13:30-17:00)

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26                    (09:00-11:  30--13:30-17:00)


05.Deep Learning Genomics

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26-2025.07.27  (09:00-11:  30--13:30-17:00)

2025.08.02                    (09:00-11:  30--13:30-17:00)


06.Machine Learning Multi-Omics Joint Analysis of Microbiota

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26-2025.07.27  (09:00-11:  30--13:30-17:00)

2025.08.02                    (09:00-11:  30--13:30-17:00)


07. Machine Learning Metabolomics

2025.07.14-2025.07.17  (19:00--22:00)

2025.07.21-2025.07.24  (19:00--22:00)

2025.07.29-07.30          (19:00--22:00)


08.Application of CRISPR-Cas9 Gene Editing Technology

2025.07.12-2025.07.13  (09:00-11:30--13:30-17:00)

2025.07.19-2025.07.20  (09:00-11:  30--13:30-17:00)

2025.07.26                    (09:00-11:  30--13:30-17:00



Tencent Meeting Live Streaming Classes        Live Playback Available After Class


图片

图片

Training Costs



图片

图片


Course Registration Fee:

Deep Learning Genomics, Machine Learning Metabolomics, Deep Learning Proteomics Analysis, Machine Learning Multi-Omics Joint Analysis of Microbiota, Application of CRISPR-Cas9 Gene Editing Technology

Public Price: ¥4980 per person per class (including registration fee, training fee, and material fee)

Self-funded Price: ¥4680 per person per class (including registration fee, training fee, and material fee)


AI Protein Design:

Public Price: ¥6880 per person per class (including registration fee, training fee, and material fee)

Self-funded price: ¥6,580 per person per class (including registration fee, training fee, and material fee)

AIDD and CADD:

Public Price: ¥5880 per person per class (including registration fee, training fee, and material fee)

Self-funded price: ¥5,580 per person per class (including registration fee, training fee, and material fee)


Heavyweight Discounts:

Discount 1:

Buy Two, Get One Free (Sign up for two classes and get one learning spot free, the free class can be chosen freely)

Two Classes Together: 10,880 RMB (Can attend three live courses)

Three Classes Together: 14,880 RMB (Can study four live courses)

Four classes together: 18,880 yuan (Free access to any courses held by our institution for a whole year)

Special Offer 2: 28,880 RMB (Free access to any courses hosted by this institution for two full years)

Discount 3: Early registration and payment can enjoy a 300 yuan discount (limited to fifteen participants).

Special Offer: Buy One Get One Free (Bonus Replay) (Including full course replays and lecture materials PPT)

Free Gift Course | Machine Learning Biomedical Online Replay Course

Free Gift Course | Single-Cell Spatial Transcriptomics Online Replay Course

Free Gift Course | Comparative Genomics Online Replay Course

Free Gift Course | Machine Learning Proteomics Online Replay Course


图片


Training Features and Benefits
图片






1. Course Features -- Comprehensive Coverage of Technical Applications, Principles and Processes, and Practical Examples

2. Learning Mode -- Combining theoretical knowledge with hands-on operation, enabling beginners to quickly master the skills.

3. Course Service Q&A -- The main instructor will provide professional answers to the questions you encounter in your actual work.


Teaching Method: Online live streaming via Tencent Meeting, theory+Hands-on teaching mode, where the teacher guides students step by step through operations.Starting from scratch, electronicPPTAnd TutorialsOne week before the course starts, all training software will be sent to the students in advance. If there are any questions, they can be resolved through voice activation, screen sharing, and WeChat group discussions. Students and teachers communicate with each other, as well as students with other students. After the training is completed, the teacher will continue to answer questions for a long time. The training group will not be disbanded. Past trainees have consistently given very high evaluations of the training quality and teaching methods!


Tencent Meeting Live Streaming Q&A | Step-by-Step Guidance


图片
Registration Consultation Method (Please scan the QR code below for WeChat)
图片

Contact: Teacher Ye

Registration phone: 13838281574(WeChat same number)

Email: y13838281574@163.com