Close

What Is Antibody De Novo Sequencing?

Background Definition Workflow Data Analysis Data Verification Application What Can Creative Biolabs Do

Background of De Novo Sequencing

Proteins that make up biopharmaceuticals are bio-macromolecules that sustain life activities. They are composed of one or more peptide chains, each of which is connected by a certain number of amino acids in a certain order through peptide bonds. The amino acid sequence of a protein constitutes its primary structure, and protein sequencing is the process of determining this sequence. Protein sequencing can provide very useful information, including: identifying proteins; providing information on post-translational modifications (PTMs), which are important for the structure and function of proteins; improving antibody drug development, etc. Therefore, protein sequencing is an important technology in biomedicine.

Currently, the protein sequencing methods on the market can be mainly divided into three types: Edman degradation sequencing, PCR-based protein sequencing, and mass spectrometry-based protein sequencing.

Our optional protein sequencing services:

Definition of De Novo Sequencing

Protein de novo sequencing technology is the process of analyzing the amino acid sequence of a protein by using tandem mass spectrometry (MS/MS) to obtain ions and calculate the mass of amino acid residues on the peptide chain based on the mass difference between two fragment ions. The biggest advantage of this technology is that it does not need to rely on any database and can analyze the sequences of known or unknown proteins.

Workflow of De Novo Sequencing

The samples were hydrolyzed by multiple enzymes, and the peptides were detected by mass spectrometry. The mass spectra of the collected peptides were analyzed by a de novo algorithm, and the corresponding peptide sequences were obtained. After the comparison and splicing of peptides, the amino acid sequences of the proteins were obtained from the N-terminal to C-terminal of the protein. At present, the platform of protein de novo sequencing can be established by liquid chromatography coupled with high resolution mass spectrometer, which ensures the sensitivity of the identification of low abundance peptide fragments. At the same time, the fragmentation mode of HCD and ETD binding is adopted to distinguish the isomers of leucine and isoleucine.

Fig. 1 Overview of novel antibody discovery approach. (Adrian Guthals, 2017)Fig. 1 Overview of novel antibody discovery approach. (Adrian Guthals, 2017)

Our de novo antibody sequencing services:

Data Analysis

Peptide coverage: The protein samples were digested based on specific and non-specific proteases respectively to achieve 100% protein coverage, produce rich and highly overlapping peptide types, and finally get the high-quality data needed for sequencing.

Fig. 2 General workflow for sequencing of novel proteins via PEAKS based on an electropherogram. (Jianhui Cheng, 2020)Fig. 2 General workflow for sequencing of novel proteins via PEAKS based on an electropherogram. (Jianhui Cheng, 2020)

Peptide splicing: Using HCD will produce a series of b/y ions, and the sequence of peptides can be deduced according to these ions' information. However, because many kinds of endonuclease digestion will produce a lot of repetitive peptides, the peptides should be screened. The first step is to select the peptide with high kurtosis, which is actually to judge the fragmentation of the secondary ions of the peptide. However, when the fragment ions are missing, the database search may still be successful. Therefore, in order to improve the accuracy of ab initio sequencing, the method may be combined with different ion fragmentation techniques to produce multiple spectra.

Discrimination of leucine & isoleucine: In the past, Leu and Ile were often considered indistinguishable by MS because their molecular masses were exactly the same. Incorrectly positioned Leu/Ile residues within variable domains, particularly within the CDRs (complementarity determining regions) of an antibody, can result in substantial loss of antigen-binding affinity and antibody specificity. Leucine and isoleucine can obtain c/z ions under ETD fragmentation conditions, and then assist HCD fragmentation to remove some groups based on z ions to form w ions. Leucine removes the propyl group, forming a 43Da loss. Isoleucine removes the ethyl group, forming a 29Da loss. This allows precise discrimination between leucine and isoleucine.

Data Verification

Compared with database search methods, the main limitation of current ab initio sequencing software is the lack of automatic statistical verification. If there is no such automatic verification method, you still need to check the peptide map matching manually to determine which ab initio sequencing results to take seriously.

Molecular weight verification: The basic modification information (oxidation, deamidation, N-terminal cyclization, C-terminal K deletion, glycosylation) can be obtained from the protein molecular weight, so as to verify the correctness of the protein sequence obtained by de novo sequencing.

Protein sequence verification: A new database was established for the sequences obtained by de novo sequencing, and the coverage analysis of the collected mass spectrometry data was performed again, and an almost complete matching result was obtained.

Fig. 3 The matched heavy chain contigs for infliximab after 1st homologous database search. (Jianhui Cheng, 2020)Fig. 3 The matched heavy chain contigs for infliximab after 1st homologous database search. (Jianhui Cheng, 2020)

Applications of De Novo Sequencing

Protein sequencing technology can detect biomarkers of disease and provide a new way to study cancer and other problems. Proteins produced by cells can act as unique biomarkers in many diseases, such as cancer, Alzheimer's disease, heart failure, and diabetes, and more accurate detection of these biomarkers can help researchers understand the causes of the disease. It can also provide patients with earlier and more accurate diagnosis.

De novo sequencing is one of the key technologies of mass spectrometry-based proteomics, especially for new proteins whose genomic information is usually limited or unavailable. Therefore, the complete de novo sequencing of unknown protein sequences is of great significance. The use of antibody de novo sequencing technology can assist researchers in carrying out antibody drug research and development related scientific research projects, but it can also greatly shorten the antibody drug development cycle and quality control of the production process, and assist in the implementation and landing of the project.

What Can Creative Biolabs Do?

Creative Biolabs de novo antibody sequencing workflow provides complete antibody sequences with unparalleled accuracy and reliability. The antibody samples were digested with multiple enzymes and the fragments were measured by LC-MS/MS to derive the de novo peptide sequence. These fragments are then analyzed through Creative Biolabs' state-of-the-art bioinformatics algorithm suite to assemble any full-length antibody sequence of any isotype. Our unique de novo sequencing workflow enables us to identify complete antibody sequences with hundreds of overlapping peptides without the need for a database or sequence library as a reference or homology. Creative Biolabs has overcome the key technical difficulties of antibody de novo sequencing, realized the accurate identification of leucine and isoleucine, and promoted the breakthrough scientific development of protein/antibody sequencing in the research, diagnosis, and treatment industries. Our services consist of variable region, variable plus leader region, as well as full-length heavy- and light-chain antibody sequencing for all species, isotypes, and allotypes. The polyvalent form of purified monoclonal antibody can cover 100% of the desired area and has excellent accuracy for sequencing. Creative Biolabs has a complete customized service chain, from antibody design and cloning to de novo sequencing and analysis, as well as in vitro optimization to meet any customer needs.

References
  1. Guthals A, Gan Y, Murray L, Chen Y, Stinson J, Nakamura G, Lill JR, Sandoval W, Bandeira N. De Novo MS/MS Sequencing of Native Human Antibodies. J Proteome Res. 2017 Jan 6;16(1):45-54. doi: 10.1021/acs.jproteome.6b00608. Epub 2016 Nov 2.
  2. Cheng J, Wang L, Rive CM, Holt RA, Morin GB, Chen DDY. Complementary Methods for De Novo Monoclonal Antibody Sequencing to Achieve Complete Sequence Coverage. J Proteome Res. 2020 Jul 2;19(7):2700-2707. doi: 10.1021/acs.jproteome.0c00223. Epub 2020 May 7.
  3. Ma B, Johnson R. De novo sequencing and homology searching. Mol Cell Proteomics. 2012 Feb;11(2):O111.014902. doi: 10.1074/mcp.O111.014902. Epub 2011 Nov 16.
  4. Tran NH, Rahman MZ, He L, Xin L, Shan B, Li M. Complete De Novo Assembly of Monoclonal Antibody Sequences. Sci Rep. 2016 Aug 26;6:31730. doi: 10.1038/srep31730.
  5. Xiao Y, Vecchi MM, Wen D. Distinguishing between Leucine and Isoleucine by Integrated LC-MS Analysis Using an Orbitrap Fusion Mass Spectrometer. Anal Chem. 2016 Nov 1;88(21):10757-10766. doi: 10.1021/acs.analchem.6b03409. Epub 2016 Oct 14.

All listed services and products are For Research Use Only. Do Not use in any diagnostic or therapeutic applications.

Online Inquiry
CONTACT US
USA:
Europe:
Germany:
Call us at:
USA:
UK:
Germany:
Fax:
Email:
Our customer service representatives are available 24 hours a day, 7 days a week. Contact Us
© 2024 Creative Biolabs. | Contact Us