By Philipp E Geyer, Lesca M Holdt, Daniel Teupser and Matthias Mann

3 downloads 148 Views 4MB Size Report
Illustrations: Molecular S ystems Biology (U ... Extracellular matrix. Nucleus Kinase .... Biology, Matthias Mann and co
clinical proteomics By Philipp E Geyer, Lesca M Holdt, Daniel Teupser and Matthias Mann

the plasma proteome

Ch y Ac lom ut ic e r Bl ph on oo a Co d c se m oa p Ch lem gul ol e at Se est nt p ion rin er a An pr ol m thw o ay ti t e Ex bac eas tab o tr te e i li ac ri nh sm ib Cy ellu al ito to la r Ce pl rm a N ll m sm atr u e a ix N cleu mb uc s ra ne Ki leo n Ri ase tide b -b Cy on in to ucl di ki eo ng ne p ro te in

Functions

Proteins APOA1 FGB AFM C2 F12 ADIPOQ CRP SEPP1 LBP

Concentration [pg/ml]

109 108 107 106 105 104

Functional plasma proteins

CEACAM1 GOT1 PARK7

Tissue leakage proteins

103

10 10

PNLIP

VEGFA IL1B

Signal proteins

101

IL6 0 12 0

0 11 00

0

10 0

90

0

0

80

0

70

60

0

0

0

0

50

40

30

20

0

10 0

100

Protein rank

In total 77% of all clinical decisions in patients are based on laboratory testing. The numbers are based on 9 million tests performed in the year 2016 at the Institute of Laboratory Medicine, University Hospital Munich. The second figure illustrates the distribution of laboratory tests based on frequency of request.

Clinical decisions made… …without lab testing

23%

finger prick and blood composition

105 104 103 102 101

0

5

10

15

20

25

30

35

40

45

50

55

60

65

70

Blood is a suspension, consisting of a cellular (~44%) and a liquid component (~56%). Its cellular portion can be classified into thrombocytes, leucocytes and erythrocytes. The straw-colored liquid portion of blood is called plasma and is a mixture of water, small molecules like electrolytes, substrates or vitamins and an extensive diversity of all human proteins that are encoded by the 20,000 human genes.

Keyword rank

Drugs 1.8% Specific antibodies 1.5%

Distribution of laboratory tests

Nucleic acids 0.5%

Cells

…with lab testing

16.9%

77%

Others 2.1%

44 % Blood cells

Proteins and enzymes

5× 106 Erythrocytes

41.9%

potential for new biomarkers

100

vs. Numbers of proteins

DISCOVERY Shotgun proteomics

Rectangular strategy Phases

Numbers of samples

DISCOVERY

10s

100s–1000s

70

Numbers of proteins

Not changed

Shotgun proteomics

Study specific Cohort 1

VERIFICATION

Targeted proteomics

10s–100s

Validated biomarker/panel

10s

Biomarker [%]

Phases

Numbers of samples

The blue area illustrates the density of biomarkers as a function of increasing depth of the plasma proteome. Within the 300 most abundant proteins, 23% are already known biomarkers. The top of the yellow region extrapolates this proportion to the remainder of the plasma proteome. If the portion of biomarkers remained as high as it is in the 300 most abundant proteins, there are at least 233 potential biomarkers to be discovered (yellow area).

80

60 50

potential 280 Biomarkers 23%

40 30

Study specific

23 20

VALIDATION

Immunoassays 100s –1000s In the classical ‘Triangular strategy’ a relatively small number of cases and controls are measured at great depth by hypothesis-free discovery proteomics, ideally leading to the quantification of thousands of proteins (top layer in the panel). This may yield dozens of differentially expressed candidates that are screened by targeted proteomics methods in cohorts of moderate size (middle layer). Finally, for one or a few of the remaining candidates, immunoassays are developed, which are then validated in large cohorts and applied in the clinic (bottom layer).

Shotgun proteomics

Not changed

10

Cohort 2 100s–1000s

0 100s–1000s

In the ‘Rectangular strategy’ a large cohort is investigated in the discovery phase at the greatest proteome coverage possible. In the validation phase, another cohort is analyzed to confirm the biomarker candidates, utilizingthe same technology and similar cohort size. Both cohorts can be analyzed in parallel, but only the proteins that are significantly different in both studies (orange as opposed to green circle in the right-hand part of the panel) are validated biomarkers.

Sponsored by

0

300

600

900

1200

Time points

47 Biomarkers ~4%

www.thermofisher.com

Knowledge base Risk

Disease

Plasma proteome profiling of diverse disease, risk, treatment, lifestyle or other relevant alterations will over time accrue a knowledge base that connects plasma protein changes to perturbations in a general manner. In

di

vi

du

al

Treatment

Lifestyle

Proteins

s

The plasma proteome profile of a given individual can then be deconvoluted using the information and algorithms associated with the knowledge base.

Proteins

1500

Abundance rank

70 Biomarkers ~23%

Time

molecular phenotyping

VALIDATION

1–10

Plasma proteome profiling can be further applied to compare case-control studies and for the investigation of longitudinal protein trajectories.

Proteins

biomarker discovery

90

Longitudinal trajectories

plasma proteome profiling

5– 10× 103 Leucocytes

35.3%

Effect size

As the protein concentration in plasma is so high (50 µg/µl), a simple finger prick delivering just 1µl of plasma is enough to analyze the human plasma proteome.

2 – 4×105 Thrombocytes

Small molecules

Triangular strategy

controls

Proteins

© 2018 EMBO Press | Coordination by P. Geyer, M. Mann, M. Polychronidou | Concept by P.E. Geyer, L.M. Holdt, D. Teupser and M. Mann | Illustrations: Molecular Systems Biology (U. Mackensen)

today’s blood tests

7% Proteins

106

100

Biomarker candidates

cases

107

Bioinformatics keyword annotation of the plasma proteome database. The blue boxplots with the 10 – 90% whiskers visualize the range of diverse proteins contributing to distinct functions.

AFP

3% Small molecules

108

www.plasmaproteomedatabase.org Nanjappa et al, 2014

IL1RAP TNNI3

ACE

102

1)

109

Cases versus controls studies

p value

1010

Concentration range of the plasma proteome with the gene names of several illustrative blood proteins (red dots). Concentrations are in serum or plasma and measured with diverse methods as retrieved from the plasma proteome database in May 2017. 1)

90% Water

Proteins

ALB

Concentration [pg/ml]

1011

applications

56% Plasma

+

+

+

+

+ Individual

www.evosep.com

Plasma proteome profile

Modified from: Geyer PE, Holdt LM, Teupser D, Mann M (2017). Mol Syst Biol. 13(9): 942

msb.embopress.org /content/13/9/942

+

clinical proteomics Poster

p in ost si er de

Biomarker discovery by plasma proteomics A large number of diagnostic tests rely on measuring the levels of protein biomarkers in the blood. Up to now, the most widespread tests typically involve immunoassays or enzymatic assays, which measure only a single marker at a time. In stark contrast to the aforementioned assays, mass spectrometry (MS)-based proteomics approaches enable multiplexed measurements and unbiased analyses of plasma samples, thus removing the need to make assumptions on the nature and number of measured biomarkers. Moreover, since mass spectrometers analyze protein amounts typically in the low microgram range a single drop of blood is sufficient for proteome analyses. Despite these advantages and the notable improvements in the sensitivity and dynamic range of MS-based proteomics technologies over the recent years, such technologies have not yet been routinely incorporated in clinical diagnostics workflows. In their recent Review published in Molecular Systems Biology, Matthias Mann and colleagues (Geyer et al, 2017) provide an overview of the use of MS-based proteomics for clinical applications. The classical paradigm in plasma biomarker research is a “triangular approach”, in which discovery proteomics is applied to a relatively small number of samples. The identified protein biomarker candidates are then screened in smaller cohorts and a few selected candidates are finally validated by immunoassays in larger cohorts before their application to the clinic. In contrast to this workflow, the “rectangular strategy” measures as many proteins as possible for a large number of individuals and conditions. In this case, both biomarker discovery and validation are performed in large cohorts and at high proteome coverage. This strategy clearly has the potential for more precise and informed diagnostics and is promising for building a knowledge base linking changes in plasma

proteome profiles to diseases, medical treatments and lifestyles. This approach is nicely demonstrated in the study by Geyer et al, 2016, in which longitudinal plasma proteome profiling of obese individuals during weight loss and maintenance, revealed widespread effects of weight loss on the proteome. This poster illustrates these key aspects discussed in the Review Article by Philipp Geyer, Matthias Mann and colleagues. It highlights that MS-based approaches are not only promising for discovering new biomarkers and biomarker panels, but also offer new opportunities for developing diagnostic strategies based on a combination of plasma proteome profiles with clinical data and information on the patient’s history. Maria Polychronidou, PhD Senior Editor | Molecular Systems Biology Geyer PE, Lesca M Holdt LM, Teupser D, Mann M (2017) Revisiting biomarker discovery by plasma proteomics Mol Syst Biol (2017) 13: 942 Geyer PE, Wewer Albrechtsen NJ, Tyanova S, Grassl N, Iepsen EW, Lundgren J, Madsbad S, Holst JJ, Torekov SS, Mann M (2016) Proteomics reveals the effects of sustained weight loss on the human plasma proteome Mol Syst Biol (2016) 12: 901

SPONSORED BY

www.thermofisher.com

www.evosep.com