Food allergens of plant origin - the relationship between
allergenic potential and biological activity

Project Update - June 2001

Task 2: Identification of structures and activities common to plant food allergens.

 

Why identify common structures and activities?

Several decision trees have been proposed to facilitate the assessment of the allergenic risks posed by novel foods, including GMOs (Figure 1). These involve:

  • Comparing the amino acid sequence of a candidate transgene or novel food protein with known food allergens to identify homologies.

  • Determining the IgE-reactivity of the protein using sera from individuals with a demonstrated reactivity to a homologous protein.

  • Confirming a negative IgE-reactivity by skin-prick testing in allergic humans

 

Whilst it was originally suggested that an IgE-reactive novel protein should then be labelled as such, many biotechnology and food companies have now adopted a stance which halts all further product development using the gene. The use of allergic individuals in such risk assessment procedures is not acceptable and hence the decision trees are in the process of being revised.

Decision tree illustrating the DBPCFC - double blind placebo controlled food challenge
Figure 1: The decision tree developed for assessing the allergenic potential of novel proteins originally proposed by Metcalfe and co-workers (1996)

One of the problems with protein sequence comparisons to identify allergens is that they will only identify close homologues. One of the tasks of Protall is engaged in is identifying structures and activities, which are common to food allergens, using plant food allergens as an example. This article describes our progress to date.

 

Plant protein allergens

There are three main types of plant protein, based on their function within the plant. These are

  • Metabolic and structural

  • Protective proteins against attack from pests and pathogens

  • Storage proteins

Classification of seed proteins on the basis of their solubility
Figure 2: Classification of seed proteins on the basis of their solubility

Many of the highly allergenic plant foods, such as peanuts, are actually derived from seeds. Seed proteins have been classified on the basis of their solubility, which does not necessarily represent molecular relationships (see Figure 2). We have focused on two main superfamilies of plant food allergens, which appear to sensitise individuals via the gastrointestinal tract namely:

Prolamin superfamiliy: families of proteins belonging to this superfamily include the storage prolamins of cereals, 2S albumins, a -amylase/trypsin inhibitors and non-specific lipid-transfer proteins. Certain proteins in all these families are allergens

Cupin superfamily: the families that belong to this superfamily include the germins and the 11S and 7S storage globulins. Food allergens are generally found in the storage globulin family, although there is an instance a member of the germin family from bell peppers also being an allergen.

The structural attributes common to plant food allergens will be illustrated with examples from the prolamin and cupin superfamilies.

 

PROLAMIN SUPERFAMILY

The dendrogram in Figure 3 shows the relationship between the various members of the prolamin superfamily, as defined by their amino acid sequences.

Dendrogram showing the relationship between albumins from a number of plant species
Figure 3: Dendrogram showing the relationship between albumins from a number of plant species

Proteins known to be albumins were retrieved from the NIH database. The translation of the nucleotide sequences and the alignment of the protein sequences were obtained using the "Vector NTI" software package that employs a ClustalW algorithm for sequence comparison. Those proteins known to be plant food allergens are shown in boxes; The barley inhibitor Hor v 1 is known to be a respiratory allergens but not as a food allergen.

 

2S albumins

These proteins have a conserved cysteine skeleton, and are generally synthesised as a single polypeptide which is cleaved to give a 3kD and 9kD subunits linked by disulphide bonds. A number of well characterised plant food allergens belong to this family including the Brazil nut (Ber e 1) and yellow mustard (Sin a 1) allergens. A model of the three dimensional structure of Sin a 1 is shown in Figure 4.The proteins are rich in a -helix and held together by a number of disulphide bonds. They can make up a large proportion of seed protein, appear to be stable to proteolysis and can bind lipids.

The three-dimensional homology model for the 2S albumin allergen from yellow mustard Sin a 1
Figure 4: The three-dimensional homology model for the 2S albumin allergen from yellow mustard Sin a 1

 

Non-specific lipid transfer proteins (LTPs).

The LTP family is made up of low molecular weight (7-9kDa) monomeric proteins, which are very basic. They are able to catalyse the transfer of lipids between vesicles and membranes in vitro and there is increasing evidence that their role in vivo may be in cutin biosynthesis. The proteins are made up of a bundle of four a -helices with a lipid-binding cavity in the centre (see Figure 5). There are indications that bound lipid increases resistance to proteolysis; these proteins also survive thermal treatments, and can refold to their native structure on cooling.

A ribbon diagram of a model of the three-dimensional structure of the non-specific lipid transfer protein from peach. (Pru p 3)
Figure 5: A ribbon diagram of a model of the three-dimensional structure of the non-specific lipid transfer protein from peach. (Pru p 3)

The initial model was calculated using Swiss-Model (Guex and Peitsch, 1997) and improved manually by including the disulphide bridges seen in the homologous structures. The ribbon is coloured from blue at the N-terminus to red at the C-terminus using ICMlite Version 2.7 (Abagyan et al., 1994).

 

Common properties of the 'α-class' of plant proteins

On the basis of the information on the 2S albumins and LTPs it can be seen that they share a number of common structural features:

 

  • Generally small with Mr of less than 30,000 daltons

  • Contain a high proportion of cysteine residues and hence are highly disulphide bonded

  • Are compact and hence resistant to attack by proteolytic enzymes

  • Despite their compactness they contain a number of flexible loops, which maybe important antibody recognition sites

  • Many are able to bind lipid which can increase resistance to proteolysis

  • They are abundant in the seed. However, some allergenic fruit LTPs are not abundant but are potent allergens perhaps because of their exceptional stability to proteolysis.

Future discussions in the project will focus modifying this list to accommodate information on other classes of plant allergens such as the globulin storage proteins, which are major allergens in peanuts and soya. Identifying common properties of allergens will pave the way for using bioinfomatics more effectively for predicting the allergenic potential of novel proteins in the first steps of the decision tree.

Contributors: ENC Mills1, MJC Alcocer2, J Jenkins1, D Marion3, R I Monsalve4, PR Shewry5

1 Institute of Food Research, Norwich Laboratory, Norwich Research Park, Colney, Norwich NR4 7UA, UK. (Partner 1)
2
School of Biological Sciences, University of Nottingham, Nottingham, NG7 2RD, UK.
3
INRA-Laboratoire de Biochimie et Technologies des Proteines, BP 71627, Rue de la Geraudiere, Nantes 44316, CEDEX 03, France.(Partner 4)
4 Departamento de Bioquimica y Biologia Molecular, Faculdad de Quimica, Universidad Complutense, Madrid 28040, Spain.(Partner 19)
5 IACR-Long Ashton Research Station, Department of Agricultural Sciences, University of Bristol, Long Ashton, Bristol, BS18 9AF, UK.(Partner 3)

 

References

Abagyan, R.A., Totrov, M.M., and Kuznetsov, D.N. (1994). ICM - a new method for protein modeling and design. Applications to docking and structure prediction from the distorted native conformation. J.Comp.Chem. 15, 488-506.

Guex, N. and Peitsch, M. C. (1997) SWISS-MODEL and the Swiss-PdbViewer: An environment for comparative protein modelling. Electrophoresis 18:2714-2723.

Metcalfe, D.D., Astwood, J.D., Townsend, R., Sampson, H.A., Taylor, S.L., Fuchs R.L. (1996) Assessment of the allergenic potential of foods derived from genetically engineered crop plants. Crit. Rev. Food Sci. Nutr. 36, S165-S186.

 

THE CUPINS SUPERFAMILY

 

What are cupins?

The cupin superfamily is comprises proteins possessing a common b -barrel structure thought to originate in a prokaryotic ancestor. The term ‘cupin' was derived from the Greek for ‘barrel', and relates to the b -sheet barrel-like structure characteristic of these proteins. This motif is found as a single domain in fungal spherulins, fern sporulins and the plant germins and oxalate oxidases (Figure 6). There are also two domain cupins, the globular storage proteins of plants termed legumins (11S) and euvicilins (7S).

The evolution of the cupin superfamily

Figure 6: The evolution of the cupin superfamily

 

Most of the food allergens belonging to the cupin superfamily that have been characterised are primarily globulin storage proteins. This includes the

Legumins (11S): Soya glycinin, peanut arachin (Ara h)

Vicilins (7S): Soya b -conglycinin, peanut conarachin (Ara h 1), walnut (Jug r1)

 

Structure of Seed Storage globulins

The 11S globulins are hexameric heteroligomeric proteins of Mr ~ 360,000, each subunit comprising an acidic 30-40,000 dalton polypeptide disulphide linked to a 20,000 dalton basic polypepide (Figure 2). Subunits are generally coded by multiple genes and are synthesised as a single precursor protein, which is postranslationally cleaved at a conserved protease site (Gly-Leu-Glu-Glu-Thr). The 7S proteins are trimeric, of Mr ~ 180,000, made up of N-glycosylated 50,000 dalton subunits in some plant species; in others such as soya and peanut subunits are present which have an 170 residue N-terminal insert (Figure 7).

Figure 7: Schematic diagram of the legumin-like 11S and euvicilin-like 7S globulin subunits.

Thermostability of globulins

The globulins are thermostable proteins, showing only partial loss of secondary and tertiary structure at temperatures between 75-94ºC. Thus differential scanning calorimetry (DSC) of the allergenic soya globulin b -conglycinin shows the main thermal transition occurring at around 75° C. Similarly far-UV CD analysis of the protein shows that whilst there is some loss of secondary structure following heating a substantial amount of the b -structure remains. These proteins also form large aggregates, many millions of Daltons in size, following heating, which at high protein concentrations (5-10%) form heat-set gels, an important property widely exploited by the food industry in adding soya proteins to foods.

 

Proteolytic intermediates

On partial proteolysis the globulins retain much of their quaternary structure, forming large proteolytic intermediates, which are held together by non-covalent interactions. Thus peanut 7S globulin, Ara h 1, retains both its trimeric structure and IgE reactivity following pepsin digestion. Soya glycinin forms a stable intermediate of Mr ~ 280,000 on trypsinolysis known as glycinin T and results from clipping of the acidic subunits to form a 13,000 and a 16,000 molecular weight fragments.

 

Conclusions

Plant food allergens, which belong to the 11S and 7S globulin families of the cupin superfamily share a number of common structural features which may be important in predisposing them to becoming potent allergens. These include

  • Thermostable, able to form large aggregates and hence able to retain multiple IgE-binding sites following cooking

  • Ability to form large, stable proteolytic intermediates, which also retain sufficient size to interact with the immune system.

 

If you are interested in exploring plant protein families in greater depth try visiting these websites:

 

NAVIGATION:

Funded through the EU FAIR Programme

home | about food allergy | PROTALL database | links | newsletters | project summary | contact us