Review Article |
Corresponding author: Ekaterina Kozuharova ( ina_kozuharova@yahoo.co.uk ) Academic editor: Plamen Peikov
© 2024 Ekaterina Kozuharova, Vasil Simeonov, Christina Stoycheva, Niko Benbassat, Daniela Batovska.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Kozuharova E, Simeonov V, Stoycheva C, Benbassat N, Batovska D (2024) Lavender essential oils–hidden relationships between the samples of origin. Pharmacia 71: 1-10. https://doi.org/10.3897/pharmacia.71.e127293
|
Lavender essential oil is an economically important ingredient in perfumery, the food industry, and pharmacy. There is notable diversity in the composition of lavender essential oils. The reasons are the high genetic diversity of lavender cultivars and the variety of ecological specifics in the regions of cultivation. The aim of this research is to check which lavender essential oil variety is best regarding the content of the most important components through comparative statistical tests. We created a data set of 88 lavender essential oil samples from 16 countries. The multivariate statistics (hierarchical and non-hierarchical clustering) and factor analysis reveal hidden relationships between the objects of the study (samples) or between the variables characterizing the objects (chemical descriptors–16 components). The results are discussed in detail. All samples from Bulgaria, together with a few of the Italian, French, Greek, Indian, and Chinese samples, fall into one cluster with the standard maximums.
Graphical abstract:
Lavandula angustifolia essential oil, multivariate statistics, hierarchical and non-hierarchical clustering (K-means clustering), factor analysis
Lavender essential oil is among the top five in aromatherapy (
Lavender essential oil can help against insomnia and anxiety. It has painkiller, anti-inflammatory, anti-allergic, and anti-microbial activity. Additionally, lavender essential oil demonstrates insect repellent and acaricidal properties. Linalool and linalyl acetate are the main ingredients of lavender essential oil. Their quantity characterizes their quality. Their quantities vary. Linalool is responsible for the sedative and painkiller effects, as well as the anti-inflammatory activity of lavender essential oil. It has antioxidant and antitumor effects, as well as antimicrobial activity. Linalyl acetate has an anti-inflammatory effect. It can prevent hypertension-related ischemic injury and the development of type 2 diabetes mellitus. Although these two components have certain pharmacological effects, the essential oil as a whole often possesses higher efficiency due to the synergism of all the constituents (
Lavender (Lavandula angustifolia subsp. angustifolia (L. spica L. var. alpha, L. officinalis Chaix, L. fragrans Jord., L. vera DC) is one of the over 30 species of the genus Lavandula L. (family Lamiaceae). In terms of its general morphology, this genus is a rather mixed and divergent group of shrubs, woody-based perennials, or short-lived herbs, often aromatic, glabrous, or with a variable indumentum. L. angustifolia is a small woody shrub. The leaves are simple and linear-lanceolate in shape. They are gray tomentose when young, becoming greener with age. The inflorescence stalk is usually unbranched and bears a compact spike. The spike consists of cymes, and each cyme has many flowers (3–)5–7(–9). The diagnostic feature is the shape of the bracts. Those of L. angustifolia are broadly ovate-rhombic to obovate, unlike L. latifolia L., which has linear bracts. L. angustifolia is native to SW and South-Central Europe (Italy, France, and Spain), and it grows in the mountains (Upson, 2002). Only three members of the genus Lavandula are industrially cultivated for the production of essential oils: lavender (L. angustifolia), spike lavender (L. latifolia), and lavandin, a sterile hybrid developed by crossing L. angustifolia × L. latifolia. The essential oil of L. angustifolia is more expensive than that of other lavender species on the market because of its high quality and the plant’s low yield of essential oil (
Lavender is cultivated worldwide in a number of countries, and the leading lavender oil producers are Bulgaria, France, the UK, China, India, Spain, and others (
The aim of this research is to check which lavender essential oil variety is best regarding the content of the most important components through comparative statistical tests.
We accessed Google Scholar, Web of Science, and PubMed to identify publications for the period 1900–2022, with the search string “Lavandula angustifolia + essential oil”, “linalool”, “pharmacological effects”, etc. Also, we included in the search string “Bulgaria”, “China”, “India”, and “France”–the countries that are recognized as main world producers of lavender essential oil, as well as “Greece”, “Ukraine”, “Poland”, and other neighboring countries. Following the PRISMA 2000 guidelines, the records were assessed for eligibility, and the inappropriate ones were excluded. We selected 32 publications that presented the full components’ lists as a result of GC or GC-MS analyses of lavender (L. angustifolia) essential oil obtained by hydrodistillation (either industrially or using Clevenger apparatus for 40 minutes to 4 hours) from various counties and varieties, under different cultivation. Based on published results of various samples (each publication presented analyses of 1 to 19 samples) was created a data set in Excel (
Contents of the main lavender essential oil components of the standards (ISO 3515:2002,
Components | ISO and/or European Pharmacopoeia (10th edition) | |
---|---|---|
Min [%] | Max [%] | |
Camphor | 1.20 or 1.50 | |
1,8-Cineole | 2.50 or 3.00 | |
1,8-Cineole + Phellandrene | ||
Phellandrene | 1.00 | |
D-Limonene | 1.00 | |
Z-β-Ocimene | 1.00 | 10.00 |
E-β-Ocimene | 0.50 | 6.00 |
Lavandulol | 0.1 | 3.00 |
Lavandulyl acetate | 0.20 | 8.00 |
Linalool | 20.00 | 45.00 |
Linalyl acetate | 25.00 | 47.00 |
3-Octanone | 0.10 | 5.00 |
Terpinen-4-ol | 0.10 | 8.00 |
α-Terpineol | 2.00 |
Chemometric methods were used to mine the data set of dimensions [90 × 16] (88 experimentally tested samples of lavender essential oils along with minimum and maximum values of the standards; see above) and subject it to multivariate statistical analysis. The lavender essential oil samples are from different geographical locations. Ukraine (UA, n=30), Bulgaria (BG, n=14), Poland (PL, n=10), France (FR, n=7), India (IN, n=6), Greece (GR, n=6), Italy (IT, n=3), Australia (AU, n=2), China (CN, n=3), Jordan (JO, n=2), Iran (IR, n=1), Hungary (HU, n=1), Lithuania (LT, n=1), Moldova (MD, n=1), United Kingdom (UK, n=1), and Bosnia and Herzegovina (BA, n=1) were analyzed for the content of 16 important chemical components.
The major goal of the chemometric study was to reveal hidden relationships between the objects of the study (partitioning with respect to the geographical location) and between the variables characterizing the objects (chemical descriptors). An additional task to the partitioning procedure was the determination of the specific descriptors for each group of similarity found in the partitioning procedure.
The multivariate statistical methods used for the data mining were hierarchical and non-hierarchical clustering (K-means clustering) and, additionally, factor analysis. All of the methods are well-known and described fully in the literature (
The first step of our analysis was to build the hierarchical dendrograms for linkage of the objects (Fig.
Members of Cluster Number 1 and Distances from Respective Cluster Center Cluster contains 6 variables | |
Variables | Distance |
1,8-Cineole + Phelandrene | 0,786206 |
D-Limonene | 0,816296 |
Z-β-Ocimene | 0,802858 |
Linalyl acetate | 0,746208 |
3-Octanone | 0,744208 |
total % | 0,832584 |
Members of Cluster Number 2 and Distances from Respective Cluster Center Cluster contains 2 variables | |
Variables | Distance |
Camphor | 0,225288 |
1,8-Cineole | 0,225288 |
Members of Cluster Number 3 and Distances from Respective Cluster Center Cluster contains 2 variables | |
Variables | Distance |
1,8-Cineol + Z-β-Ocimene | 0,609882 |
E-β-Ocimene | 0,609882 |
Members of Cluster Number 4 and Distances from Respective Cluster Center Cluster contains 6 variables | |
Variables | Distance |
1,8-Cineol + D-Limonene | 0,818032 |
Lavandulol | 0,742492 |
Lavandulyl acetate | 0,906411 |
Linalool | 0,981822 |
Terpinen-4-ol | 0,821831 |
α-Terpineol | 0,978135 |
Specific descriptors of the four clusters. Legend: For convenience, the variables are partially numbered, e.g., camphor is number 1, D-limonene is number 5, etc.
1 Camphor | 1,8-Cineole | 1,8-Cineole + Phelandrene | 1,8-Cineol + D-Limonene |
5 D-Limonene | 1,8-Cineol + Z-β-Ocimene | Z-β-Ocimene | E-β-Ocimene |
9 Lavandulol | Lavandulyl acetate | Linalool | Linalyl acetate |
13 3-Octanone | Terpinen-4-ol | α-Terpineol | Total % |
As already mentioned above, a priory hypothesis, along with expert opinion, required partitioning of the variables and the objects into 4 clusters for each category. The members of each cluster of objects and variables are presented in Tables
The four clusters of variables (Fig.
It is of substantial interest to find out which descriptors are specific to each one of the identified clusters. The plot of averages for each variable for each identified cluster is presented in Fig.
Specific descriptors (highest and lowest levels) for all identified clusters.
Clusters | Highest levels of | Lowest levels of |
---|---|---|
1 | 1,8-Cineol+z-β-Ocimene | Terpineol |
1,8-Cineol+Phelandrene | ||
E-β-Ocimene | ||
Linalyl acetatete | ||
3- Octanone | ||
Total | ||
2 | – | E-β-Ocimene |
1,8-Cineol+Z-β-Ocimene | ||
3 | Lavandulol | Linalyl acetatete |
Terpinen-4-ol, | Total | |
α-Terpineol | ||
4 | Camphor | Linalool |
1,8-Cineole |
The four clusters of objects (locations of origin of the lavender essential oil samples and minimum and maximum values of the standards, Fig.
Members of Cluster Number 1 and Distances from Respective Cluster Center Cluster contains 32 cases | |
Country of origin of the essential oils | Distance |
PL-2 | 0,803600 |
BG-5-1 | 1,063737 |
FR-5-3 | 1,131704 |
BA-8 | 0,841112 |
AU-9 | 1,099822 |
MD-10-2 | 0,492620 |
UA-10-3 | 0,498780 |
BG-10-4 | 0,512807 |
AU-10-5 | 0,876861 |
IT-14 | 0,771338 |
BG-16-1 | 1,073207 |
BG-16-2 | 0,590999 |
BG-16-3 | 1,420205 |
BG-16-4 | 0,786706 |
BG-16-5 | 0,865210 |
BG-16-6 | 0,671406 |
BG-16-7 | 0,899521 |
BG-17-1 | 0,572984 |
BG-17-2 | 0,546892 |
BG-17-3 | 0,535056 |
GR-18-1 | 1,034623 |
UK-21-6 | 0,882925 |
GR-24 | 0,778940 |
UA-28-3 | 0,779688 |
BG-29 | 0,537594 |
CN-31 | 0,888515 |
FR-33-1 | 1,337309 |
FR-33-3 | 1,984810 |
IN-35-1 | 0,728521 |
IN-35-3 | 0,702783 |
BG-36-1 | 0,733750 |
standart max | 1,656808 |
Members of Cluster Number 2 and Distances from Respective Cluster Center Cluster contains 41 cases | |
Country of origin of the essential oils | Distance |
Pl-1-2 | 0,740580 |
CN-4 | 0,985789 |
IT-5-2 | 0,709475 |
PL-5-4 | 0,741186 |
IT-6 | 0,694345 |
IN-7 | 0,594857 |
FR-10-1 | 0,449550 |
LT-12 | 0,707037 |
FR-13 | 0,661259 |
HU-21-3 | 0,962680 |
UK-21-4 | 0,691624 |
FR-21-5 | 0,674496 |
IN-26 | 0,547958 |
UA-27-1 | 0,533303 |
UA-27-2 | 0,513843 |
UA-27-3 | 0,782405 |
UA-27-4 | 0,714650 |
UA-27-5 | 2,019227 |
UA-27-6 | 0,368959 |
UA-27-7 | 0,656498 |
UA-27-8 | 0,708169 |
UA-28-1 | 0,669752 |
UA-28-2 | 0,421578 |
UA-28-4 | 0,564641 |
UA-28-5 | 0,474619 |
UA-28-6 | 0,717241 |
UA-28-7 | 0,278719 |
UA-28-8 | 0,257587 |
UA-28-9 | 0,270680 |
UA-28-10 | 0,227829 |
UA-28-11 | 0,579047 |
UA-28-12 | 0,428956 |
UA-28-13 | 0,570951 |
UA-28-15 | 0,511818 |
UA-28-16 | 0,387688 |
UA-28-17 | 0,956365 |
CN-30 | 0,480355 |
FR-33-2 | 0,738944 |
IN-34 | 0,744650 |
IN-35-2 | 1,285159 |
standart min | 0,641462 |
Members of Cluster Number 3 and Distances from Respective Cluster Center Cluster contains 10 cases | |
Country of origin of the essential oils | Distance |
PL-1-1 | 0,979280 |
PL-1-3 | 0,800925 |
PL-22-1 | 0,601253 |
PL-22-2 | 0,532358 |
PL-22-3 | 0,803043 |
PL-22-4 | 0,681577 |
PL-22-5 | 0,591797 |
UA-28-14 | 1,212039 |
UA-28-18 | 1,398853 |
UA-28-19 | 0,899213 |
Members of Cluster Number 4 and Distances from Respective Cluster Center Cluster contains 7 cases | |
Country of origin of the essential oils | Distance |
GR-15 | 1,166784 |
GR-18-2 | 0,494226 |
GR-19-1 | 0,584855 |
GR-19-2 | 0,750952 |
IR-23 | 0,876077 |
JO-25-1 | 0,521416 |
JO-25-2 | 0,511501 |
Cluster Number 1 contains 32 members (Table
Cluster Number 2 includes 41 members and is the biggest one (Table
Analogically, Cluster Number 3 consists of 10 members (Table
Finally, Cluster Number 4 is the smallest one with only seven members (Table
Factor analysis (often named principal components analysis, or PCA) is a typical projection method. It helps to clarify the dataset structure by reducing variables and replacing the initial variables with latent variables (factors). The new factors are characterized by factor loadings (indicating relationships between the initial variables) and factor scores (representing the new special coordinates of the objects). The output is usually a specific table with factor loadings, which helps to interpret the physical meaning of the latent factors. In Table
Factor Loadings (Varimax normalized) Extraction: Principal components (marked loadings are >.700000) | ||||
---|---|---|---|---|
Variables | Factor - 1 | Factor - 2 | Factor - 3 | Factor - 4 |
Camphor | 0,114 | -0,935 | 0,057 | -0,002 |
1,8-Cineole | 0,124 | -0,951 | 0,094 | -0,019 |
1,8-Cineole + Phelandrene | 0,079 | 0,271 | 0,709 | 0,133 |
1,8-Cineol + D-Limonene | -0,448 | 0,059 | -0,378 | 0,186 |
D-Limonene | 0,055 | 0,020 | 0,768 | 0,147 |
1,8-Cineol + Z-β-Ocimene | 0,032 | 0,103 | 0,076 | 0,125 |
Z-β-Ocimene | 0,055 | -0,129 | 0,778 | 0,011 |
E-β-Ocimene | -0,013 | -0,139 | 0,796 | 0,024 |
Lavandulol | -0,858 | 0,093 | -0,029 | -0,243 |
Lavandulyl acetate | -0,715 | 0,083 | 0,108 | 0,223 |
Linalool | -0,105 | 0,017 | -0,702 | 0,410 |
Linalyl acetate | 0,450 | 0,408 | 0,241 | 0,774 |
3-Octanone | -0,285 | 0,192 | 0,781 | 0,286 |
Terpinen-4-ol | -0,727 | -0,009 | -0,050 | -0,150 |
α-Terpineol | 0,127 | 0,088 | -0,131 | -0,786 |
total % | 0,195 | 0,037 | -0,011 | 0,881 |
Expl. Var. % | 21,8 | 17,7 | 16,6 | 14,1 |
The first latent factor explains nearly 22% of the total variance of the system. It indicates the role of lavandulol, lavandulyl acetate, and terpinen-4-ol on the data structure and could be a conditionally named “terpene effect” factor, which characterizes Polish samples (Table
The second latent factor explains another 18% of the total variance and is related to the high loadings of camphor and 1,8-cineole being very important characteristics of the lavender oil. The Greek and some Mediterranean samples are related to this conditional “Camphor” factor (Table
The next latent factor, with an explanation of almost 17% of the total variance, is related to the special derivates of ocimene and limonene. The third latent factor could conditionally be named an “aromatic” factor (Table
The last latent factor 4 (about 14% of the total variance) indicates the significance of α-terpineol and linalyl acetate as major components to form the total content of the sample quantity; conditionally, this factor could be known as the “macro components“ factor (Table
The ISO defines lavender essential oil as the “oil obtained by steam distillation of recently cut flowering tops of Lavandula angustifolia Mill.” and fixes the main chemical components, which should be within a certain range of content [percents] (
Based on a large data set, we used multivariate statistical methods. The data mining was hierarchical and non-hierarchical clustering (K-means clustering) and, additionally, factor analysis in order to reveal hidden relationships between the objects of the study (partitioning with respect to the geographical location) or between the variables characterizing the objects (chemical descriptors). On both hierarchical dendrograms for linkage of the objects and variables into patterns of similarity, it is quite obvious that the objects are partitioned into 4 groups of similarity (clusters). K-means clustering is used to discuss the priory hypothesis along with expert opinion. Attention deserves Cluster Number 1, which indicates high quality. It contains 32 members (31 samples from various countries and standard maximums). This cluster could be conditionally named the “Bulgarian” pattern because it contains all the samples from Bulgaria together with a few of the French, Greek, Indian, Chinese, etc. samples. The applied factor analysis explains the total variance of the system and confirms the significance of the latent components (conditionally named “terpene” and “camphor” as very specific descriptors for the overall data structure further; “aromatic” and “macrocomponent” factors as general indicators for the lavender essential oil quality). Thus, the results from the statistical data mining could be of use for predicting the quality of lavender essential oil if a limited number of chemical components are available.
Ekaterina Kozuharova was supported in this research by the European Union, NextGenerationEU, through the National Recovery and Resilience Plan of the Republic of Bulgaria, Project № BG-RRP-2.004-0004-C01.
Components identified in the samples from 88 lavender essential oil samples along with minimum and maximum values of the standards
Data type: docx
Explanation note: Legend: Abbreviation of the country, number of the publication [1–35] and number of samples referred in thеse publications.