Ignacio Diez Lopez1-2*, Sandra Maeso Mendez2 and Gaspar Sánchez Merino3
Received: October 21, 2024; Published: October 29, 2024
*Corresponding author: Ignacio Diez Lopez, Basque Country University, Pediatric Department, Victory, Spain
DOI: 10.26717/BJSTR.2024.59.009272
Big data tools are currently a major tool for assessing population changes. Could be a causal relationship between low economic levels and a higher prevalence of conditions associated with obesity? Big dat could give us answers about.
Main Objective: Study effect of the unemployment rate, average income and immigration rate as a possible effect of increasing the prevalence of malnutrition associated with childhood obesity.
Material and Methods: Data collected from computerized clinical history episodes, studying the variables of sex, age, weight, height, of a pediatric population (compare 2020 vs 2022), comparing it with the average income of their residential district, unemployment rate and immigration rate. Use of big data methods for the study of variables. Using the Cole-Green LMS algorithm with penalized likelihood, implemented in the RefCurv 0.4.2 software (2020), which allows managing large amounts of data. The hyperparameters have been selected using the BIC (Bayesian information criterion). To calculate population deviations from the reference, the reference was taken as being above 1.5 standard deviations from the average according to age.
Results: 66,975 computerised episodes of children under 16 years of age and a total of 1,205,000 variables studied. The data and comparative graphs between districts of the population studied are represented with respect to the variables analysed. There are significant differences, with an increase in the rate of overweight in those areas with lower economic income and higher unemployment and immigration rates. Big data technology allows for more efficient population studies, selecting populations most in need of health intervention, optimizing scarce health resources.
Note: CEIC OSI ARABA Approval Expte 2022-058.
Abbreviations: BCAM: Basque Center for Applied Mathematics; DP: Dirichlet Process; BMIBIC: Bayesian Information Criterion; SES: Socioeconomic Status
Health programs to carry out checks on children throughout childhood [1,2] to assess their growth and development status [3]. The body mass index (BMI) is a common parameter to calculate and assess the degree of overweight [3], whether or not it is a criterion of health. Childhood overweight has been seen to increase in the last decade in all regions of the world [4,5]. In Spain, the tables of Carrascosa et al [6] are most aplied about. This group of authors (Diez Lopez, et al. [7]) published how the use of these methods would allow population studies to be carried out with greater statistical power than classic longitudinal studies. There are various studies that correlate obesity, especially childhood obesity, with the most disadvantaged sectors of society [8]
The causes of the higher prevalence of obesity in the most disadvantaged strata have been postulated as diverse, from decreased physical activity, overeating, less education, use of lower-cost and higher-calorie foods [8,9]. Although after COVID-19 pandemia a high prevalence of obesity could be appeared [7]. For another hand, is know than COVID-19 could have an import role for incresing prevalence of obesity among children [10,11]. Home confinement, lack of physical activity, increased screen time were key factors [11], but how much did the family's economic situation condition it? what role have social and economic status of family in all this story? We try ask about in this original.
Main Objective
To describe the situation of the prevalence of overweight in the pediatric population of our area, Álava, Basque Country, Spain, using a new big data approach in relation to the place of residence and the unemployment rate, average income per person and rate of immigrant population compare 2020 vs 2022 To compare whether there are differences in the BMI variable (kg/m2) by comparing paired means between districts and neighborhoods.
Design
This is a population-based cross-sectional study.
Study Population: All minors under 18 years of age being followed up in the Basque health system (OSAKIDETZA) who present weight and height records in the electronic clinical history tool of OSABIDE GLOBAL in the Alava area. Inclusion criteria: Ages between 0 and 18 years.
Exclusion Criteria: Not datas registered.
Epidemiological Data
Source is used on the variables average income per inhabitant, unemployment rate and immigration rate by district/neighborhood.
Available at EUSTAT
https://www.eustat.eus/bankupx/pxweb/es/DB/-/PX_010154_cepv1_ep06b.px/table/tableViewLayout1/.
(Accessed 08/29/202).
Data is recorded between 01/01/2020 and 30/03/2020, and the same period 2 years later, between both COVID-19 pandemia was occurred.
Variables
• Weight (Kgrs)
• Size (cm)
• Gender (Male, Female, Binary)
• Age (expressed in years and months)
• Date of registration
• Place of residence – district/neighborhood code
• Unemployment rate, per capita income and immigration rate by district
Data Management Plan
A data protection impact assessment has been prepared. The data life cycle will involve the IT service of OSI Araba, the project's principal investigator and collaborating researchers, including professionals from the Basque Center for Applied Mathematics (BCAM) who are part of the research team. There is a collaboration agreement between BCAM and the Bioaraba Health Research Institut. The method already described by Diez- Lopez et al [7] is followed, based on the Dirichlet processes (Dirichlet process, DP) [10]. In this project we will adopt this approach that allows to build Gaussian mixture models (GM) [7-13]. In addition, Gaussian averaging models based on Dirichlet processes (Dirichlet) are used. process Gaussian mixture models, DPGMM). Gaussian averaging models based on hierarchical Dirichlet processes (Hierarchical Dirichlet). Dirichlet process Gaussian mixture model, HDPGMM) [14]. Specifically, by grouping the data according to the different variables, clusters will be obtained that will inform us about the somatometric similarities and differences of the population based on the somatometric variables and the district in which they live [15], incorporating recent methodological innovations on databases similar to ours already described [7,16-18]. The BMI is calculated as weight/height2 (kg /m2). These data are compared with the means and SDS of the studies published to date and reference of our population [6]. Overweight is defined as +1.5 SDS with respect to the normal reference for age and sex.
Data has been obtained from a total of 67,270 cases. The sum of all variables studied (some presented in this work and others reserved) amounts to 1,749,020 variables. We present in various tables the results obtained by sex, age and BMI and other variables. Data from the National Institute of Statistics and EUSTAT indicate that 338,765 people live in our territory. Of these inhabitants, 166,437 are men and 172,328 are women. In addition, 52,241 of these people were born abroad. In the last year, our territory has gained 3,199 foreign-born inhabitants: a number higher than the total population growth. Although the rate of immigrants in the Basque Country is on average 13% of the total population, there are significant differences between districts (Source EUSTAT), with the towns of Álava having the highest average percentage of immigrants of the entire population of the region. Not significal changes from 2020 vs 2022. About average incomes at the Basque Country was calculated as the total income minus income tax and social security contributions paid by workers, of the resident population in 2021 is 19,366 euros. There are significant differences between age groups, sex and districts. The income of minors depends on the average family income. The average family income of the Basque Country is 47,005 euros in 2021. Total family income is obtained by aggregating the personal income of family members, including minors.
The average income for all families in the Basque Country is around double the average personal income. There are significant differences between districts (Source EUSTAT), with the towns of Álava having the lowest average income of the entire population of the region. The unemployment rate in the Basque Country is 7.5%, well below the average for the country, Spain. Although 6 out of 10 households have all their people employed, in more than 1 out of 10 they are all unemployed. There are significant differences between districts (Source EUSTAT), with some towns in Álava and Vizcaya having the highest unemployment rate. Based on the above, it is clear that the territory under study is the one with the lowest per capita income, the highest prevalence of migrants and some of the areas most affected by structural unemployment. At the same time, within the territory we have studied, Álava, there are significant differences between different districts. If we focus on the BMI data, establishing as a significant cut-off point those that exceed more than 10% of the entire population affected by obesity, differences are evident between districts within the territory of Alava (Table 1) (Charts 1-6).
Table 1: Numerical representation of data by districts for the variable BMI (Kgrs) according to sex. Reference normal population (P50 Carrascosa study). The percentages indicate the amount of population whose BMI is 1.5 standard deviations above the average for their age. Age 2020 and 2022. Differences between both years. In red is represented the district with more than 10% of the population > 1.5 SDS.

After studying the variables BMI, unemployment rate, per capita income and immigrant population separately, an assessment is made of those districts in the territory that have a higher prevalence of childhood obesity (>10%) in relation to these other variables. We also study where was highest improving ratio It is observed that there is an indirect relationship between a higher prevalence of childhood obesity expressed as >1.5 SDS according to district and the per capita income recorded in said district expressed in thousands of euros. Most of cases increase their rate of overweight from 2020 to 2022. A direct relation sheep is observant about per capita income. At woman is possible to show a decrease of overweight on most rich district. Regarding the unemployment rate by district, the opposite is evident, that with per capita income there is a direct relationship between the obesity rate and the unemployment rate. This suggests that both variables are therefore in a possible cause-effect relationship in their effect on the state of child nutrition. So, for children, we find a relationship between incomes family and unemployment rate. Both of them they are relationated too with changes of obesity in the las period, after ant before COVID19. As charts show us there are a direct relationship between rate of migrated people in the different part of our region, economic level and increase of overweight in the last years.
Modern clinical records, combined with new epidemiological research techniques and the use of big data, allow us to propose new health strategies. Different statistical techniques, such as machine learning, have been shown in other fields [7,14-16] to be effective in interpreting a large amount of data generated in real life and making decisions about it. Somatometry in children in general and the problem of obesity in particular are proposed as one of the fields of research. The secular acceleration of weight relative to height [4,5] is observed, and a possible additional effect to that observed is that generated by the pandemic and its confinement. Other postulated causes are the relationship with the socioeconomic level of the family [8,9]. Health resources and BIG DATA are proposed as a faster and cheaper way to obtain a real picture of the population situation, and therefore to determine where, how and why to invest these scarce resources [15-17]. We point out that our study shows that childhood obesity is present in our population, with towns, neighborhoods or districts with an affectation rate of more than 10% of the entire child population. Obesity occurs at key stages of development, such as pre-pubertal or late pubertal age, which can contribute to maintaining the problem of overweight [4,7,18]. Children with immigrant backgrounds generally have higher rates of overweight and obesity compared to native-born children:
• In Spain, children of immigrant origin had significantly higher prevalence of overweight/obesity than native children (40.5% vs 29.5% for boys, 44.8% vs 30.3% for girls) economic Factors. Surprisingly, family economic status does not fully explain the higher obesity rates among immigrant children: Children of immigrants tend to have higher obesity rates across all socioeconomic status (SES) groups, including high SES families. Among children from the most economically successful families, those of newly arrived immigrants (1.0 generation) are significantly more likely to be overweight or obese. Other Contributing Factors as acculturated immigrants tend to have the highest obesity rates. Some studies show mixed results regarding obesogenic behaviors:
1) In New Zealand, children of foreign-born mothers had lower odds of consuming fast food and soft drinks, but higher odds of inadequate sleep duration
2) In Spain, immigrant children were at higher risk of consuming sugary soft drinks, exercising less, and using screens more
These findings suggest that addressing childhood obesity among immigrant populations requires a nuanced approach that goes beyond economic factors. Public health interventions should consider:
• Language barriers and cultural factors
• Acculturation processes
• Specific risk factors and behaviors within immigrant communities
By targeting these areas, policymakers and health professionals can work towards reducing health inequalities and improving outcomes for children in immigrant families. But in addition, our work shows how there are situations not foreign to the environment where a child lives that seem to condition his or her situation regarding the recorded weight [8,9,19,20]. The relationship between income level, quality (and quantity) in food purchases, the possibility of attending extracurricular, educational, and sports activities [20,21] and in general the environment where a child grows up also seems to mark the possibility of suffering or not from obesity. Knowing first-hand where, how and in what way to act within the global epidemic of childhood obesity [22] will allow us to optimize the scarce resources we have and to carry out health intervention policies that are as effective as possible [23,24].
The main limitation of the study is related to the fact that the data used come from the electronic medical record and therefore have not been generated for research purposes. This is why, as described in the literature, errors may occur in the measurement and transcription of the data (Heude B, et al. [3]). The nature of this study allows it to be repeated periodically, detecting areas of improvement in different subpopulations.
Ethical Aspects
The study has been prepared in compliance with the principles established in the Declaration of Helsinki (1964) latest version Fortaleza, Brazil 2013, in the Council of Europe Convention on Human Rights and Biomedicine (1997), and in the regulations on biomedical research, protection of personal data. Law 14/2007 on Biomedical Research Study approved by the CEIC on 03/24/2023 with CODE File 2022-058.
The study will be conducted without funding. The tasks described in the project are undertaken by the principal investigator and his collaborators.
This original study has been supported thanks to the work of the Collaborative Group from Basque Center of Applied Mathematics (BCAM). Bilbao, Bizkaia Basque Country, Spain
• Jose A. Lozano Basque Center for Applied Mathematics BCAM
• Ioar Married Tellechea Basque Center for Applied Mathematics BCAM
• Aritz Pérez Postdoctoral Fellow BCAM - Basque Center for Applied Mathematics.
