JCDR - Analysis of correlated outcomes, Case-control study, Cox 36 regression hazard model, Logistic regression model

Abstract Material and Methods Results Discussion Conclusion Acknowledgement References DOI and Others

Article in PDF How to Cite Citation Manager Readers' Comments (0) Audio Visual Article Statistics Link to PUBMED Print this Article Send to a Friend

Advertisers Access Statistics Resources

Dr Mohan Z Mani

"Thank you very much for having published my article in record time.I would like to compliment you and your entire staff for your promptness, courtesy, and willingness to be customer friendly, which is quite unusual.I was given your reference by a colleague in pathology,and was able to directly phone your editorial office for clarifications.I would particularly like to thank the publication managers and the Assistant Editor who were following up my article. I would also like to thank you for adjusting the money I paid initially into payment for my modified article,and refunding the balance.
I wish all success to your journal and look forward to sending you any suitable similar article in future"

Dr Mohan Z Mani,
Professor & Head,
Department of Dermatolgy,
Believers Church Medical College,
Thiruvalla, Kerala
On Sep 2018

Prof. Somashekhar Nimbalkar

"Over the last few years, we have published our research regularly in Journal of Clinical and Diagnostic Research. Having published in more than 20 high impact journals over the last five years including several high impact ones and reviewing articles for even more journals across my fields of interest, we value our published work in JCDR for their high standards in publishing scientific articles. The ease of submission, the rapid reviews in under a month, the high quality of their reviewers and keen attention to the final process of proofs and publication, ensure that there are no mistakes in the final article. We have been asked clarifications on several occasions and have been happy to provide them and it exemplifies the commitment to quality of the team at JCDR."

Prof. Somashekhar Nimbalkar
Head, Department of Pediatrics, Pramukhswami Medical College, Karamsad
Chairman, Research Group, Charutar Arogya Mandal, Karamsad
National Joint Coordinator - Advanced IAP NNF NRP Program
Ex-Member, Governing Body, National Neonatology Forum, New Delhi
Ex-President - National Neonatology Forum Gujarat State Chapter
Department of Pediatrics, Pramukhswami Medical College, Karamsad, Anand, Gujarat.
On Sep 2018

Dr. Kalyani R

"Journal of Clinical and Diagnostic Research is at present a well-known Indian originated scientific journal which started with a humble beginning. I have been associated with this journal since many years. I appreciate the Editor, Dr. Hemant Jain, for his constant effort in bringing up this journal to the present status right from the scratch. The journal is multidisciplinary. It encourages in publishing the scientific articles from postgraduates and also the beginners who start their career. At the same time the journal also caters for the high quality articles from specialty and super-specialty researchers. Hence it provides a platform for the scientist and researchers to publish. The other aspect of it is, the readers get the information regarding the most recent developments in science which can be used for teaching, research, treating patients and to some extent take preventive measures against certain diseases. The journal is contributing immensely to the society at national and international level."

Dr Kalyani R
Professor and Head
Department of Pathology
Sri Devaraj Urs Medical College
Sri Devaraj Urs Academy of Higher Education and Research , Kolar, Karnataka
On Sep 2018

Dr. Saumya Navit

"As a peer-reviewed journal, the Journal of Clinical and Diagnostic Research provides an opportunity to researchers, scientists and budding professionals to explore the developments in the field of medicine and dentistry and their varied specialities, thus extending our view on biological diversities of living species in relation to medicine.
‘Knowledge is treasure of a wise man.’ The free access of this journal provides an immense scope of learning for the both the old and the young in field of medicine and dentistry as well. The multidisciplinary nature of the journal makes it a better platform to absorb all that is being researched and developed. The publication process is systematic and professional. Online submission, publication and peer reviewing makes it a user-friendly journal.
As an experienced dentist and an academician, I proudly recommend this journal to the dental fraternity as a good quality open access platform for rapid communication of their cutting-edge research progress and discovery.
I wish JCDR a great success and I hope that journal will soar higher with the passing time."

Dr Saumya Navit
Professor and Head
Department of Pediatric Dentistry
Saraswati Dental College
Lucknow
On Sep 2018

Dr. Arunava Biswas

"My sincere attachment with JCDR as an author as well as reviewer is a learning experience . Their systematic approach in publication of article in various categories is really praiseworthy.
Their prompt and timely response to review's query and the manner in which they have set the reviewing process helps in extracting the best possible scientific writings for publication.
It's a honour and pride to be a part of the JCDR team. My very best wishes to JCDR and hope it will sparkle up above the sky as a high indexed journal in near future."

Dr. Arunava Biswas
MD, DM (Clinical Pharmacology)
Assistant Professor
Department of Pharmacology
Calcutta National Medical College & Hospital , Kolkata

Dr. C.S. Ramesh Babu
" Journal of Clinical and Diagnostic Research (JCDR) is a multi-specialty medical and dental journal publishing high quality research articles in almost all branches of medicine. The quality of printing of figures and tables is excellent and comparable to any International journal. An added advantage is nominal publication charges and monthly issue of the journal and more chances of an article being accepted for publication. Moreover being a multi-specialty journal an article concerning a particular specialty has a wider reach of readers of other related specialties also. As an author and reviewer for several years I find this Journal most suitable and highly recommend this Journal."
Best regards,
C.S. Ramesh Babu,
Associate Professor of Anatomy,
Muzaffarnagar Medical College,
Muzaffarnagar.
On Aug 2018

Dr. Arundhathi. S
"Journal of Clinical and Diagnostic Research (JCDR) is a reputed peer reviewed journal and is constantly involved in publishing high quality research articles related to medicine. Its been a great pleasure to be associated with this esteemed journal as a reviewer and as an author for a couple of years. The editorial board consists of many dedicated and reputed experts as its members and they are doing an appreciable work in guiding budding researchers. JCDR is doing a commendable job in scientific research by promoting excellent quality research & review articles and case reports & series. The reviewers provide appropriate suggestions that improve the quality of articles. I strongly recommend my fraternity to encourage JCDR by contributing their valuable research work in this widely accepted, user friendly journal. I hope my collaboration with JCDR will continue for a long time".

Dr. Arundhathi. S
MBBS, MD (Pathology),
Sanjay Gandhi institute of trauma and orthopedics,
Bengaluru.
On Aug 2018

Dr. Mamta Gupta,
"It gives me great pleasure to be associated with JCDR, since last 2-3 years. Since then I have authored, co-authored and reviewed about 25 articles in JCDR. I thank JCDR for giving me an opportunity to improve my own skills as an author and a reviewer.
It 's a multispecialty journal, publishing high quality articles. It gives a platform to the authors to publish their research work which can be available for everyone across the globe to read. The best thing about JCDR is that the full articles of all medical specialties are available as pdf/html for reading free of cost or without institutional subscription, which is not there for other journals. For those who have problem in writing manuscript or do statistical work, JCDR comes for their rescue.
The journal has a monthly publication and the articles are published quite fast. In time compared to other journals. The on-line first publication is also a great advantage and facility to review one's own articles before going to print. The response to any query and permission if required, is quite fast; this is quite commendable. I have a very good experience about seeking quick permission for quoting a photograph (Fig.) from a JCDR article for my chapter authored in an E book. I never thought it would be so easy. No hassles.
Reviewing articles is no less a pain staking process and requires in depth perception, knowledge about the topic for review. It requires time and concentration, yet I enjoy doing it. The JCDR website especially for the reviewers is quite user friendly. My suggestions for improving the journal is, more strict review process, so that only high quality articles are published. I find a a good number of articles in Obst. Gynae, hence, a new journal for this specialty titled JCDR-OG can be started. May be a bimonthly or quarterly publication to begin with. Only selected articles should find a place in it.
An yearly reward for the best article authored can also incentivize the authors. Though the process of finding the best article will be not be very easy. I do not know how reviewing process can be improved. If an article is being reviewed by two reviewers, then opinion of one can be communicated to the other or the final opinion of the editor can be communicated to the reviewer if requested for. This will help one’s reviewing skills.
My best wishes to Dr. Hemant Jain and all the editorial staff of JCDR for their untiring efforts to bring out this journal. I strongly recommend medical fraternity to publish their valuable research work in this esteemed journal, JCDR".

Dr. Mamta Gupta
Consultant
(Ex HOD Obs &Gynae, Hindu Rao Hospital and associated NDMC Medical College, Delhi)
Aug 2018

Dr. Rajendra Kumar Ghritlaharey

"I wish to thank Dr. Hemant Jain, Editor-in-Chief Journal of Clinical and Diagnostic Research (JCDR), for asking me to write up few words.
Writing is the representation of language in a textual medium i e; into the words and sentences on paper. Quality medical manuscript writing in particular, demands not only a high-quality research, but also requires accurate and concise communication of findings and conclusions, with adherence to particular journal guidelines. In medical field whether working in teaching, private, or in corporate institution, everyone wants to excel in his / her own field and get recognised by making manuscripts publication.

Authors are the souls of any journal, and deserve much respect. To publish a journal manuscripts are needed from authors. Authors have a great responsibility for producing facts of their work in terms of number and results truthfully and an individual honesty is expected from authors in this regards. Both ways its true "No authors-No manuscripts-No journals" and "No journals–No manuscripts–No authors". Reviewing a manuscript is also a very responsible and important task of any peer-reviewed journal and to be taken seriously. It needs knowledge on the subject, sincerity, honesty and determination. Although the process of reviewing a manuscript is a time consuming task butit is expected to give one's best remarks within the time frame of the journal.
Salient features of the JCDR: It is a biomedical, multidisciplinary (including all medical and dental specialities), e-journal, with wide scope and extensive author support. At the same time, a free text of manuscript is available in HTML and PDF format. There is fast growing authorship and readership with JCDR as this can be judged by the number of articles published in it i e; in Feb 2007 of its first issue, it contained 5 articles only, and now in its recent volume published in April 2011, it contained 67 manuscripts. This e-journal is fulfilling the commitments and objectives sincerely, (as stated by Editor-in-chief in his preface to first edition) i e; to encourage physicians through the internet, especially from the developing countries who witness a spectrum of disease and acquire a wealth of knowledge to publish their experiences to benefit the medical community in patients care. I also feel that many of us have work of substance, newer ideas, adequate clinical materials but poor in medical writing and hesitation to submit the work and need help. JCDR provides authors help in this regards.
Timely publication of journal: Publication of manuscripts and bringing out the issue in time is one of the positive aspects of JCDR and is possible with strong support team in terms of peer reviewers, proof reading, language check, computer operators, etc. This is one of the great reasons for authors to submit their work with JCDR. Another best part of JCDR is "Online first Publications" facilities available for the authors. This facility not only provides the prompt publications of the manuscripts but at the same time also early availability of the manuscripts for the readers.
Indexation and online availability: Indexation transforms the journal in some sense from its local ownership to the worldwide professional community and to the public.JCDR is indexed with Embase & EMbiology, Google Scholar, Index Copernicus, Chemical Abstracts Service, Journal seek Database, Indian Science Abstracts, to name few of them. Manuscriptspublished in JCDR are available on major search engines ie; google, yahoo, msn.
In the era of fast growing newer technologies, and in computer and internet friendly environment the manuscripts preparation, submission, review, revision, etc and all can be done and checked with a click from all corer of the world, at any time. Of course there is always a scope for improvement in every field and none is perfect. To progress, one needs to identify the areas of one's weakness and to strengthen them.
It is well said that "happy beginning is half done" and it fits perfectly with JCDR. It has grown considerably and I feel it has already grown up from its infancy to adolescence, achieving the status of standard online e-journal form Indian continent since its inception in Feb 2007. This had been made possible due to the efforts and the hard work put in it. The way the JCDR is improving with every new volume, with good quality original manuscripts, makes it a quality journal for readers. I must thank and congratulate Dr Hemant Jain, Editor-in-Chief JCDR and his team for their sincere efforts, dedication, and determination for making JCDR a fast growing journal.
Every one of us: authors, reviewers, editors, and publisher are responsible for enhancing the stature of the journal. I wish for a great success for JCDR."

Thanking you
With sincere regards
Dr. Rajendra Kumar Ghritlaharey, M.S., M. Ch., FAIS
Associate Professor,
Department of Paediatric Surgery, Gandhi Medical College & Associated
Kamla Nehru & Hamidia Hospitals Bhopal, Madhya Pradesh 462 001 (India)
E-mail: drrajendrak1@rediffmail.com
On May 11,2011

Dr. Shankar P.R.

"On looking back through my Gmail archives after being requested by the journal to write a short editorial about my experiences of publishing with the Journal of Clinical and Diagnostic Research (JCDR), I came across an e-mail from Dr. Hemant Jain, Editor, in March 2007, which introduced the new electronic journal. The main features of the journal which were outlined in the e-mail were extensive author support, cash rewards, the peer review process, and other salient features of the journal.
Over a span of over four years, we (I and my colleagues) have published around 25 articles in the journal. In this editorial, I plan to briefly discuss my experiences of publishing with JCDR and the strengths of the journal and to finally address the areas for improvement.
My experiences of publishing with JCDR: Overall, my experiences of publishing withJCDR have been positive. The best point about the journal is that it responds to queries from the author. This may seem to be simple and not too much to ask for, but unfortunately, many journals in the subcontinent and from many developing countries do not respond or they respond with a long delay to the queries from the authors 1. The reasons could be many, including lack of optimal secretarial and other support. Another problem with many journals is the slowness of the review process. Editorial processing and peer review can take anywhere between a year to two years with some journals. Also, some journals do not keep the contributors informed about the progress of the review process. Due to the long review process, the articles can lose their relevance and topicality. A major benefit with JCDR is the timeliness and promptness of its response. In Dr Jain's e-mail which was sent to me in 2007, before the introduction of the Pre-publishing system, he had stated that he had received my submission and that he would get back to me within seven days and he did!
Most of the manuscripts are published within 3 to 4 months of their submission if they are found to be suitable after the review process. JCDR is published bimonthly and the accepted articles were usually published in the next issue. Recently, due to the increased volume of the submissions, the review process has become slower and it ?? Section can take from 4 to 6 months for the articles to be reviewed. The journal has an extensive author support system and it has recently introduced a paid expedited review process. The journal also mentions the average time for processing the manuscript under different submission systems - regular submission and expedited review.
Strengths of the journal: The journal has an online first facility in which the accepted manuscripts may be published on the website before being included in a regular issue of the journal. This cuts down the time between their acceptance and the publication. The journal is indexed in many databases, though not in PubMed. The editorial board should now take steps to index the journal in PubMed. The journal has a system of notifying readers through e-mail when a new issue is released. Also, the articles are available in both the HTML and the PDF formats. I especially like the new and colorful page format of the journal. Also, the access statistics of the articles are available. The prepublication and the manuscript tracking system are also helpful for the authors.
Areas for improvement: In certain cases, I felt that the peer review process of the manuscripts was not up to international standards and that it should be strengthened. Also, the number of manuscripts in an issue is high and it may be difficult for readers to go through all of them. The journal can consider tightening of the peer review process and increasing the quality standards for the acceptance of the manuscripts. I faced occasional problems with the online manuscript submission (Pre-publishing) system, which have to be addressed.
Overall, the publishing process with JCDR has been smooth, quick and relatively hassle free and I can recommend other authors to consider the journal as an outlet for their work."

Dr. P. Ravi Shankar
KIST Medical College, P.O. Box 14142, Kathmandu, Nepal.
E-mail: ravi.dr.shankar@gmail.com
On April 2011 Anuradha

Dear team JCDR, I would like to thank you for the very professional and polite service provided by everyone at JCDR. While i have been in the field of writing and editing for sometime, this has been my first attempt in publishing a scientific paper.Thank you for hand-holding me through the process.

Dr. Anuradha
E-mail: anuradha2nittur@gmail.com
On Jan 2020

Important Notice

Original article / research

Year : 2022 | Month : July | Volume : 16 | Issue : 7 | Page : YC15 - YC19

Full Version

Application of Principal Component Analysis in Dealing with Multicollinearity in Modelling Clinical Data

Published: July 1, 2022 | DOI: https://doi.org/10.7860/JCDR/2022/55379.16629
Akash Mishra, N Sreekumaran Nair, KT Harichandrakumar, VS Binu, Santhosh Satheesh

1. PhD Scholar, Department of Biostatistics, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, Tamil Nadu, India. 2. Professor, Department of Biostatistics, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, Tamil Nadu, India. 3. Assistant Professor, Department of Biostatistics, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, Tamil Nadu, India. 4. Associate Professor, Department of Biostatistics, National Institute of Mental Health and Neurosciences, Bengaluru, Karnataka, India. 5. Professor, Department of Cardiology, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, Tamil Nadu, India.

Correspondence Address :
Dr. N Sreekumaran Nair,
Admin Block, 4th Floor, Department of Biostatistics, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, Tamil Nadu, India.
E-mail: nsknairmanipal@gmail.com

Abstract

Introduction: One of the stringent assumptions about covariates in the Cox hazard and Logistic regression modelling is that they should be independent. Incorporating correlated covariates as such into the model might distort the precision of the estimates due to multicollinearity. One way to deal with multicollinearity is by using Principal Component Analysis (PCA) technique.

Aim: To demonstrate the application of PCA in dealing with correlated covariates while modelling time to event and case-control study data.

Materials and Methods: This study was conducted at Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India, from February 2021 to January 2022. Two datasets were used for the demonstration i.e., data relates to a time to event outcome and a case-control study with binary outcome in which lipids were the correlated covariates. Three sets of Cox regression models were used to demonstrate change in hazard ratios with 95% Confidence Intervals (CI) for evaluating the effect of intervention at a different time of lipid measurement. Model I has evaluated treatment/Body Mass Index (BMI) effect on the outcome by ignoring the effect of lipid parameters. Model II has evaluated treatment/BMI effect on the outcome by incorporating lipid variables but ignoring multicollinearity. Model III has evaluated treatment/ BMI effect on the outcome by incorporating lipid variables through principal component analysis and thus adjusting for multicollinearity. Similarly, a logistic regression model was performed by using the same three sets of models to evaluate the effect of exposure (BMI). The comparability of lipids between the two groups for both datasets was tested using Hotelling’s T-squared statistic.

Results: The lipids measured at 12^th, 24^th and 36^th months between the two groups in the first data set as well as between cases and controls in the second data set were statistically significant. In the first dataset, at baseline, the Hazard Ratio’s (HR’s) were statistically similar irrespective of the models used; while decreasing successively with narrowing 95% CI’s as moving from model I to model III for the lipid measured at 12^th, 24^th and 36^th months. Further, at 24^th and 36^th months, the HR in model-III found to be significant. In the second data set, the Odds Ratio (OR) were significant for all the three models and it was almost similar for model I and II but in model III it was elevated.

Conclusion: The multicollinearity issue should be properly addressed before including correlated covariates in the Cox regression hazard and Logistic regression model. The PCA technique would be a favourable method.

Keywords

Analysis of correlated outcomes, Case-control study, Cox 36 regression hazard model, Logistic regression model

Any model that establishes the effect of the potential covariates on the outcome variable should comply with the nature of the outcome or dependent variable. For a longitudinal study with a time to event outcome variable, the commonly used statistical approach is the Cox hazard regression model (1),(2). Similarly, for a case-control study with a binary outcome variable, the suitable approach is the logistic regression model (1). One of the assumptions of such a regression model is that the predictor variables should not be correlated with each other. However, the predictors under consideration may not be truly independent but rather correlated in biomedical research. Such dependency between the covariates in the regression modelling leads to a condition referred to in a statistical term as multicollinearity which means a covariate can be predicted by the remaining covariates (3),(4).

The main issue with multicollinearity is that the estimate of the regression coefficient of one of the correlated predictors depends on the presence of the other predictors in the model. Also, due to multicollinearity, the estimated standard errors of the regression coefficients might get inflated and could lead to spurious results. Variables in clinical research studies are usually found to be correlated (5),(6),(7),(8),(9). This stipulates that the change in one variable is associated with the change in another variable. There are studies that have established the association of these lipid parameters with the outcome of interest such as Cardiovascular Disease (CVD) and Sudden Sensorineural Hearing Loss (SSNHL) (10),(11),(12),(13),(14),(15),(16),(17),(18),(19),(20),(21),(22),(23),(24).

The researcher while evaluating the effect of an intervention/exposure on the outcome has to be conscious in dealing with the effect of such multiple correlated predictors. There are studies where the multicollinearity issues of lipids were not addressed while modelling the outcome variable with Cox hazard or logistic regression models (19),(20),(21),(22),(23),(24). In these cited studies, the effect of intervention/exposure were evaluated by introducing the lipid parameters as the covariates in the model as such. So, due to multicollinearity, it is likely to get unreliable point estimates of Hazard Ratio (HR) or Odds Ratio (OR) of the intervention/exposure. Moreover, incorporating the correlated covariates into the model as such may weakens the statistical power of such regression models. In such conditions, the researcher will be concluding with a compromised precision of the effect of intervention/exposure. To address the multicollinearity issue, methods like partial least square (PLS), Ridge Regression (RR) and Principal Component Analysis (PCA) have been suggested. PLS and RR methods are used for the continuous outcome variable. Since our outcome variable is binary, so, in this article PCA technique was used (25).

The objective of this study was to demonstrate the application of PCA method in dealing with multicollinearity with Cox and logistic regression models. The demonstration was done from two data sets. The first data set was from the ACCORD BP (Action to Control Cardiovascular Risk in Diabetes Blood Pressure) trial in which data was recorded from time to event. While second data set was from a case-control study on Sudden Sensorineural Hearing Loss (SSNHL). Lipids were then correlated with covariates in both the data sets.

Material and Methods

Materials and Methods

This study was conducted at Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India, from February 2021 to January 2022.

Brief Description of Dataset

For the demonstration the following two datasets were used.

ACCORD BP trial dataset (26): The ACCORD trial dataset was available from Biologic Specimen and Data Repository Information Coordinating Centre (https://biolincc.nhlbi.nih.gov/home/) of National Heart, Lung, and Blood Institute, upon institutional request. It was an open-label multicentric randomised trial of 84 months follow-up. A total of 4733 high-risk type 2 diabetes mellitus eligible participants were randomised into two study groups:

• Intensive BP control group (n=2362): Treatment strategy was to lower Systolic Blood Pressure (SBP) below 120 mmHg.
• Standard BP control group (n=2371): The strategy was to lower SBP below 140 mmHg.

The treatment strategy followed in the respective BP control groups was for the comparison in reducing CVD events. The primary outcome variable considered was a composite of non fatal Myocardial Infarction (MI), non fatal stroke and CVD death whichever occurred first.

The five lipid parameters were measured at baseline and thereafter on yearly basis:

• Total Cholesterol (TC)
• Triglyceride (TG)
• Very Low Density Lipoprotein (VLDL)
• Low Density Lipoprotein (LDL)
• High Density Lipoprotein (HDL)

The participants who were not measured for their lipid parameters at different follow-ups were excluded from the analysis.

SSNHL case-control study dataset (10): The SSNHL case-control study dataset was publicly available from the authors obtained by dryad (http://dx.doi.org/10.5061/dryad.r2b1n). A total of 324 hospitalised cases for SSNHL and 972 controls with normal hearing were taken. As per World Health Organisation (WHO) criteria the underweight subject (BMI ≤18.5 kg/m²) from among the cases and controls were excluded from the analysis (27). The data on BMI and lipid parameters TC, TG, LDL and HDL for the cases and controls were available.

Models

Cox hazard regression model: The Cox proportional hazard model was used when the covariates considered in the model satisfied the proportionality assumption. For a random binary outcome variable Y with a vector of covariates X: [X₁ X₂ .....X_p] and the corresponding vector of b coefficients β'=[β₁ β₂.......β_p], Cox proportional hazard model with hazard rate h (t/X) at any time t is expressed as:

h (t/X)=h₀ (t) e^Xβ

Where, h₀ (t) is an unspecified non negative function of time called baseline hazard at time t. Thus, the HR to an individual of j^th group with 1×p vector of covariates X against an individual of k^th group with a vector of same covariates can be obtained as:

Where, X_j and X_k are the vector of the same covariate X for j^th and k^th groups respectively (1),(2),(3):

Cox time-dependent hazard model: The Cox time-dependent model was used when the covariates considered did not satisfy the proportionality assumption. The time-dependent Cox hazard model with hazard rate h[t/X(t)] at time t is expressed as:

h[(t/X(t)]=h₀ (t) e^X(t)β

Where, h₀ (t) is an unspecified non negative function of time called base line hazard at time t. Thus, the HR can be obtained as:

Where, X_j (t) and X_k (t) are the vector of the same covariate X(t) at time t for the j^th and k^th groups respectively. The estimate of HR associated with ith covariate and the corresponding (1-α) Confidence Interval (CI) was obtained by using the estimates of β_i and its standard error as e^b_i and e^b_i^+Z_α/2 SE _(βi), respectively.

Logistic regression model: The logistic regression model was used in case control study data set. For a random outcome variable Y with vector of covariates X: X₁ X₂ .....X_p and the corresponding vector of b coefficients β'=[β₀ β₁ β₂.......β_p], the estimate of odds ratio (OR) was obtained by using the logistic model as:

Where, β₀ is the constant, called intercept of the regression equation.

Thus, the odds (OR) to an individual of j^th group with p_j being the probability of occurrence of event with vector of the covariates X against an individual of k^th group with p_k being the probability of occurrence of the event with the vector of same covariates can be obtained as:

The estimates of OR associated with i^th covariate and the corresponding (1-α) CI was obtained by using the estimates of bi and its standard error as e^b_i and e^b_i^+Z_α/2 SE _(βi), respectively.

Principal Component Analysis (PCA): It is a data dimension reduction technique. It creates a new set of uncorrelated variables known as Principal Components (PC) based on the linear combinations of all correlated variables. Generally, first few PC’s can explain the most of total variability of all correlated variables (28).

The general PCA equation to create the independent variables is given by:

which maximises the variance of (eⁱ_T X) subject to the condition eⁱ_T e_i=1 and Cov (e_i^T X, e_k^TX)=0 for i?k, where, X'=[X₁ X₂ X₃ ...X_p], a random vector of correlated p variables which have the covariance matrix as S with the eigen values λ₁≥λ₂â€¦≥λ_p≥0 and e_i^T is the transpose of eigen vector corresponding to i^th eigen value (λ_i).All the PC’s are uncorrelated and variance equal to the eigen values of S i.e., Var (PC_i) = λ_i. Thus, the first PC explains the maximum variation of the data followed by second component and so on. For both the data set, the new independent variables were created using the measured values of lipid parameters.

Statistical Analysis

The following three sets of models were used in analysis for both the data sets:

Model I: Treatment/BMI effect on the outcome been compared by ignoring the effect of lipid parameters.

Model II: Treatment/BMI effect on the outcome been compared by incorporating lipid variables but ignoring multicollinearity.

Model III: Treatment/BMI effect on the outcome has been compared by incorporating lipid variables through principal component analysis and thus adjusting for multicollinearity.

However, the methodological component and analysis part were explained for each dataset separately.

ACCORD BP trial dataset: The Pearson correlation coefficients between lipids parameters were computed with log-transformed values of lipids due to their skewed distribution (Table/Fig 1).

Three Cox proportional hazard regression models were fitted. The treatment group was taken as the main predictor variable. The Cox proportional hazard regression model was used if proportionality assumptions were satisfied; the Cox time-dependent regression hazard model otherwise. The proportionality assumption of each covariate was tested by Schoenfeld’s global test (29).

The HR with 95% CIs were estimated across all the above three models for measurements of lipids at the baseline, 12^th, 24^th and 36^th month follow-ups. The lipid parameters were introduced after seeing the significant difference in lipids between the two treatment groups. This was tested by multivariate Hotelling’s T-squared statistic as the lipids were correlated (28),(29),(30). The difference testing performed on log-transformed values of lipids for both the datasets to meet assumptions as the distributions were skewed. The lipid parameters were found to differ significantly between the two groups at each time point except at baseline (Table/Fig 2).

The eigen values (λ_i) and the corresponding transpose of eigen vectors (e_i^T) were obtained for intensive and standard BP control groups separately. Further, PCA was performed in each group to create new independent variables for the random vector X'=[TC,TG,VLDL,LDL,HDL] with the covariance matrix as:

Using PC equations, data was generated for the first three independent PC’s at baseline, 12^th, 24^th and 36^th months, respectively. These first three PC’s were able to explain more than 99% of the total variation in lipids at each considered time point. The effect of intervention in model III was evaluated by adjusting for the effect of newly formed independent PC’s in the Cox hazard model. The significance of HR’s was judged by their 95% CI’s.

SSNHL case control study dataset: Similarly, the correlation coefficients between lipids parameters were computed with log transformed values of lipids (Table/Fig 1). The BMI was considered as the primary exposure for the SSNHL data. The same three sets of Logistic regression models were fitted for SSNHL dataset. The OR with 95% CI were estimated for BMI which was categorised as normal (BMI between 18.5 to 24.99 kg/m²) and overweight or obese (BMI ≥25 kg/m²) (29). Again, the difference in the lipid parameters between cases and controls was tested using multivariate Hotelling’s T-squared statistic (Table/Fig 2). The lipids were introduced into the model as these differed significantly between the groups.

Similarly, the eigen values (λ_i) and the corresponding transpose of eigen vectors (e_i^T ) were obtained for cases and control separately and corresponding PCA was performed for each group for a random vector X'=[TC,TG,LDL,HDL] with covariance matrix as:

The first three independent PC’s were generated, which were able to explain 99% of total variation of all lipids. The effect of BMI in model III was evaluated by adjusting for the effect of newly formed independent PC’s into logistic regression model. The significance of OR’s was judged by their 95% CI’s. The analysis was carried using R Studio version 3.6.1 (31), Statistical Package for Social Sciences (SPSS) version 19.0 (32) and StataCorp. volume 13 (33).

Results

The correlation coefficient and their significance between the lipid parameters were shown in (Table/Fig 1) for both datasets. For the ACCORD BP trial data, the intensive and standard BP control groups at baseline were statistically similar for lipid parameters but differed significantly at 12^th, 24^th and 36^th months (Table/Fig 2). Similarly, the lipid profiles of cases and control groups were significantly different in the SSNHL data (Table/Fig 2).

For the ACCORD BP trial data, (Table/Fig 3) gives the comparison of effect of intervention (HR and 95% CI) assessed by three different models at the different time of lipid measurements. At baseline, the HR’s were statistically similar with slight variation irrespective of the models used. While at 12^th, 24^th and 36^th months, the scenario of HR’s was different in the three models. The HR’s were successively decreasing with narrowing 95% CI’s as moving from model-I to model-III. For the lipid measurement at 12th month, the HR’s and the corresponding 95% CI in the successive three models were 0.885 (0.713-1.099), 0.860 (0.692-1.068), 0.835 (0.672-1.038). At 24^th month the HR’s and the corresponding 95% CI in the respective models were 0.835 (0.681-1.025), 0.818 (0.667-1.004), 0.806 (0.657- 0.990) and at 36^th month the HR’s and 95% CI for the three models were 0.835 (0.674-1.035), 0.820 (0.656-1.025), 0.695 (0.546-0.883). Moreover, at 24^th and 36^th months, the HR with model-I and model-II were insignificant but found to be significant for the model-III.

For SSNHL data the effect of BMI (OR and 95% CI) was compared between the three models (Table/Fig 4). The OR with 95% CI for models I and II was 1.465 (1.119-1.917) and 1.467 (1.106-1.945), respectively which indicates the similarity of the point estimate and the corresponding precisions did not differ much. But, model III showed a different scenario as compared to models I and II. The OR with 95% CI for model III was 1.988 (1.425-2.773). The OR for model III was relatively elevated as compared to models I and II and 95% CI was wider too.

Discussion

In Cox proportional hazard and logistic regression models, the multicollinearity assumptions on the covariates often get oversighted in medical research. This leads to compromised precision of the estimates. Such covariates needed to be independent when considered in the model. Otherwise, the presence of multicollinearity may distort true estimates and thus, end up with biased findings (1),(3),(4). The multivariate statistical approach which deals with the multiple correlated outcomes has its own applications to deal with such problems. The PCA is one of them which has the potential to derive independent PC’s. Moreover, it reduces the dimension of the correlated data and the first few components can explain almost total variation in the data. Thus, instead of using the correlated covariates as such in the model, a few PC’s can be included in the model without loss of information. This PCA approach addresses the issue of multicollinearity with a smaller number of predictors.

There are few cited studies that had evaluated the effect of interventions/exposure on the outcome. In these studies, the lipids parameters that are associated with the outcome of interest were incorporated into the model as such. Thus, by ignoring the effect of multicollinearity conclusions were made (19),(20),(21),(22),(23),(24). Pedersen TR et al., used the Cox hazard model to compare the event rate of the primary outcome of major coronary events in patients treated with high-dose of atorvastatin against usual-dose. The HR was estimated for the primary endpoint adjusting for the other variables including TC and HDL as the simultaneous covariates. The decision emerged in support of the high dose of Atorvastatin in reducing the primary outcome (19). However, the precision of the estimated HR would have been more reliable, if the multicollinearity among the lipids would have been addressed using PCA. Ting ZWR et al., examined the effects of the use of statins and fibrates on the onset of CVD in Chinese diabetic patients using the Cox model. The HR’s were estimated for the lipids LDL, HDL and TG by adjusting the effects of several identified covariates. These correlated lipids had been considered as the separate covariates in the model. The reliability of the estimates may be questionable as the multicollinearity among them was ignored. (20). Hou Q et al., by using a logistic regression model identified the relevant predictors of the presence of carotid plaque in the general Chinese adults. They identified age, gender, DBP and TC as the independent predictors of carotid plaque. Since, age, DBP and TC are the correlated predictors, the estimates of OR’s of these as well as of gender may not be precise as they did not account for multicollinearity. Atleast by using PCA, the more precise estimate for gender could have been obtained (22). The present study demonstrated the application of PCA technique in dealing with multiple correlated covariates. This could benefit the medical researchers/clinicians to obtain more valid and precise estimates for the effect of intervention/exposure. The findings of the ACCORD BP trial data set and SSNHL case-control study dataset for all the three comparative models suggest the importance of PCA to enhance the reliability of the estimates with improved precision (Table/Fig 3). Although, this study demonstrated the application of PCA to address multicollinearity for continuous correlated covariates. But this concept could be employed for correlated categorical covariates also using PCA technique. It could be a good motivation and an interesting area of future research.

Limitation(s)

This study demonstrated the application of PCA to address multicollinearity for continuous correlated covariates and not for categorical correlated covariates.

Conclusion

The study clearly demonstrates that multicollinearity among the covariates in the model should be addressed before inclusion in the Cox regression or Logistic regression model. The PCA technique could be one of the ways to address this issue to obtain reliable and precise estimates for the covariates of interest.

Acknowledgement

Authors are sincerely grateful to ACCORD Research Materials obtained from National Heart, Lung, and Blood Institute (NHLBI), Biologic Specimen and Data Repository Information Coordinating Centre for providing access to their data through Research Materials Distribution Agreement (RMDA) and authors are also thankful to the ACCORD trial group. Authors also acknowledge the authors of SSNHL study group to make their data publicly available and authors pay their sincere gratitude and regard to the members of the Doctoral advisory committee for their valuable suggestions.

References

Harrell FE. Regression modeling strategies: With applications to linear models, logistic and ordinal regression, and survival analysis. New York: Springer; 2015. [crossref]

Collett D. Modelling survival data in medical research. CRC press. 3rd ed. 2015; Chapter-3: 57-73 & Chapter 8: 295-303.

Rawlings JO, Pantula SG, Dickey DA, editors. Applied regression analysis: A research tool. New York, NY: Springer New York; 1998. [crossref]

Hickey GL, Kontopantelis E, Takkenberg JJ, Beyersdorf F. Statistical primer: Checking model assumptions with regression diagnostics. Interact Cardiovasc Thorac Surg. 2019;28(1):01-08. [crossref] [PubMed]

Arokiaraj SJ. Blood sugar, lipid profile and their correlation: A pilot study in Puducherry. Int J Appl Pure Sci Agric. 2016;2:140-44.

Bønaa KH, Thelle DS. Association between blood pressure and serum lipids in a population. The tromsø study. Circulation. 1991;83(4):1305-14. [crossref] [PubMed]

Sharma S, Merchant J, Fleming SE. Lp (a)-cholesterol is associated with HDL- cholesterol in overweight and obese African American children and is not an independent risk factor for CVD. Cardiovascular Diabetology. 2012;11(1):01-07. [crossref] [PubMed]

Babler RL. Total blood cholesterol, blood triglyceride, and blood HDL correlation. Western Michigan University; 1991.

Qabaha K, Hassan WA, Mansour H, Thanigachalam S, Naser S. Demographic and blood lipid profiles in correlation with heart attacks among mediterraneans. Journal of Nutrition & Food Sciences. 2014.1;4(4):01. [crossref]

10.

Lee JS, Kim DH, Lee HJ, Kim HJ, Koo JW, Choi HG, et al. Lipid profiles and obesity as potential risk factors of sudden sensorineural hearing loss. PLoS One. 2015.10;10(4):e0122496. [crossref] [PubMed]

11.

Mohammed AA. Lipid profile among patients with sudden sensorineural hearing loss. Indian J Otolaryngol Head Neck Surg. 2014;66(4):425-28. [crossref] [PubMed]

12.

Chang IJ, Kang CJ, Yueh CY, Fang KH, Yeh RM, Tsai YT. The relationship between serum lipids and sudden sensorineural hearing loss: A systematic review and meta-analysis. PLoS One. 2015;10(4):e0121025. [crossref] [PubMed]

13.

Tolonen N, Forsblom C, Mäkinen VP, Harjutsalo V, Gordin D, Feodoroff M, et al. Different lipid variables predict incident coronary artery disease in patients with type 1 diabetes with or without diabetic nephropathy: The FinnDiane study. Diabetes Care. 2014;37(8):2374-82. [crossref] [PubMed]

14.

Sone H, Nakagami T, Nishimura R, Tajima N, MEGA Study Group. Comparison of lipid parameters to predict cardiovascular events in Japanese mild-to-moderate hypercholesterolemic patients with and without type 2 diabetes: Subanalysis of the MEGA study. Diabetes Res Clin Pract. 2016;113:14-22. [crossref] [PubMed]

15.

Tohidi M, Hatami M, Hadaegh F, Safarkhani M, Harati H, Azizi F. Lipid measures for prediction of incident cardiovascular disease in diabetic and non-diabetic adults: Results of the 8.6 years follow-up of a population based cohort study. Lipids Health Dis. 2010;9(1):01-09. [crossref] [PubMed]

16.

Sone H, Tanaka S, Tanaka S, Iimuro S, Ishibashi S, Oikawa S, et al. Comparison of various lipid variables as predictors of coronary heart disease in Japanese men and women with type 2 diabetes: Subanalysis of the Japan Diabetes Complications Study. Diabetes care. 2012;35(5):1150-57. [crossref] [PubMed]

17.

Lee Y, Park S, Lee S, Kim Y, Kang MW, Cho S, et al. Lipid profiles and risk of major adverse cardiovascular events in CKD and diabetes: A nationwide population-based study. PloS one. 2020;15(4):e0231328. [crossref] [crossref]

18.

Shai I, Rimm EB, Hankinson SE, Curhan G, Manson JE, Rifai N, et al. Multivariate assessment of lipid parameters as predictors of coronary heart disease among postmenopausal women: Potential implications for clinical guidelines. Circulation. 2004;110(18):2824-30. [crossref] [PubMed]

19.

Pedersen TR, Faergeman O, Kastelein JJ, Olsson AG, Tikkanen MJ, Holme I, et al. High-dose atorvastatin vs usual-dose simvastatin for secondary prevention after myocardial infarction: The IDEAL study: A randomized controlled trial. JAMA. 2005.16;294(19):2437-45. [crossref] [PubMed]

20.

Ting ZWR, Yang X, Yu LW, Luk AO, Kong AP, Tong PC, et al. Lipid control and use of lipid-regulating drugs for prevention of cardiovascular events in Chinese type 2 diabetic patients: A prospective cohort study. Cardiovasc Diabetol. 2010;9(1):01-09. [crossref] [PubMed]

21.

Paredes S, Fonseca L, Ribeiro L, Ramos H, Oliveira JC, Palma I. Novel and traditional lipid profiles in Metabolic Syndrome reveal a high atherogenicity. Scientific Reports. 2019;9(1):01-07. [crossref] [PubMed]

22.

Hou Q, Li S, Gao Y, Tian H. Relations of lipid parameters, other variables with carotid intima-media thickness and plaque in the general Chinese adults: An observational study. Lipids Health Dis. 2018;17(1):107. [crossref] [PubMed]

23.

Nissen SE, Tuzcu EM, Libby P, Thompson PD, Ghali M, Garza D, et al. Effect of antihypertensive agents on cardiovascular events in patients with coronary disease and normal blood pressure: The CAMELOT study: A randomized controlled trial. JAMA. 2004;292(18):2217-25. [crossref] [PubMed]

24.

Xia W, Yao X, Chen Y, Lin J, Vielhauer V, Hu H. Elevated TG/HDL-C and non- HDL-C/HDL-C ratios predict mortality in peritoneal dialysis patients. BMC Nephrology. 2020;21(1):01-09. [crossref] [PubMed]

25.

El-Fallah M, El-Salam A. A note on partial least squares regression for multicollinearity (A comparative study). International Journal of Applied Science and Technology. 2014;4(1):163-69.

26.

Kaplan RM, Irvin VL. Likelihood of null effects of large NHLBI clinical trials has increased over time. PloS One. 2015;10(8):e0132382. [crossref] [PubMed]

27.

Consultation WH. Obesity: Preventing and managing the global epidemic. World Health Organization technical report series. 2000;894:01-253.

28.

Johnson RA, Wichern DW. Applied multivariate statistical analysis. 5th ed. 2002; Chapter-6: 283-285.

29.

Schoenfeld D. Chi-squared goodness-of-fit tests for the proportional hazards regression model. Biometrika. 1980;67(1):145-53. [crossref]

30.

Mishra A, Harichandrakumar KT, Binu VS, Satheesh S, Nair NS. Multivariate approach in analyzing medical data with correlated multiple outcomes: An exploration using ACCORD trial data. Clinical Epidemiology and Global Health. 2021.1;11:100785. [crossref]

31.

R Studio Team. RStudio: Integrated development for R. RStudio, Inc., Boston, MA URL http://www. rstudio. com. 2015;42(14):84.

32.

Statistics IS. IBM Corp. Released 2013. IBM SPSS Statistics for Windows, Version 22.0. Armonk, NY: IBM Corp. Google Search. 2013.

33.

StataCorp LP. Stata multilevel mixed-effects reference manual. College Station, TX: StataCorp LP. 2013;9(10).

Tables and Figures

[Table / Fig - 1] [Table / Fig - 2] [Table / Fig - 3] [Table / Fig - 4]

DOI and Others

DOI: 10.7860/JCDR/2022/55379.16629

Date of Submission: Feb 02, 2022
Date of Peer Review: Mar 30, 2022
Date of Acceptance: Apr 27, 2022
Date of Publishing: Jul 01, 2022

AUTHOR DECLARATION:
• Financial or Other Competing Interests: None
• Was Ethics Committee Approval obtained for this study? The IEC granted waiver of consent for the study.
• Was informed consent obtained from the subjects involved in the study? NA
• For any images presented appropriate consent has been obtained from the subjects. NA

PLAGIARISM CHECKING METHODS:
• Plagiarism X-checker: Feb 05, 2022
• Manual Googling: Apr 26, 2022
• iThenticate Software: Jun 02, 2022 (8%)

ETYMOLOGY: Author Origin

JCDR is now Monthly and more widely Indexed .

Emerging Sources Citation Index (Web of Science, thomsonreuters)
Index Copernicus ICV 2017: 134.54
Academic Search Complete Database
Directory of Open Access Journals (DOAJ)
Embase
EBSCOhost
Google Scholar
HINARI Access to Research in Health Programme
Indian Science Abstracts (ISA)
Journal seek Database
Google
Popline (reproductive health literature)
www.omnimedicalsearch.com