<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>data analysis Archives - Ger Inberg</title>
	<atom:link href="https://gerinberg.com/category/data-analysis/feed/" rel="self" type="application/rss+xml" />
	<link>https://gerinberg.com/category/data-analysis/</link>
	<description>data science developer</description>
	<lastBuildDate>Tue, 03 Sep 2024 12:41:39 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.1</generator>

<image>
	<url>https://gerinberg.com/wp-content/uploads/2017/05/favicon-150x150.jpg</url>
	<title>data analysis Archives - Ger Inberg</title>
	<link>https://gerinberg.com/category/data-analysis/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>DrugExposure Diagnostics</title>
		<link>https://gerinberg.com/2023/04/01/drugexposurediagnostics/</link>
		
		<dc:creator><![CDATA[Ger]]></dc:creator>
		<pubDate>Sat, 01 Apr 2023 09:32:00 +0000</pubDate>
				<category><![CDATA[data analysis]]></category>
		<category><![CDATA[software engineering]]></category>
		<category><![CDATA[DrugExposureDiagnostics]]></category>
		<guid isPermaLink="false">https://gerinberg.com/?p=1848</guid>

					<description><![CDATA[<p>DrugExposureDiagnostics: A Comprehensive R Package for Assessing Drug Exposure in Clinical Research Drug exposure is an essential aspect of clinical research, as it directly affects the efficacy and safety of drugs. Measuring drug exposure accurately and understanding the factors that influence it is crucial for [&#8230;]</p>
<p>The post <a href="https://gerinberg.com/2023/04/01/drugexposurediagnostics/">DrugExposure Diagnostics</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div class="flex-1 overflow-hidden">
<div class="react-scroll-to-bottom--css-louzy-79elbk h-full dark:bg-gray-800">
<div class="react-scroll-to-bottom--css-louzy-1n7m0yu">
<div class="flex flex-col items-center text-sm dark:bg-gray-800">
<div class="group w-full text-gray-800 dark:text-gray-100 border-b border-black/10 dark:border-gray-900/50 bg-gray-50 dark:bg-[#444654]">
<div class="text-base gap-4 md:gap-6 md:max-w-2xl lg:max-w-2xl xl:max-w-3xl p-4 md:py-6 flex lg:px-0 m-auto">
<div class="relative flex w-[calc(100%-50px)] flex-col gap-1 md:gap-3 lg:w-[calc(100%-115px)]">
<div class="flex flex-grow flex-col gap-3">
<div class="min-h-[20px] flex flex-col items-start gap-4 whitespace-pre-wrap">
<div class="markdown prose w-full break-words dark:prose-invert light">
<div class="flex-1 overflow-hidden">
<div class="react-scroll-to-bottom--css-louzy-79elbk h-full dark:bg-gray-800">
<div class="react-scroll-to-bottom--css-louzy-1n7m0yu">
<div class="flex flex-col items-center text-sm dark:bg-gray-800">
<div class="group w-full text-gray-800 dark:text-gray-100 border-b border-black/10 dark:border-gray-900/50 bg-gray-50 dark:bg-[#444654]">
<div class="text-base gap-4 md:gap-6 md:max-w-2xl lg:max-w-2xl xl:max-w-3xl p-4 md:py-6 flex lg:px-0 m-auto">
<div class="relative flex w-[calc(100%-50px)] flex-col gap-1 md:gap-3 lg:w-[calc(100%-115px)]">
<div class="flex flex-grow flex-col gap-3">
<div class="min-h-[20px] flex flex-col items-start gap-4 whitespace-pre-wrap">
<div class="markdown prose w-full break-words dark:prose-invert light">
<p>DrugExposureDiagnostics: A Comprehensive R Package for Assessing Drug Exposure in Clinical Research</p>
<p>Drug exposure is an essential aspect of clinical research, as it directly affects the efficacy and safety of drugs. Measuring drug exposure accurately and understanding the factors that influence it is crucial for clinical decision-making. This is where the R package DrugExposureDiagnostics comes in handy.</p>
<p>As the author of this R package, I am excited to introduce you to this powerful tool for analyzing drug exposure data. Before delving into the package, let&#8217;s first understand what drug exposure is and why it is crucial in clinical research.</p>
<p>Drug exposure refers to the extent to which a drug enters and stays in the body, thereby producing its intended therapeutic effects. Measuring drug exposure accurately involves capturing key metrics, such as drug concentrations, AUC, Cmax, and Tmax. By doing so, researchers can evaluate drug efficacy and safety and make informed decisions regarding dosing and administration.</p>
<p>One way to capture drug exposure data is through the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), developed by the Observational Health Data Sciences and Informatics (OHDSI) community. The OMOP CDM standardizes and integrates data from various sources, allowing for large-scale observational studies and analysis.</p>
<p>This is where the R package DrugExposureDiagnostics comes in. It is a comprehensive tool for analyzing drug exposure data in the OMOP CDM format. The package includes functions for calculating various exposure metrics, handling missing data, and summarizing data at different levels, such as by subject or visit. Additionally, it provides tools for identifying outliers and comparing exposure between groups.</p>
<p>DrugExposureDiagnostics has been extensively tested and validated, ensuring that it produces accurate results. The package has been released on the <a href="https://cran.r-project.org/web/packages/DrugExposureDiagnostics/index.html">Comprehensive R Archive Network</a> (CRAN), making it easily accessible to R users worldwide. To use the package, simply install it using the install.packages() function in R and load it using the library() function.</p>
<p>If you are interested in learning more about DrugExposureDiagnostics or trying it out for yourself, visit the <a href="https://github.com/darwin-eu/DrugExposureDiagnostics">package github</a></p>
</div>
</div>
</div>
<div class="flex justify-between">
<div class="text-gray-400 flex self-end lg:self-center justify-center mt-2 gap-3 md:gap-4 lg:gap-1 lg:absolute lg:top-0 lg:translate-x-full lg:right-0 lg:mt-0 lg:pl-2 visible"></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<div class="absolute bottom-0 left-0 w-full border-t md:border-t-0 dark:border-white/20 md:border-transparent md:dark:border-transparent md:bg-vert-light-gradient bg-white dark:bg-gray-800 md:!bg-transparent dark:md:bg-vert-dark-gradient pt-2">
<form class="stretch mx-2 flex flex-row gap-3 last:mb-2 md:mx-4 md:last:mb-6 lg:mx-auto lg:max-w-3xl">
<div class="relative flex h-full flex-1 md:flex-col">
<div class="flex flex-col w-full py-2 flex-grow md:py-3 md:pl-4 relative border border-black/10 bg-white dark:border-gray-900/50 dark:text-white dark:bg-gray-700 rounded-md shadow-[0_0_10px_rgba(0,0,0,0.10)] dark:shadow-[0_0_15px_rgba(0,0,0,0.10)]"></div>
</div>
</form>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<p>The post <a href="https://gerinberg.com/2023/04/01/drugexposurediagnostics/">DrugExposure Diagnostics</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>CITO public analysis</title>
		<link>https://gerinberg.com/2021/03/12/cito-public-analysis/</link>
		
		<dc:creator><![CDATA[Ger]]></dc:creator>
		<pubDate>Fri, 12 Mar 2021 07:24:00 +0000</pubDate>
				<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data viz]]></category>
		<guid isPermaLink="false">https://gerinberg.com/?p=1750</guid>

					<description><![CDATA[<p>CITO is an institute in the Netherlands that support governments and schools so that they can develop world-class testing and monitoring systems to complete their educational programs. They have a lot of data regarding testing scores and it could be interesting to combine this data [&#8230;]</p>
<p>The post <a href="https://gerinberg.com/2021/03/12/cito-public-analysis/">CITO public analysis</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p><a href="https://www.cito.nl/" target="_blank" rel="noopener">CITO</a> is an institute in the Netherlands that support governments and schools so that they can develop world-class testing and monitoring systems to complete their educational programs. They have a lot of data regarding testing scores and it could be interesting to combine this data with public data. For example, are testing scores of children living in deprived areas worse than average?</p>
<h5>Exploratory Analysis</h5>
<p>Exploratory data analysis (EDA) is used by data scientists to analyze and investigate data sets and summarize their main characteristics. This is often done by using data visualization methods. The main purpose of EDA is to help look at data before making any assumptions. For me it&#8217;s one of the nicest parts of the data science! Since you don&#8217;t know yet what&#8217;s in the data and there will always be surprises. It&#8217;s like you are on holiday and exploring the area that you seeing for the first time:-)</p>
<p>For example, is a certain variable in the data normally distributed or not? Is there any missing data or duplicated values? In my experience, yes in most cases, there is missing and duplicated data. We need to fix these issues before we can do the real analysis. This phase is called data cleaning, you might have heard about this before.</p>
<h5>Representativeness Analyses</h5>
<p>In general, a representative sample is a group or set chosen from a larger statistical population that adequately replicates the larger group according to whatever characteristic or quality is under study. In case of CITO, we like to know if the sample data set has more or less the same characteristics regarding scores.  For example, are the average and standard deviation of the sample data set close to the ones of the total data set. I have plotted the distributions of the 2 data sets in a single chart, in order to compare them. In the subtitle one can find the average, standard deviation and the median.</p>
<p>Below you can find some of the charts I made for both EDA and the representativeness Analyses. The code is available in a public repository on <a href="https://github.com/ginberg/cito" target="_blank" rel="noopener">github</a>. It can be run using a docker container, R and <a href="https://rstudio.github.io/renv/articles/renv.html" target="_blank" rel="noopener">renv</a> for library management.</p>
<div class="envira-gallery-feed-output"><img decoding="async" class="envira-gallery-feed-image" src="https://gerinberg.com/wp-content/uploads/2021/06/ex_scores-1-640x480.png" title="Exploratory: scores and score per sex" alt="" /></div>
<p>The post <a href="https://gerinberg.com/2021/03/12/cito-public-analysis/">CITO public analysis</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Interactive Gene Explorer</title>
		<link>https://gerinberg.com/2019/03/06/interactive-gene-explorer/</link>
					<comments>https://gerinberg.com/2019/03/06/interactive-gene-explorer/#comments</comments>
		
		<dc:creator><![CDATA[Ger]]></dc:creator>
		<pubDate>Wed, 06 Mar 2019 03:42:36 +0000</pubDate>
				<category><![CDATA[data analysis]]></category>
		<category><![CDATA[data viz]]></category>
		<guid isPermaLink="false">https://gerinberg.com/?p=1327</guid>

					<description><![CDATA[<p>Since almost 2 years I am working for a global Bio pharmaceutical company. Together with researchers, I am working on applications to analyze (sometimes big) data to find medicines for patients with serious and life-threatening diseases. As you can imagine, the work is confidential and [&#8230;]</p>
<p>The post <a href="https://gerinberg.com/2019/03/06/interactive-gene-explorer/">Interactive Gene Explorer</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>Since almost 2 years I am working for a global Bio pharmaceutical company. Together with researchers, I am working on applications to analyze (sometimes big) data to find medicines for patients with serious and life-threatening diseases. As you can imagine, the work is confidential and so I am not able to share data or results with other people. However, there are a couple of initiatives to open-source some of the packages that are developed within the company, which is awesome! The <a href="https://blog.rstudio.com/2019/01/07/first-shiny-contest/" target="_blank" rel="noopener noreferrer">Shiny contest</a> gave me a good opportunity to display some of these packages in a Shiny application.</p>
<h5>Periscope</h5>
<p><a href="https://github.com/neuhausi/periscope" target="_blank" rel="noopener noreferrer">Periscope</a> is a new R package which provides a predefined but flexible template for new Shiny applications with a default dashboard layout. Furthermore it provides <span style="font-family: 'Source Sans Pro', sans-serif;">user alerts, a nice busy indicator, application reset and logging features. </span></p>
<p><span style="font-family: 'Source Sans Pro', sans-serif;">One of the most important features of the Shiny applications created with this framework is the separation by file of functionality that exists in one of the three Shiny <strong>scopes.</strong> These scopes are: global, server-global, and server-local. The framework forces application developers to consciously consider scoping in Shiny applications by making scoping distinctions very clear without interfering with normal application development. Scoping consideration is important for performance and scaling.  This is critical when working with large datasets and/or across many users. In addition to providing a template application, the framework also contains a number of convenient modules. </span></p>
<ul>
<li>(multi)file download button module</li>
<li>downloadable table module.</li>
</ul>
<h5>CanvasXpress</h5>
<p>The plots in the application are created using <a href="https://canvasxpress.org/html/index.html" target="_blank" rel="noopener noreferrer">CanvasXpress</a>, which is a package for (interactive) data visualization especially for reproducible research.  It supports a large number of visualizations to display scientific and non-scientific data which includes: Area, AreaLine, Bar, BarLine, Boxplot, Bubble, Candlestick, Chord, Circular, Contour, Correlation, Density, Donnut, DotLine, Dotplot, Genome, Heatmap, Histogram, Kaplan-Meier, Layout, Line, Map, Network, NonLinear-Fit, Oncoprint, ParallelCoordinates, Pie, Radar, Remote-Graphs, Sankey, Scatter2D, Scatter3D, ScatterBubble2D, Stacked, StackedLine, StackedPercent, StackedPercentLine, Sunburst, TagCloud, Tree, Treemap, Venn, Video, Violin. This <a href="https://blog.dominodatalab.com/large-visualizations-canvasxpress/" target="_blank" rel="noopener noreferrer">blogpost</a> gives a good overview on how to get started with CanvasXpress.</p>
<p>View the <a href="https://ginberg.shinyapps.io/gene_explorer/" target="_blank" rel="noopener noreferrer">Interactive Gene Explorer</a> or the source code on <a href="https://github.com/ginberg/gene_explorer" target="_blank" rel="noopener noreferrer">github</a>.</p>
<p><a href="https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer.png"><img decoding="async" class="alignnone size-medium wp-image-1328" src="https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-300x108.png" alt="Gene Explorer" width="300" height="108" srcset="https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-300x108.png 300w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-768x277.png 768w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-1024x369.png 1024w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-830x299.png 830w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-230x83.png 230w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-350x126.png 350w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer-480x173.png 480w, https://gerinberg.com/wp-content/uploads/2019/03/gene_explorer.png 1969w" sizes="(max-width: 300px) 100vw, 300px" /></a></p>
<p>The post <a href="https://gerinberg.com/2019/03/06/interactive-gene-explorer/">Interactive Gene Explorer</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://gerinberg.com/2019/03/06/interactive-gene-explorer/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
			</item>
		<item>
		<title>Transparent healthcare</title>
		<link>https://gerinberg.com/2016/08/24/transparent-healthcare/</link>
		
		<dc:creator><![CDATA[Ger]]></dc:creator>
		<pubDate>Wed, 24 Aug 2016 13:03:43 +0000</pubDate>
				<category><![CDATA[data analysis]]></category>
		<guid isPermaLink="false">http://gerinberg.com/?p=632</guid>

					<description><![CDATA[<p>A couple of weeks ago, the Dutch insurance company CZ released a list of treatments with its prices at various hospitals. This is quite unique, because till now the health sector is far from transparent. Hopefully this is part of a trend that will benefit patients [&#8230;]</p>
<p>The post <a href="https://gerinberg.com/2016/08/24/transparent-healthcare/">Transparent healthcare</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></description>
										<content:encoded><![CDATA[<div>A couple of weeks ago, the Dutch insurance company <span class="il">CZ</span> released a list of treatments with its prices at various hospitals. This is quite unique, because till now the health sector is far from transparent. Hopefully this is part of a trend that will benefit patients in choosing the best hospital for their needs.</div>
<div>Based on the released <a href="https://www.cz.nl/over-cz/inkoop-van-zorg/wat-kost-uw-behandeling-in-het-ziekenhuis" target="_blank">pricelist</a>, I have done some analysis. See below for the main points from this analysis or view the <a href="https://ginberg.shinyapps.io/zorgtarieven/" target="_blank">visualization</a> I made.</div>
<div></div>
<div>
<hr />
</div>
<h6>Treatments with biggest difference in price</h6>
<p>The top 3 treatments (in Dutch) with biggest price difference</p>
<ol>
<li>1 of 2 dagbehandelingen of 6 of meer polikliniekbezoeken bij een huidaandoening met bultjes en schilfers (min =€183,87, max =€989,01, <strong>diff = €805,14</strong>)</li>
<li>Begeleiding door IC-arts of intensivist van een patient die vervoerd wordt van de IC van het ene ziekenhuis naar de IC van een ander ziekenhuis, waarbij vervoer inclusief wachttijd ambulance, overdrac tarief (min =€205,19, max =€978,84, <strong>diff = €773,65</strong>)</li>
<li>Medebehandeling door: een klinisch geriater (min =€166,67, max =€935,07, <strong>diff = €768,40</strong>)</li>
</ol>
<p>&nbsp;</p>
<h6>Hospitals with (on average) the most/least expensive treatments</h6>
<p>As seen in the last paragraph, the differences in price for a given treatment can be big. It is interesting to know which hospitals are the most expensive and which ones are the cheapest. However, a hospital can be expensive for a given treatment but cheap for another treatment. A reason for this could be that the hospital is specialized in certain treatments.</p>
<p>The top 3 hospitals (in Dutch) with on average the cheapest treatments. The price in brackets is the average amount below the average price for all treatments the hospital provides.</p>
<ol>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0"><a href="https://www.nkal.nl/" target="_blank">Stichting Nederlands Kenniscentrum Arbeid &amp; Longaandoeningen</a> (€ 233,55)</p>
</li>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0">F.P.L. Van Loon ( € 184,09)</p>
</li>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0">St. Psoriasisdagbehandelingscentrum Midden-Nederland ( € 183,77)</p>
</li>
</ol>
<p>And the top 3 hospitals (in Dutch) with on average the most expensive treatments.</p>
<ol>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0">Epilepsie Centrum Kempenhaeghe (€ -247,80)</p>
</li>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0">Stichting Beste Zorg (nok Brunssum) ( € -218,30)</p>
</li>
<li>
<p id="rstudio_console_output" class="GOSLM4IHOB" tabindex="0">Diabeter B.V. ( € -127,58)</p>
</li>
</ol>
<p>&nbsp;</p>
<p><a href="http://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1.png"><img fetchpriority="high" decoding="async" class="alignnone wp-image-660 size-medium" src="http://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-300x197.png" width="300" height="197" srcset="https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-300x197.png 300w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-768x504.png 768w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-1024x672.png 1024w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-830x544.png 830w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-230x151.png 230w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-350x230.png 350w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1-480x315.png 480w, https://gerinberg.com/wp-content/uploads/2016/08/zorgtarieven-1.png 1575w" sizes="(max-width: 300px) 100vw, 300px" /></a></p>
<p>The post <a href="https://gerinberg.com/2016/08/24/transparent-healthcare/">Transparent healthcare</a> appeared first on <a href="https://gerinberg.com">Ger Inberg</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
