<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[BioBox Blog]]></title><description><![CDATA[The Decision OS for AI-driven Pharma]]></description><link>https://blog.biobox.io</link><image><url>https://substackcdn.com/image/fetch/$s_!RYsr!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png</url><title>BioBox Blog</title><link>https://blog.biobox.io</link></image><generator>Substack</generator><lastBuildDate>Thu, 09 Apr 2026 15:09:04 GMT</lastBuildDate><atom:link href="https://blog.biobox.io/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[BioBox Analytics]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[biobox@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[biobox@substack.com]]></itunes:email><itunes:name><![CDATA[Christopher Li]]></itunes:name></itunes:owner><itunes:author><![CDATA[Christopher Li]]></itunes:author><googleplay:owner><![CDATA[biobox@substack.com]]></googleplay:owner><googleplay:email><![CDATA[biobox@substack.com]]></googleplay:email><googleplay:author><![CDATA[Christopher Li]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Beware the Ouroboros of Biology.]]></title><description><![CDATA[A system struggling to break from consensus.]]></description><link>https://blog.biobox.io/p/beware-the-ouroboros-of-biology</link><guid isPermaLink="false">https://blog.biobox.io/p/beware-the-ouroboros-of-biology</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Mon, 02 Mar 2026 18:21:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!AWrj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AWrj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AWrj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AWrj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:9378376,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.biobox.io/i/185435299?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AWrj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!AWrj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cad734b-99b9-4e6a-b214-caba5234d9e5_2816x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Much of the AI application ecosystem, from RAG to tool use to massive token context windows, is optimizing for a single objective: <strong>reducing response entropy by feeding the model more information. </strong>The more context you provide the more the distribution narrows. </p><p>This is a powerful system that produced the co-pilots that summarize papers in seconds and now, AI-powered platforms that run workflows and do bioinformatics analysis, and foundation models that excel at physics/chemistry tasks (e.g. Alphafold, <a href="https://www.chaidiscovery.com/">Chai</a>, Boltz). For problems where the &#8220;right&#8221; answer is derivable from available information, reducing entropy through context works beautifully.</p><p><a href="http://harvey.ai">Harvey</a>, the AI for law, is one of the clearest examples of this. Many legal tasks reward fast convergence over a known body of rules, precedent, and language. It is hard to beat a system that can absorb a vast body of statutes, case law, and legal argument and retrieve it instantly. The rules are relatively legible, and much of the relevant context is available. This kind of problem rewards the ability to eliminate response entropy and converge quickly and precisely.</p><p>That is why so many knowledge industries are vulnerable to this type of disruption. And the biggest prize is pharmaceuticals. It is the Formula 1 of the knowledge economy. The implicit thesis was simple: instead of legal briefs, feed the system scientific literature, and it will learn biology.</p><p>The result has been a torrent of &#8220;AI Co-Scientist&#8221; and &#8220;Neurosymbolic AI&#8221; companies entering the most overhyped, overfunded, and epistemologically confused segment in vertical AI. And that confusion is a direct consequence of applying convergent tools to a fundamentally divergent problem.</p><p>There is a set of problems where entropy reduction breaks down entirely because it punishes convergence and rewards divergence. The highest-value decisions in drug discovery are exactly this kind of problem.</p><p>The decisions that matter, like which targets to pursue, patient populations to bet on, and which mechanisms to trust when the biology is ambiguous, are not made by converging on available information in literature. They are made by someone who has seen something others haven't, interpreted failure differently than the rest, or held a conviction that ran against consensus long enough to be proven right.</p><p>Alpha lives in the divergence.</p><p>And this is where the Ouroboros appears.</p><h1>The Ouroboros of Biology.</h1><p>Drug discovery relies on two principal factors: <strong>Asymmetric information</strong> and <strong>Differentiated Thinking</strong>. You win because you know something they don&#8217;t and you have smarter scientists who think outside the box. Because of these competitive dynamics, no one is incentivized to share their asymmetric knowledge, let alone publish it.</p><p><strong>We train models on the published record of biology, then ask them to generate the next frontier of biological insight from that same record.</strong> That is the Ouroboros. A system trying to escape the limits of consensus by recursively consuming consensus. But the published record is not biology.</p><blockquote><p><strong>No one in the history of drug discovery made a drug because they read the literature better.</strong></p></blockquote><p>The scientific literature is a curated, systematically optimistic, self-referential record of what scientists were willing to report, what journals were willing to print, and what funding agencies were willing to support. Papers cite papers that cite papers that no one could reproduce. Positive results get published with a narrative. Negative results disappear into reports, decks, and private team memory.</p><p>The top scientific R&amp;D teams know this already. Inside serious drug discovery organizations, this is almost banal. But for the outsiders building in this space, there is this tendency to over index on the value of literature as ground truth. Poor understanding of this reality is most obvious when vendors market their knowledge graph or foundation model to be trained on hundreds of millions of papers and abstracts (yes&#8230;even conference abstracts) as if sheer volume somehow makes the system better.</p><h1>A More Structured Ouroboros.</h1><p><em>&#8220;Fine. Literature alone is noisy. That&#8217;s why we use a knowledge graph to structure the claims. And we also integrate real-world and proprietary customer data into the graph.&#8221;</em></p><p>This is directionally correct but hides an insidious trap.</p><p>The value of a knowledge graph is the ontology. The ontology is not neutral, it is judgment. Every definition in the schema encodes a view of the world. What counts as the same thing? What makes things different? What gets linked? What gets collapsed? What do words mean?</p><p>An ontology is a scientific world view made operational.</p><p>The most important question is: <strong>whose graph is it?</strong></p><p><strong>If it is not your ontology, it is not your knowledge graph.<br>If it is not your knowledge graph, it is not your science.<br>If it is not your science, it is not your decision.</strong></p><p>The moment a vendor says they &#8220;integrate customer data into their graph&#8221;, <strong>you should ask what scientific judgment you are implicitly outsourcing because you&#8217;ve just relinquished control</strong>. They are deciding how your internal assays map to external biology. How your translational signals get normalized. How your negative data gets represented. What counts as corroboration. What counts as contradiction. What counts as enough evidence to connect one claim to another.</p><p>You cannot have this both ways. You cannot claim that your proprietary ontology-driven evidence graph is valuable <strong>and</strong> ingesting customer data into that proprietary graph. You are either infrastructure that allows the customer to instantiate, govern, and evolve their own scientific worldview. Or you are a scientific intermediary that aggregates, reshapes, and ultimately substitutes for that worldview.</p><p><em><strong>A simple test is to ask the vendor to change the underlying ontology to accommodate your internal definitions. See what happens.</strong></em></p><h1>The Contradiction at the Center</h1><p>This is why so many of these platforms feel impressive in demo mode and suspicious in principle. They can absolutely help teams search faster, summarize faster, traverse claims faster, and maybe even operate workflows faster. <strong>But as soon as you diverge from consensus, everything starts to break.</strong></p><p>A pharma company does not win because it has access to information. Everyone has access to information. It wins because it developed a way of seeing that information differently, grounded in data others do not have and judgment others cannot replicate. <strong>This is the reasoning layer that cannot be outsourced without consequence.</strong> <strong>It is where scientists encode scientific reasoning and capture the differentiated thinking</strong> a team uses to decide what to believe, what to ignore, what to test, and where to place conviction under uncertainty.</p><h1>How do we get out of this cycle?</h1><ol><li><p><strong>Reset on expectations.</strong> You are NOT going to get a Harvey for drug discovery. It&#8217;s a different problem class (convergent vs. divergent).</p></li><li><p><strong>Stop this. </strong>These types of posts are so inflammatory I need a shot of Dupixent.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wxxe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wxxe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 424w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 848w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 1272w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wxxe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png" width="528" height="465.7746967071057" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1018,&quot;width&quot;:1154,&quot;resizeWidth&quot;:528,&quot;bytes&quot;:193587,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.biobox.io/i/185435299?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wxxe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 424w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 848w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 1272w, https://substackcdn.com/image/fetch/$s_!Wxxe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c169cc2-cfd3-4346-8791-75af589be127_1154x1018.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Real VC twitter post</figcaption></figure></div><h1>Final Thoughts.</h1><p>Drug discovery does not reward systems that converge fastest on the published record. It rewards teams that know something others do not, interpret evidence differently, and maintain conviction under uncertainty.</p><p>That means the future of AI in biology will not belong to whoever reads the most papers or structures the most claims. It will belong to whoever helps scientific organizations encode their own ontology, their own context, and their own judgment without surrendering them to a vendor-controlled worldview.</p><p>Otherwise, the snake keeps eating its tail.</p><p></p><p></p><div><hr></div><p><em><a href="https://biobox.io">BioBox</a> is the Decision Operating System for AI-driven pharma.</em></p>]]></content:encoded></item><item><title><![CDATA[Biology is a graph.]]></title><description><![CDATA[Why every pharma needs a knowledge graph as foundational infrastructure.]]></description><link>https://blog.biobox.io/p/biology-is-a-graph</link><guid isPermaLink="false">https://blog.biobox.io/p/biology-is-a-graph</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Mon, 03 Nov 2025 20:05:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RYsr!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Biology Doesn&#8217;t Fit in Tables</h1><p>If you&#8217;ve worked in drug discovery, or just biological sciences in general, you&#8217;ve encountered the frustration of spending a ridiculous amount of time pulling data together to answer seemingly straightforward questions like &#8220;What are the druggable targets in pathway X that are overexpressed in tumor type Y?&#8221;.</p><ul><li><p>Opening six different databases</p></li><li><p>Downloading CSV files from PubMed, TCGA, ChEMBL, and your internal systems</p></li><li><p>Writing brittle Python scripts to join these datasets</p></li><li><p>Discovering that gene names don&#8217;t match across sources (HUGO vs. Entrez vs. Ensembl)</p></li><li><p>Spending three days on data wrangling before you can even start analysis</p></li><li><p>Producing an answer that&#8217;s already stale because you can&#8217;t easily update it</p></li></ul><p>There are those who believe that agentic systems or &#8220;AI scientists&#8221; will eliminate these problems, they don&#8217;t (which is a topic for another blog post, maybe).</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The reality is that <strong>this isn&#8217;t a tooling problem. It&#8217;s a representation problem. </strong>We&#8217;ve been trying to force inherently graph-structured data into tabular formats for decades and it&#8217;s killing productivity and forcing us to build things that don&#8217;t need to exist if you just modeled data properly.</p><p>The argument for this post is simple: <strong>Biology is a graph. Any other representation is a lossy approximation that handicaps our ability to do science at scale.</strong></p><h1>Why Biology is Fundamentally Graph Structured</h1><h2>Biological Meaning Exists in Relationships</h2><p>An isolated biological entity has almost no meaning. The information content is encoded in how the parts interact, regulate, and influence each other. Consider TP53, perhaps the most studied gene in all of cancer biology. In any given database you&#8217;ll see an entry that looks like this:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v66s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v66s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 424w, https://substackcdn.com/image/fetch/$s_!v66s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 848w, https://substackcdn.com/image/fetch/$s_!v66s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 1272w, https://substackcdn.com/image/fetch/$s_!v66s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v66s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png" width="1456" height="109" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:109,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80762,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.biobox.io/i/177896347?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v66s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 424w, https://substackcdn.com/image/fetch/$s_!v66s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 848w, https://substackcdn.com/image/fetch/$s_!v66s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 1272w, https://substackcdn.com/image/fetch/$s_!v66s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F907531ca-9584-49a7-882a-d197d55e43fb_3412x256.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Does this tell you anything useful about p53? Does it explain why it&#8217;s the guardian of the genome? Does it hint at why 1000s of papers have been published on it?</p><p>No. This representation strips away everything that makes p53 biologically meaningful.</p><p><strong>Biological meaning is an emergent property of the network.</strong></p><p><strong>As a disease driver,</strong> p53 is mutated in more than half of all human cancers. But it&#8217;s not uniform and the spectra differs dramatically depending on the cancer type. For example, in Lung AD, mutations are mostly missense and cluster within DBD create stable yet dysfunctional proteins. In ovarian cancers, p53 mutations are mixtures of missense and truncating variants that eliminate protein function altogether. These differences have important biological and therapeutic implications. A single mutational frequency column cannot capture this type of nuance.</p><p><strong>As a transcription regulator,</strong> p53 controls expression of 100s of downstream genes. When DNA damage is detected, TP53 upregulates CDKN1A (p21) to arrest the cell cycle, giving time for repair. It activates BBC3 (PUMA) and BAX to trigger apoptosis if damage is irreparable. It regulates RRM2B and SCO2 to modulate metabolism. It induces SESN1 and SESN2 to manage oxidative stress. The circuits that p53 regulates is not a singular output. It&#8217;s a context-dependent network of regulatory relationships that changes based on cell type, stress type, and cellular state.</p><p><strong>As a protein,</strong> p53 exists in a sprawling web of physical interactions that determine its activity. In unstressed cells, MDM2 targets it for degradation to keep levels low. MDM4<em> </em>reinforces this repression. Acetylation by p300/CBP enhances transcriptional activity by stabilizing it. Deacetylation by SIRT1 reverses activation. The point is that it&#8217;s not a simple on/off switch. These are complex dynamically regulated networks of protein-protein interactions, post-translational modifications, and feedback loops that are, again, context-dependent.</p><p><strong>As a therapeutic target</strong>, TP53 defines treatment strategies through multiple relationship types. In tumors with wild-type p53, MDM2 inhibitors can reactivate the p53 pathway. In tumors with mutant p53, the mutation creates new vulnerabilities where loss of TP53 function produces synthetic lethality with WEE1 inhibitors, ATR inhibitors, and PARP inhibitors in certain genetic contexts. Mutant p53 can also gain oncogenic functions through interactions with other transcription factors. Understanding how to treat a TP53-altered cancer requires traversing from the mutation through the pathway network to potential therapeutic interventions.</p><h2>Tables joins become nuclear level explosions.</h2><p>Try to place them in tables in a relational db and they might look as follows:</p><ul><li><p>genes table with basic gene info</p></li><li><p>gene_disease table linking disease associations</p></li><li><p>gen_expression table with transcriptional expression data across tissues/cells/clinical popiulations</p></li><li><p>ppi table</p></li><li><p>regulatory_relationships table with transcription factor targets</p></li><li><p>pathways table with pathway membership</p></li><li><p>mutations table with variant data</p></li><li><p>drug_targets table with therapeutic relationships</p></li></ul><p>This seems reasonable until you try to answer actual biological questions.</p><p><strong>&#8220;What are the druggable proteins that interact with genes in the p53 pathway that are overexpressed in lung adenocarcinoma?&#8221;</strong></p><p>In SQL, this requires:</p><ol><li><p>Query the pathways table to get genes in the p53 pathway</p></li><li><p>Join to the gene_expression table filtered for lung adenocarcinoma</p></li><li><p>Join to the protein_interactions table to get interacting proteins</p></li><li><p>Join to the drug_targets table to filter for druggable proteins</p></li><li><p>Potentially join back to the genes table to get protein-coding gene information</p></li></ol><p>That&#8217;s 4-5 joins across different tables. Now add in the complications:</p><ul><li><p>Gene identifiers don&#8217;t match across tables (HUGO symbols vs. Entrez IDs vs. Ensembl IDs)</p></li><li><p>You need to filter by expression thresholds, but what threshold?</p></li><li><p>&#8220;Overexpressed&#8221; needs to be compared against normal tissue, that&#8217;s another join, also what threshold?</p></li><li><p>&#8220;Druggable&#8221; has multiple definitions (ligandable pocket, known ligands, approved drugs)&#8212;more branching logic</p></li><li><p>Pathway membership is ambiguous (is indirect regulation included? what about post-translational regulation?)</p></li></ul><p>Each of these complications adds more joins, more logic, more query complexity. Change one criterion and you need to rewrite significant portions.</p><h2>Context-Dependent Relationships Cannot Be Flattened</h2><p>This is the most fundamental problem with tabular representations. Biological relationships are entirely context-dependent, and context doesn&#8217;t fit in columns neatly.</p><p>If you want to model Gene X regulates Gene Y, this isn&#8217;t binary, it depends on:</p><ul><li><p>Cell type</p></li><li><p>Developmental stage</p></li><li><p>Disease state</p></li><li><p>Environmental conditions</p></li><li><p>Genetic background</p></li><li><p>Measurement</p></li><li><p>Experimental conditions</p></li></ul><p>In a table, you have three options to choose from, all of them are bad.</p><ol><li><p>Ignore the context. Store just &#8220;X regulates Y&#8221; and lose accuracy.</p></li><li><p>Add context columns to the table. But how many columns do you add? How often does this schema change? How do you query across contexts?</p></li><li><p>Create separate records for each context. Massive duplication.</p></li></ol><p>In a graph, this can be natively modeled and captured by traversing graph nodes. It&#8217;s also easy to chain multiple patterns together to get more and more complex relationships as network topologies.</p><h2>Biological Identity Depends on Network Position</h2><p>What a specific entity &#8220;is&#8221; depends on where it sits in the graph (or subgraph).</p><p>For example, is PKC an oncogene or a tumor suppressor? Well, it depends. Which tissue? Which pathway context? PKC promotes cell survival in some contexts and apoptosis in others. To understand the functional role, you must examine its network position.</p><p>Is CDK4/6 inhibition growth-suppressive or growth-promoting? In ER+ breast cancer with intact Rb, it&#8217;s suppressive. That&#8217;s why palbociclib works. In Rb-null cancers, blocking CDK4/6 can paradoxically enhance growth by removing cell cycle checkpoints. The therapeutic effect depends on the network context.</p><p>This is a fundamental principle: <strong>biological function is not an intrinsic property of an entity, it&#8217;s an emergent property of that entity&#8217;s position in a network of relationships.</strong></p><p>Tables force you to assign properties to entities: &#8220;CDK4.function = cell cycle progression.&#8221; Graphs let you represent the truth: &#8220;CDK4 function emerges from its relationships with cyclins, CDK inhibitors, Rb, E2F, and downstream targets, which differ by cell type and genetic context.&#8221;</p><h1>You need a graph.</h1><p>So what? Why should you care?</p><p><strong>Productivity cost</strong>: Scientists spend 30-40% of their time wrangling data instead of analyzing it. Most of this time is spent joining, aligning, and integrating datasets i.e., reconstructing the relationship network that should have been preserved in the first place. No, an AI agent doesn&#8217;t help you do this either.</p><p><strong>Correctness cost</strong>: Flattening relationships loses context, which leads to incorrect conclusions. How many drug targets have been pursued based on gene expression data that ignored tissue context? How many pathways are misunderstood because protein isoforms were conflated?</p><p><strong>Innovation cost</strong>: Complex questions that require multi-hop reasoning are simply not asked because they&#8217;re too hard to answer in SQL. Scientists constrain their hypotheses to fit the limitations of their database, not the boundaries of biological possibility.</p><p>This is why every major drug discovery organization needs a knowledge graph. Not because graphs are trendy or cool, but because <strong>biology is a graph, and representing it otherwise is scientifically incorrect and practically limiting.</strong></p><p></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Applied graph analytics for indication expansion in drug discovery.]]></title><description><![CDATA[Part 2 - Graph Patterns for Indication Selection]]></description><link>https://blog.biobox.io/p/applied-graph-analytics-for-indication</link><guid isPermaLink="false">https://blog.biobox.io/p/applied-graph-analytics-for-indication</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Tue, 11 Feb 2025 21:04:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This is a continuation of a multi-part series on Graph Patterns for Indication Selection.</em></p><p><em><a href="https://blog.biobox.io/p/graph-pattern-series-indication-selection">Part 1 - Patient Recruitment using Graphs</a></em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><h1>Introduction</h1><p><strong>Indication expansion</strong> aims to increase the value of your drug by finding additional diseases that can benefit from it. Typically the drug is already in clinic or in market. More indications = more money. This analysis tells you <em>&#8220;what&#8221;</em> diseases makes sense based on biological principles. Another type of analysis known as <strong>indication sequencing</strong> describes the <em>&#8220;how&#8221; </em>and <em>&#8220;when&#8221;</em> to expand strategically.</p><p>All therapeutics, at a very fundamental level, basically translates to &#8220;give a <strong>substance</strong> to a <strong>biological thing</strong> to modulate a <strong>biological event</strong> that, when it is abnormal, is part of some <strong>disease</strong>&#8221;.</p><p>The key intuition here is to think of the association between a drug and the indication as a function of the biological effects it can modulate within a biological context. For example, you take an Advil when you have a migraine but Advil treats inflammation, which happens to be part of the symptoms of a migraine.</p><p><strong>What exactly is a biological event?</strong></p><p>A biological event refers to any measurable change within a living system caused by a specific interaction, such as the activation or inhibition of a biochemical pathway, receptor, enzyme, or gene.</p><p>How the substance interacts with the target to exert the <em>correct change</em> to a biological event is what we refer to as the <strong>mechanism of action </strong>(MOA). Biological complexity gives us an opportunity to exploit fundamental processes in different ways through a variety of targets including genes, proteins, protein-protein interactions, specific nucleotides, epigenetic modifications, and so on. </p><p>The participants in these effects vary widely, including proteins, small molecules, macro-complexes, and entire tissues. Biological effects also depend on context: the same molecule may behave differently in distinct tissues or under varying conditions, such as stress or infection.</p><p>The use of graphs in indication expansion is well-known. To understand why graphs are so powerful for this type of work, we will trace <a href="https://en.wikipedia.org/wiki/Dupilumab">Dupilumab</a>, a biologic that was initially approved for atopic dermatitis (AD) but has since <a href="https://www.drugs.com/history/dupixent.html">gained approvals</a> across multiple chronic inflammatory diseases such as COPD and asthma.</p><h1>Graphing Biology</h1><p>Dupilumab is a monoclonal antibody that acts as a receptor antagonist by binding to the alpha-subunit of interleukin-4 receptor (IL4R).</p><p>I&#8217;ve actually seen knowledge graphs in the wild that look like this.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bpJ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bpJ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 424w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 848w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 1272w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bpJ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png" width="531" height="132.98248686514887" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:286,&quot;width&quot;:1142,&quot;resizeWidth&quot;:531,&quot;bytes&quot;:35479,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bpJ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 424w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 848w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 1272w, https://substackcdn.com/image/fetch/$s_!bpJ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa84e96b7-750a-4473-9d28-ea713161e67f_1142x286.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">An example of a useless graph structure</figcaption></figure></div><p>This is useless. It gives you no analytical advantage, regardless of how &#8220;expertly curated&#8221; it is. The graph topology does not enable you to make deductions or inferences. Most of the value in your graph is in the design of the ontology.</p><p><strong>Here&#8217;s what you should do instead.</strong></p><p>Use a concept called <em><strong>Event</strong></em> that captures useful information about biological reactions such as what the input substrate, output and the context of the event.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!i-Jb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i-Jb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 424w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 848w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 1272w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i-Jb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png" width="570" height="453.01810865191146" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ebb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:994,&quot;resizeWidth&quot;:570,&quot;bytes&quot;:89563,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!i-Jb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 424w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 848w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 1272w, https://substackcdn.com/image/fetch/$s_!i-Jb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febb4ddec-b993-4c0b-bcf2-970d7002ad8d_994x790.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Graph representation of IL4 ligand binding to IL4R</figcaption></figure></div><p>Next, describe the specific event that the drug is affecting. Dupilumab binds to IL4R and occludes the fibronectin domain, preventing IL4 from binding.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!haOk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!haOk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 424w, https://substackcdn.com/image/fetch/$s_!haOk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 848w, https://substackcdn.com/image/fetch/$s_!haOk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 1272w, https://substackcdn.com/image/fetch/$s_!haOk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!haOk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png" width="601" height="416.6933333333333" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1200,&quot;resizeWidth&quot;:601,&quot;bytes&quot;:126579,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!haOk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 424w, https://substackcdn.com/image/fetch/$s_!haOk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 848w, https://substackcdn.com/image/fetch/$s_!haOk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 1272w, https://substackcdn.com/image/fetch/$s_!haOk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ff2a40f-c93a-48a7-9aaf-16efe400b400_1200x832.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Next, we introduce the <em><strong>Process</strong></em> concept to assign the logical order to a collection of events. Zooming out and expanding the diagram above:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3BX7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3BX7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 424w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 848w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 1272w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3BX7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png" width="1396" height="2054" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2054,&quot;width&quot;:1396,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:287048,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3BX7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 424w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 848w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 1272w, https://substackcdn.com/image/fetch/$s_!3BX7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F866abcb2-5376-48ab-8c22-fe1f3c809693_1396x2054.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"></figcaption></figure></div><p>Dupilumab works because it attenuates Th2 pathway activation by crippling STAT6 activation. We can map the impact of the inhibition through evaluating the transcriptional targets of STAT6. To find these targets, you can rely on literature mining or determine these target empirically using epigenetic data sources such as ENCODE. Among these targets include <em>CCL13, IGHE</em>, <em>IGHG1</em>, and <em>IGHG4 </em>that drive type 2 inflammation. Importantly, STAT6 activation leads to IL4R upregulation and increase IL4 secretion, and executes a positive feedback loop to further amplify the response.</p><h1>Causal Biology Modelling &amp; Inference</h1><p>Dupilumab is a blockbuster drug with 6 approved indications and more in the pipeline. This &#8220;pipeline in a product&#8221; type drug is extremely profitable because the drug targets a specific mechanism that is shared across a collection of diseases, in this case, STAT6 activation in type II immune response related diseases. </p><p>What happens when STAT6 is activated? What phenotypes are expected? What diseases are relevant?</p><p>We can leverage graph algorithms to answer these questions.</p><p>First, let&#8217;s build and visualize the network. </p><ul><li><p>Starting from STAT6, mark all its transcriptional targets using ChIP-seq evidence. Specifically, you are looking for co-localization of ChIP-seq peaks for STAT6, H3K4me3, and H3K27Ac.</p></li><li><p>For any STAT6 targets, that are known to have DNA binding domains or are known transcription factors, expand out the transcriptional targets.</p></li><li><p>Repeat the process 2-3 times outwards</p></li><li><p>For all the genes marked - load in disease associations</p></li><li><p>Apply <a href="https://www.researchgate.net/publication/229019459_New_circular_drawing_algorithms">circular drawing algorithm</a> for visualization</p></li></ul><p>The positions of nodes in this projection are not arbitrary - they are determined by the topology of the network. Nodes that are more closely related (for example, those that share many edges or are part of the same community) tend to be placed next to one another. You are essentially seeing a one&#8208;dimensional (angular) projection of the network&#8217;s connectivity. Nodes placed next to each other are likely to interact or be functionally related, while nodes that are far apart on the circle are less directly connected</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wwqj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wwqj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 424w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 848w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 1272w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wwqj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png" width="1456" height="884" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:884,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2424697,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wwqj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 424w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 848w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 1272w, https://substackcdn.com/image/fetch/$s_!Wwqj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34af9109-53c1-4239-af09-9559a59ea536_2884x1751.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>In this application, we are doing a sanity check to see if the projection aligns with what we&#8217;d expect to see. For example, we know that Dupilumab is indicated for atopic eczema, so we should expect to find it somewhere here in this network. Indeed, we find it placed alongside <em>CLDN1</em> and a variety of skin related disorders.</p><p>So far so good.</p><p>An interesting cluster with high convergence around <em>STAT3 </em>and a collection of chronic inflammatory diseases such as ulcerative colitis, psoriasis, and IBD.</p><p>The connection become clear when we run a path-finding algorithm between <em>STAT3 </em>and <em>STAT6</em>. Turns out, <em>STAT6</em> drives the expression of <em>IL4R </em>and <em>IL4</em> secretion. <em>IL4R</em> in heterodimer with <em>IL13RA1 </em>or <em>IL2RG </em>are both capable of phosphorylating <em>STAT3</em>, representing an alternate mechanism outside of the canonical IL-6 family activation loops. I&#8217;d suspect that Regeneron will starting going after treatment refractory UC and IBD soon.</p><h1>Assessing efficacy for strategic expansion</h1><p>&#8220;Pipeline in a product&#8221; are the holy-grail products that every pharma is chasing. When staging the launch sequence, one of the successful ways to build momentum is to <a href="https://www.iqvia.com/library/white-papers/success-multiplied-launch-excellence-for-multi-indication-assets">use a narrow-first approach</a>. Among the different variables for optimizing this launch strategy, one of them is to ensure you line up indications that you know are going to demonstrate strong efficacy and high market differentiation from standard of care. Translated into systems biology, we can interpret this as saying, does our drug impact the causal biological process <strong>the most</strong>. </p><p>How do you quantify that? <strong>Walk through the graph and calculate the path weights. (more on this in part 3 of this series)</strong></p><p>To illustrate this, let&#8217;s use the recent example of <a href="https://www.fiercebiotech.com/biotech/bristol-myers-backs-out-dupixent-fight-axing-allergy-asset-despite-phase-3-win">BMS&#8217;s Cendakimab dropping out against Dupilumab</a>. This was a wise decision because Cendakimab would have 100% lost this fight.</p><p>Cendakimab is a IL13 inhibitor. The goal is still to decrease IL4/IL13 signaling, which is the same as saying STAT6 suppression. There critical event is STAT6 phosphorylation, which happens under IL4R-alpha dependent heterodimerization. We can kick this cascade off at these entry events:</p><ul><li><p>IL4 binds to IL4R &#8594; recruits IL13RA1</p></li><li><p>IL13 binds to IL4R &#8594; recruits IL13RA1</p></li><li><p>IL13 binds to IL13RA1 &#8594; recruits IL4R</p></li></ul><p>At first glance, it might look like an IL13 blockade alone could achieve meaningful, or at least, comparable results because it serves as 2 potential entry points. This argument falls apart when we include the <strong>biological context</strong> in which these events are taking place. Specifically, IL13 binds to IL13RA2 (a decoy receptor) with much higher affinity than IL13RA1. Also, IL4 is still able to exert its effect, independent of IL13 blockade.</p><p>At best, Cendakimab might edge out a win on dosing regiment efficiency and a narrower ADR profile, but it&#8217;s not enough to drive differentiation and improvement over the current standard of care.</p><h1>Conclusion</h1><p>In this post, we dived deeper into how network graph analytics can help inform critical decisions made along the drug discovery landscape. Building a model to capture the complexity of systems biology can only be done in a structured ontology and knowledge graph. The investment in this resource unlocks powerful capabilities that research teams can use to answer questions in indication selection and strategic expansion in a principled way.</p><div><hr></div><h1><em>What&#8217;s Next?</em></h1><p>Upcoming in this series, we&#8217;ll switch gears into building scoring models and graph neural networks that can help take safety risks off the table when evaluating indications.</p><div><hr></div><p><strong><a href="https://biobox.io/">BioBox</a></strong> is the knowledge infrastructure for modern biopharma research, built for drug hunters who need to integrate multi-modal data, engineer knowledge, and test hypotheses at scale. To learn more, please visit our website at https://biobox.io or <a href="mailto:sales@biobox.io">click here to contact us</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Graph Pattern Series: Indication selection.]]></title><description><![CDATA[Part 1 - Introduction, Patient Recruitment with Graphs]]></description><link>https://blog.biobox.io/p/graph-pattern-series-indication-selection</link><guid isPermaLink="false">https://blog.biobox.io/p/graph-pattern-series-indication-selection</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Wed, 22 Jan 2025 17:38:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Note: This is the first of a multi-part series on how some of the best biotechs make strong data-driven decisions using BioBox knowledge graphs to set up their pipelines for success.</em></p><h1>Introduction</h1><p>Indication selection is the process of selecting and ranking potential diseases or conditions (indications) that a drug candidate could target, based on scientific, medical, regulatory, and commercial factors. This strategic decision requires participation from leadership across multiple functions in the company. </p><p>By the time we engage with biopharma, they already have a general idea of what therapeutic areas (e.g. CNS, liver disease, auto-immune, etc.) to focus on. Usually, this is set in the company&#8217;s mission, a specific area of expertise, and/or something their platform is uniquely positioned to solve for. The asset&#8217;s stage of development is also important to consider. The starting points for indication selection strategy are different based on the <em>type </em>of biotech company. Because of these nuances, it was not surprising to find that each customer had a slightly different perspective on how diseases should be ranked. As a result, we&#8217;ve ingested dozens of different data sources and built many knowledge graph variants for customers. </p><p>In this blog post, we&#8217;ll share a blueprint of best practices for modeling data inside a knowledge graph to help biotech teams evaluate their competitive positioning based on feedback from our customers.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and updates.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>Organizing Principles</h1><p>There are 2 entry points for this discussion: <strong>Diseases </strong>and <strong>Processes.</strong> This distinction is important when describing treatment modalities. For example, in immuno-oncology, one of the desired outcomes is to activate T-cell response. These therapies may have broad application across a portfolio of diseases. It is also useful to connect <strong>Phenotypes</strong> with diseases to improve resolution. </p><h1>Commercial Attractiveness</h1><p>Working with business and medical affairs teams broadened my perspective on how much multi-modal data goes into a indication decision. But it basically boils down to this equation:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\frac{ \\text{Value if it works} \\bullet \\text{Probability of success}  }{  \\text{Time to value} \\bullet \\text{Money needed to find out}  }&quot;,&quot;id&quot;:&quot;TESZIFMWCP&quot;}" data-component-name="LatexBlockToDOM"></div><p>The goal is to max out this ratio. Let&#8217;s talk about some strategies that we&#8217;ve seen work.</p><h2>Patient Recruitment</h2><p>Your ability to recruit enough patients to satisfy the study criterion is a vital consideration. High competition for patients will make your trial run longer, delaying the time to value. Overextending the trial duration means more staff on payroll. The overall effect is the denominator increases, by a lot, and hurts your commercial attractiveness.</p><p>Several factors contribute to patient recruitment potential:</p><ol><li><p>Pick an indication that has high prevalence = larger patient supply</p></li><li><p>Large unmet need</p></li><li><p>Competing trials recruiting for the same patient populations</p></li></ol><p>Everyone knows about 1 and 2. There&#8217;s not much that can be done there to give you an advantage. But it turns out we can use graphs to avoid setting up competing trials. Every clinical trial will list their inclusion and exclusion criteria for participants. Customers we&#8217;ve worked with extracted the unstructured data into a structured set that we then model into the graph.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kM3X!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kM3X!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 424w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 848w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 1272w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kM3X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png" width="543" height="314.38804945054943" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:843,&quot;width&quot;:1456,&quot;resizeWidth&quot;:543,&quot;bytes&quot;:177815,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kM3X!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 424w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 848w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 1272w, https://substackcdn.com/image/fetch/$s_!kM3X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcabcd584-3765-487d-8bcb-55a687f2bdda_1462x846.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KMIF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KMIF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 424w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 848w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 1272w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KMIF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png" width="468" height="378.06526468455405" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1114,&quot;width&quot;:1379,&quot;resizeWidth&quot;:468,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KMIF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 424w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 848w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 1272w, https://substackcdn.com/image/fetch/$s_!KMIF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0780425b-b1b9-467a-a78f-6ac677838219_1379x1114.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>After repeating this for the exclusion criterion, we can build a graph representation model (mono-partite projection) that calculates similarities of Clinical Trial nodes to help scaffold our own inclusion criterion that will deliver on our scientific/regulatory goals, but minimizes the similarity to other actively recruiting trials.</p><p>At decision-time, the process basically works as follows:</p><ol><li><p>Enumerate your ideal inclusion/exclusion criteria that will maximize your chance of the trial passing (we&#8217;ll cover this topic in part 2 of the series)</p></li><li><p>Run the similarity calculations</p></li><li><p>Work with medical affairs and ClinOps team to drop/augment a criteria</p></li><li><p>Re-run the similarity calculations</p></li><li><p>Repeat until either ClinOps/medical affairs denies another change or similarity scores reaches a global minima</p></li></ol><h3>Bonus: Geo-spatial data to minimize site overlap</h3><p>This next aspect of the graph was not something we expected to be done in our discipline, but yielded information that was remarkably strategic, even outside of drug development (more on this later). The problem was that even when you have the right indication, the right inclusion/exclusion criteria, you still have to physically recruit the patients and bring them to the sites. While the global disease burden is high, if the local recruitment location is highly saturated with similar trials, you&#8217;ll still face a recruitment challenge.</p><p>Each clinical trial that is active and/or recruiting is required to disclose their study sites. The API response for study locations conveniently includes the geospatial coordinates (longitude, latitude). This adds a new dimension to use in our decision making. We can evaluate how far apart the trials are from a clinical characteristic perspective <strong>and we can calculate the physical distance between the trial locations</strong>. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zzoZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zzoZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 424w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 848w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 1272w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zzoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png" width="338" height="454.3606557377049" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:610,&quot;resizeWidth&quot;:338,&quot;bytes&quot;:80576,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zzoZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 424w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 848w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 1272w, https://substackcdn.com/image/fetch/$s_!zzoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa28b5d67-5bd6-48f5-8098-efc79fbaf845_610x820.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Use-cases outside of drug discovery</h3><p>I recently reconnected with an old colleague who is now in a leadership role at a major Canadian hospital. I shared with him the geo-spatial strategy which brought up a use-case I had not previously considered. <strong>Attracting more clinical trials to your research site. </strong></p><p>Research hospitals benefit greatly from conducting clinical trials, it&#8217;s a significant source of revenue. However, there is an activation time required at the research site to prepare the clinical research operations to execute successfully. It is a very reactive type of work where lucrative opportunities are passed because the timing didn&#8217;t align. But if you knew what the trends are for the upcoming trials that are going to start recruiting and have a solid mapping of their clinical characteristics, you can preemptively activate your research operations so that when recruitment opens, your site is ready to go. Furthermore, you can use the spatial data to reverse engineer which trials are competing for the same populations within the same locations and prep/offer your trial site to satisfy their recruitment needs and drive revenue into your institution.</p><div><hr></div><h1><em>What&#8217;s Next?</em></h1><p>Upcoming in this series, we&#8217;ll continue to focus on indication selection patterns. Specifically, we&#8217;ll be sharing best practices to increase the "probability of success" variable in the equation such as mechanism of action analysis and biomarker mapping.</p><div><hr></div><p><strong><a href="https://biobox.io">BioBox</a></strong> is the knowledge infrastructure for modern biopharma research, built for drug hunters who need to integrate multi-modal data, engineer knowledge, and test hypotheses at scale. To learn more, please visit our website at https://biobox.io or <a href="mailto:sales@biobox.io">reach out to one of the team members</a>.</p>]]></content:encoded></item><item><title><![CDATA[The Reactome Knowledgebase: Powering Biological Research with Knowledge Graphs]]></title><description><![CDATA[A look into how the best scientists and system biologists organize biological information.]]></description><link>https://blog.biobox.io/p/the-reactome-knowledgebase-powering</link><guid isPermaLink="false">https://blog.biobox.io/p/the-reactome-knowledgebase-powering</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Mon, 23 Sep 2024 14:54:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!QDnQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This week we are featuring an article written in collaboration with <a href="https://www.linkedin.com/in/nancy-t-li/">Nancy T Li,</a> PhD (Reactome Outreach Lead).&nbsp;Nancy leads community engagement, training, and outreach for the Reactome Knowledgebase. </em></p><h1>Introduction</h1><p>Biological states and transitions between these states are controlled and regulated processes. There are numerous chemicals, molecules, complexes that work to coordinate these processes, that together, give rise to form and function. Our task, as scientists and drug hunters, is to understand how these processes work so that we can exploit them and find cures for diseases. To understand causality, we need reliable and up-to-date information from sources that we can trust. For biological pathways and systems biology, the de facto choice for <a href="https://biobox.io">BioBox</a> and our biopharma partners has been the <a href="https://reactome.org/">Reactome knowledgebase</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>Reactome Knowledgebase</h1><p>Reactome is a publicly funded, open access, open source database, created in 2002 by <a href="https://www.linkedin.com/in/lincoln-stein-74532a16b/">Lincoln Stein</a> at CSHL, Ewan Birney at EBI-EMBL, and Suzannah Lewis at Lawrence Berkeley National Laboratory. <strong>Reactome is now the leading open resource for human-curated pathway knowledge</strong>, and is a key resource in BioBox, used for making reliable predictions.</p><h2>The Bedrock of Reliability: Reactome&#8217;s Curation Practices</h2><p>In the dynamic realm of bioinformatics and biomedical research, reliable, high-quality data is the lifeblood of scientific advancement. Researchers depend on databases like Reactome to provide accurate and comprehensive information about biological pathways. Central to Reactome's reliability is its meticulous curation process.&nbsp;</p><p>Reactome&#8217;s curation process is rigorous and multi-faceted, ensuring that the data it provides is both accurate and relevant:</p><ol><li><p><strong>Literature Review and Data Extraction</strong>: Curators at Reactome are PhD-level biological experts, who conduct an extensive review of scientific literature to extract knowledge about biological pathways. This step demands a deep understanding of molecular biology and the ability to discern significant findings that can be integrated into pathway models.</p></li><li><p><strong>Data Integration and Standardization</strong>: Extracted data is integrated into Reactome according to Reactome&#8217;s object-based data model, and standardized using controlled vocabularies and ontologies such as the Gene Ontology (GO). This standardization ensures consistency, making it easier for researchers to query and analyze the information.</p></li><li><p><strong>Expert Review and Validation</strong>: Reactome curators work with domain experts to review and validate the curated data, similar to how academic journals conduct peer-review before publication. These experts ensure that the pathways accurately reflect current scientific knowledge and experimental evidence.</p></li><li><p><strong>Community Involvement</strong>: Reactome encourages the scientific community to contribute data, suggest corrections, and provide feedback. This collaborative approach helps keep the database current with the latest scientific discoveries. <a href="https://reactome.org/community/collaboration">Here </a>is a web page, showing pathways that are ready for external review<strong>[1]</strong>.</p></li></ol><h2><strong>Transforming Data with Knowledge Graphs</strong></h2><p>Knowledge graphs have become a transformative tool in the bioinformatics landscape. By structuring data as interconnected entities and relationships, knowledge graphs provide a more intuitive and powerful way to represent and analyze biological information. Reactome&#8217;s adoption of a knowledge graph perspective enhances its curation practices and opens up new avenues for research.</p><p>By organizing data within Neo4j, Reactome not only enhances data reliability but also unlocks new opportunities for researchers to explore and interpret complex biological systems. See <a href="https://doi.org/10.1371/journal.pcbi.1005968">here</a> for a publication by Fabregat et al, 2018, discussing the Reactome graph database <strong>[2].</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QDnQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QDnQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 424w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 848w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 1272w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QDnQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png" width="474" height="435.9107142857143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1339,&quot;width&quot;:1456,&quot;resizeWidth&quot;:474,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QDnQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 424w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 848w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 1272w, https://substackcdn.com/image/fetch/$s_!QDnQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0092c60a-6f3d-4200-99c4-16c01b199e70_1600x1471.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Statistics for Reactome graph database contents as of version 89.</figcaption></figure></div><p><strong>Data Interconnectivity</strong>:</p><ul><li><p><strong>Entity Relationships</strong>: In a knowledge graph, each piece of data is an entity connected by relationships. This interconnected structure allows researchers to see not just isolated data points but the rich web of interactions and dependencies between them. For instance, a knowledge graph can reveal how a particular gene is involved in multiple pathways and how these pathways inter-relate.</p></li><li><p><strong>Contextual Understanding</strong>: Knowledge graphs provide context by linking related entities. Researchers can understand how a particular biological pathway fits into larger cellular processes and how alterations in one pathway might affect others, leading to more holistic, systems-level insights.<br></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hJTv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hJTv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 424w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 848w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 1272w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hJTv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png" width="696" height="315.9725274725275" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:661,&quot;width&quot;:1456,&quot;resizeWidth&quot;:696,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hJTv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 424w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 848w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 1272w, https://substackcdn.com/image/fetch/$s_!hJTv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25189213-785b-43d7-8965-29d9f761bbd5_1600x726.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Example of the Reactome Graph Database (v89), querying for a dissociation reaction, relating pathway, input complex, output proteins, and the UniProt reference entities.</em></figcaption></figure></div></li></ul><p><strong>Enhanced Data Analysis</strong>:</p><ul><li><p><strong>Advanced Querying</strong>: Knowledge graphs enable sophisticated querying capabilities. Researchers can formulate complex queries that span multiple entities and relationships, uncovering insights that would be difficult to achieve with traditional database queries.</p></li><li><p><strong>Predictive Analytics</strong>: By leveraging the interconnected nature of knowledge graphs, AI and machine learning algorithms can predict new relationships and interactions within the data. This predictive capability can lead to the discovery of novel pathways and biological mechanisms.</p></li></ul><p>To try this out, Reactome has provided documentation <a href="https://reactome.org/dev/graph-database">here</a> that describes how to install and try out Reactome&#8217;s Neo4j graph database<strong> [3].</strong></p><p><strong>AI and Machine Learning</strong>:</p><ul><li><p><strong>AI-Augmented Data Curation</strong>: AI and machine learning algorithms can automate aspects of the curation process. For example, natural language processing (NLP) can extract relevant information from scientific literature and integrate it into the knowledge graph, speeding up the data curation process. The Reactome team is actively working toward finding reliable methods to augment curation processes using AI. Stay tuned for more information in the future!</p></li><li><p><strong>Pattern Recognition</strong>: Machine learning can identify patterns and correlations within the knowledge graph that might not be immediately apparent. Researchers can leverage Reactome&#8217;s Neo4j graph database to find new insights, driving new hypotheses and research directions.</p></li></ul><h3><strong>Meeting Researchers&#8217; Needs: Reliability, Accessibility, and Discovery</strong></h3><p>For researchers, the reliability and accessibility of data are critical. By employing knowledge graphs, Reactome enhances these aspects and offers additional benefits:</p><ol><li><p><strong>Comprehensive and Accurate Data</strong>: The rigorous curation practices ensure that data within Reactome is accurate and comprehensive. The knowledge graph structure further ensures that data is contextually relevant and interconnected, providing a richer understanding of biological pathways.</p></li><li><p><strong>User-Friendly Interface</strong>: The knowledge graph framework supports advanced visualization and interaction tools. Researchers can easily navigate through interconnected data, explore relationships, and visualize complex pathways in an intuitive manner.</p></li><li><p><strong>Up-to-Date Information</strong>: The dynamic nature of knowledge graphs allows for continuous updates and refinements.</p></li><li><p><strong>Facilitating Discovery</strong>: Knowledge graphs not only provide current data but also facilitate the discovery of new knowledge. Researchers can explore uncharted relationships and generate new insights, driving innovation and scientific progress.</p></li></ol><h1>Using Reactome Data in BioBox</h1><p>Loading Reactome data into your BioBox knowledge graph can be done through importing the Reactome data package in the external data package listing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3MIs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3MIs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 424w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 848w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 1272w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3MIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png" width="1456" height="932" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:932,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1130682,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3MIs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 424w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 848w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 1272w, https://substackcdn.com/image/fetch/$s_!3MIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14046f26-3c1d-4200-96ae-2341788e9376_3190x2042.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This data package includes concept and relationship definitions that will incorporate into your custom ontology. Objects loaded through this data package preserves the  data and mapping from the Reactome knowledge base and integrates with your private knowledge graph for increased comprehension.</p><h1><strong>Conclusion</strong></h1><p>At BioBox, we are thrilled to be partnered with Reactome. Reactome&#8217;s commitment to rigorous curation practices, coupled with the power of knowledge graphs, makes it an indispensable resource for researchers. By organizing data in an interconnected, contextual framework, Reactome enhances data reliability and opens up new opportunities for exploration and discovery. As the integration of data analysis and AI technologies continues to advance, Reactome remains at the forefront, providing researchers with the tools they need to push the boundaries of scientific knowledge.</p><p></p><p><strong>References</strong></p><p>[1] Link to Reactome Collaborator Zone webpage: <a href="https://reactome.org/community/collaboration">https://reactome.org/community/collaboration</a></p><p>[2] Fabregat A, Korninger F, Viteri G, Sidiropoulos K, Marin-Garcia P, Ping P, Wu G, Stein L, D'Eustachio P, Hermjakob H. Reactome graph database: Efficient access to complex pathway data. PLoS Comput Biol.Jan 29. <a href="https://www.ncbi.nlm.nih.gov/pubmed/29377902">PubMed.</a></p><p>[2] Link to Reactome Developer Zone documentation: <a href="https://reactome.org/dev/graph-database">https://reactome.org/dev/graph-database</a></p><div><hr></div><h3>About BioBox</h3><p>BioBox is a knowledge infrastructure for modern biopharma research teams to accelerate drug discovery and make better decisions in preclinical studies. Biopharma research teams from startups to top 20 pharms use BioBox to transform multi-omic data into knowledge graphs and use them to drive decision making in target prioritization, indication selection, and biomarker discovery.</p><p>To learn more visit <a href="https://biobox.io">biobox.io</a> or <a href="mailto:sales@biobox.io">reach out to get in touch</a> with a team member.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[How to transform multi-omic data into knowledge graphs and do useful things with them.]]></title><description><![CDATA[A practical guide to omics data modeling in graphs.]]></description><link>https://blog.biobox.io/p/how-to-transform-multi-omic-data</link><guid isPermaLink="false">https://blog.biobox.io/p/how-to-transform-multi-omic-data</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Mon, 26 Aug 2024 14:31:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!3CwH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1><p>Knowledge graphs (KGs) are powerful tools for reasoning and analysis. There is a misconception that KGs only apply to text-based information. The most common objection we hear is that KGs are unsuitable for data science. However, with a few changes, thoughtful data modeling, and novel software, we can unlock the power of these KGs and explore information in data-driven and novel ways.</p><p>In this post, I&#8217;ll share some frameworks we use at <a href="https://biobox.io">BioBox</a> to craft custom multi-modal knowledge graphs built upon datasets spanning various sequencing technologies. Then, we&#8217;ll walk through how this graph is used in the research process to answer questions.</p><h1>Goals of data integration</h1><p>Using different experimental techniques and combining data logically can deepen our understanding of biological phenomena. For example, you can use ChIP and ATAC-seq to explore the regulatory control that explains the up/down regulation observed in matched RNA-seq data. Interpreting these datasets requires information about genes, molecular functions, biological processes, etc. Each dataset provides thousands of data points, and the search space is enormous. The goal of data integration is to improve our ability to separate signals from noise.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading! Subscribe to the BioBox blog to keep up with platform updates, news, and more.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><h1>Data schemas</h1><h2>Metadata capture</h2><p>The best practice is to isolate all the important variables that can be used for logical flows as nodes in the graph rather than storing them as properties in the node.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3CwH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3CwH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 424w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 848w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 1272w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3CwH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png" width="439" height="250.85714285714286" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:439,&quot;bytes&quot;:146990,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3CwH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 424w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 848w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 1272w, https://substackcdn.com/image/fetch/$s_!3CwH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb85ce50b-5a5c-43a2-b5f7-5a13ed7e9b85_1988x1136.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Example metadata **schema**</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QyIV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QyIV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 424w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 848w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 1272w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QyIV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png" width="539" height="268.0192307692308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:1456,&quot;resizeWidth&quot;:539,&quot;bytes&quot;:172959,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QyIV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 424w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 848w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 1272w, https://substackcdn.com/image/fetch/$s_!QyIV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff32cc7ad-a663-4081-be5c-c2e2fe7ce340_2276x1132.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Example metadata record</figcaption></figure></div><p><strong>Why separate disease and cell type?</strong></p><p>It&#8217;s simple: we care about the relationships that diseases and cell types have.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qV22!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qV22!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 424w, https://substackcdn.com/image/fetch/$s_!qV22!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 848w, https://substackcdn.com/image/fetch/$s_!qV22!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 1272w, https://substackcdn.com/image/fetch/$s_!qV22!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qV22!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png" width="1456" height="550" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:550,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:131350,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qV22!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 424w, https://substackcdn.com/image/fetch/$s_!qV22!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 848w, https://substackcdn.com/image/fetch/$s_!qV22!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 1272w, https://substackcdn.com/image/fetch/$s_!qV22!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32ba36c0-26d0-477e-8789-ec699fcb4daf_1732x654.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Specifically, we can exploit the ability to walk from semantically related cell types and diseases. By leveraging the power of connected graphs, we can expand our search space for questions like, &#8220;Find all glioma samples,&#8221; and walk down the paths. Consider what would happen if we stored these values as properties inside the Sample or Donor nodes. We would need to exhaustively search all samples for some key-value match for every value along the semantic chain of values (glioma, astrocytic tumor, astrocytoma), which becomes prohibitively inefficient.</p><h2>RNAseq</h2><p>RNA-seq is relatively straightforward to model. These experiments produce two types of datasets: Gene Expression and Differential Expression. You want to capture some information about the transcriptional state of a gene in a biological context (e.g., disease vs. normal, cell population A vs. cell population B, etc.). The gene expression graph mapping is relatively straightforward once the metadata is appropriately set up. We can describe the expression of a gene within a particular RNA-seq library with a direct edge as follows:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QVQ-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QVQ-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 424w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 848w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 1272w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QVQ-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png" width="435" height="107.1105527638191" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:196,&quot;width&quot;:796,&quot;resizeWidth&quot;:435,&quot;bytes&quot;:26644,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QVQ-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 424w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 848w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 1272w, https://substackcdn.com/image/fetch/$s_!QVQ-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d3df1fd-b048-4f4f-b6fb-4f1180d82b08_796x196.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Gene expression schema</figcaption></figure></div><p>Differential gene expression (DGE) datasets require an additional hyper-edge to capture the comparison information.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!z0xu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z0xu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 424w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 848w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 1272w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z0xu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png" width="393" height="255.45" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:840,&quot;resizeWidth&quot;:393,&quot;bytes&quot;:57597,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z0xu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 424w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 848w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 1272w, https://substackcdn.com/image/fetch/$s_!z0xu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3135219-de89-471b-a977-ed14cc0f5e0a_840x546.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">DGE schema</figcaption></figure></div><p>Because we use a property graph as the base, we can persist the quantitative values of the log2 fold change, p-value statistics, count values, etc., directly inside the edge. It also becomes useful to incorporate these values as edge weights in other graph-based algorithms (more on that in the future). This is a unique characteristic of property graphs and data graphs that differs from traditional RDF-based knowledge graphs.</p><h2>ChIPseq</h2><p>Tracking epigenomic observations requires the graph to comprehend genomic coordinates. To do this, we start by creating 1 kilobase (kb) bins across the entire genome and represent them as beads on a chain.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ddk5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ddk5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 424w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 848w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 1272w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ddk5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png" width="507" height="153.0296191819464" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1418,&quot;resizeWidth&quot;:507,&quot;bytes&quot;:75460,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ddk5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 424w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 848w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 1272w, https://substackcdn.com/image/fetch/$s_!Ddk5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505c341-ea6c-4ce5-b8b6-ff367b499c32_1418x428.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Then, we can load ChIP-seq observations, such as enrichment peaks, into our graph. Each peak is represented as a distinct node with edges to genomic range nodes that demarcate the genomic region from which the peak was detected.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xvp0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xvp0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 424w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 848w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 1272w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xvp0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png" width="567" height="352.8173076923077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:906,&quot;width&quot;:1456,&quot;resizeWidth&quot;:567,&quot;bytes&quot;:141080,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Xvp0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 424w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 848w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 1272w, https://substackcdn.com/image/fetch/$s_!Xvp0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffd29d38b-17fa-4498-a5a0-f4537d547bf6_1556x968.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In our graph, we have also loaded regulatory gene features, such as promoters and enhancers. By combining this information, we can compose more complex and meaningful patterns. For example, in the figure above, the topology describes an EP300 peak bound detected to overlap with gene ABC123&#8217;s promoter region, suggesting possible epigenetic control.</p><h1>Doing useful things with multi-omic graphs.</h1><p>The usefulness of data comes down to whether or not it can impact decision-making. Ultimately, we are hunting for biological explainability so that we can attribute the importance of a target to the overall etiology of a disease. This forms the basis for a drug program to spend millions of dollars around this therapeutic hypothesis. Combining data modalities enhances the ability of scientists to see the bigger picture and make better decisions.</p><h1>Extending the graph to answer questions</h1><p>There are only two requirements for graph model construction:</p><ol><li><p>There are two concepts defined (Bipartite graph).</p></li><li><p>There are relationships in your ontology that can directly or indirectly connect these concepts together.</p></li></ol><p><strong>Types of Bipartite Graphs built on the BioBox Platform:</strong></p><ul><li><p><strong>Gene &lt;&gt; Disease</strong></p></li><li><p><strong>Gene &lt;&gt; Gene Module</strong></p></li><li><p><strong>Drug &lt;&gt; Adverse Event</strong></p></li><li><p><strong>Protein &lt;&gt; Phenotype</strong></p></li></ul><p><strong>Example: Transcriptional Circuits &amp; Pleiotropy</strong></p><p>From transcriptomics, we can generate differentially expressed genes and run the count data through GSEA to yield some enriched pathways and gene sets. Typically, we&#8217;re hunting for pathways that contribute to an observed phenotype of our disease area. Chances are you will have an overwhelming amount of DE genes and pathway hits. How can we systematically qualify and eliminate targets in a principled way?</p><p>One approach is to layer on epigenetic data to identify targets that have the highest impact across multiple dysregulated pathways.</p><ol><li><p>Collect the list of genes that were up-regulated and found in enriched gene modules of interest.</p></li><li><p>Evaluate the context-specific binding of transcriptional factors (TF) at the gene list promoter regions.</p></li><li><p>Evaluate the context-specific accessibility of promoter regions for gene list.</p></li><li><p>Rank the TFs based on occurrence</p></li></ol><p>Together, we are filtering for the strongest transcription factors driving the up-regulation of genes involved in disease pathways/mechanisms. It turns out when the data is modeled and connected in the graph, the context-heavy question is trivial to search for.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xtsk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xtsk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 424w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 848w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xtsk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png" width="1064" height="1120" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1120,&quot;width&quot;:1064,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:161584,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Xtsk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 424w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 848w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 1272w, https://substackcdn.com/image/fetch/$s_!Xtsk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb28633a9-e204-48be-a445-767cffd34b7a_1064x1120.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>In practice, this means we are building a graph model to compute the score between two concepts: Disease and Gene. Each layer of data integrated is an additional piece of information that builds up a scientific hypothesis. Each line of evidence can be used to validate or invalidate a hypothesis to varying degrees. In other words, these lines of evidence can be positively or negatively weighted values. The direction depends on the research question and the hypothesis.</p><p>For example, in a target prioritization exercise:</p><ul><li><p>+ 0.3, Gene is up-regulated in disease vs. normal</p></li><li><p>+ 0.1, Gene participates in disease-specific pathways of interest</p></li><li><p>+ 0.2, Gene is a TF</p></li><li><p>+ 0.4, Gene product binds to promoters of disease vs. normal up-regulated genes</p></li><li><p>- 0.3, Gene is down-regulated in disease vs. normal</p></li><li><p>- 0.1, Gene product binds to promoters that are not accessible</p></li></ul><p>The bullets above are reachable graph paths in the knowledge graph as defined by the ontology. An instance of this graph path is an idempotent and unique trace. For example, given 10 differential expression datasets, there can be (up to) 10 paths in the graph connecting a disease to a gene.</p><p><strong>This is an important difference from traditional knowledge graphs.</strong></p><p>On repeated observations, you should probably choose a scaling factor so it doesn&#8217;t dominate the overall score. By default, we use harmonic sums, but you&#8217;re free to choose whichever method you desire. Summing across positive and negative weighted paths will give you an overall aggregated score to rank objects from these two concepts together.</p><h1>Conclusion</h1><p>Crafting a knowledge graph isn&#8217;t just about connecting data; it&#8217;s about empowering researchers to uncover new insights, make informed decisions, and drive innovation in ways that were previously unimaginable. By integrating diverse data modalities&#8212;whether it's transcriptomics, epigenomics, or beyond&#8212;we can construct a rich, interconnected tapestry of information that shines a light on the complex biological mechanisms at play.</p><p>At BioBox, we've seen firsthand how a well-constructed knowledge graph can transform the research process, turning what was once a daunting sea of data into a navigable map that leads directly to actionable discoveries. Whether you&#8217;re prioritizing drug targets, exploring disease mechanisms, or seeking new therapeutic hypotheses, the power of a multi-modal knowledge graph can be the difference between a promising lead and a breakthrough.</p><p>But remember, <strong>the true value of a knowledge graph lies not in its complexity but in its utility</strong>. It&#8217;s not just about storing data in a visually appealing structure&#8212;it&#8217;s about using that structure to generate meaningful results that can drive your research forward.</p><p>So, as you consider building your own knowledge graph, ask yourself: How will it serve your team? What questions will it help answer? With the right design and a clear purpose, your knowledge graph could be the key to unlocking the next big discovery in your field.</p><p>If you&#8217;re ready to harness the power of knowledge graphs in your research, visit us at <a href="https://biobox.io">biobox.io</a> to learn more and connect with our team of experts. Let&#8217;s explore the future of data-driven science together.</p><p></p>]]></content:encoded></item><item><title><![CDATA[Knowledge Infrastructure for Modern Biopharma]]></title><description><![CDATA[How teams are using the BioBox Data Intelligence Platform to solve complex drug discovery data challenges]]></description><link>https://blog.biobox.io/p/knowledge-infrastructure-for-modern</link><guid isPermaLink="false">https://blog.biobox.io/p/knowledge-infrastructure-for-modern</guid><dc:creator><![CDATA[Lauren Phillips]]></dc:creator><pubDate>Wed, 17 Jul 2024 19:40:36 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/146689981/1ab924104d2cd716a568f0e98ca0d552.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Understanding the biology of a disease is the backbone of a successful drug program. This is much easier said than done when working with thousands of multi-modal data points and cross functional teams. </p><p>Every day new AI and data management tools are hitting the market, promising to revolutionize drug discovery. The majority claim to accelerate insights and facilitate data driven decisions, but it&#8217;s often difficult to understand exactly how they are helping teams accomplish this. </p><p>In this article I will walk you through exactly how a drug discovery team used the BioBox platform to solve complex data challenges. </p><p>Within 8 weeks the data team was able to:</p><ul><li><p>Build a proprietary evidence ranking and prioritization system that led to the identification of 10 novel targets.</p></li><li><p>Prioritize 2 new indications for existing assets.</p></li><li><p>Curate a custom representation of disease biology.</p></li><li><p>Leverage machine learning algorithms to identify serendipitous data connections.</p></li><li><p>Fact check their internal AI solutions that often fall victim to hallucinations.</p></li></ul><h2>Challenge: Creating custom representation of disease biology with harmonized public + private data</h2><h4>Building a custom data graph</h4><p>The foundation of the platform is a data graph, think of this as a custom GPS for navigating disease biology. It enables teams to keep track of important biological relationships such as disease, genes, variants, pathways and their relationships. </p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;1f4f4100-eb39-4fef-8db8-a80c143ffc95&quot;,&quot;caption&quot;:&quot;Introduction Biomedical data is highly fragmented. Just for describing genes alone there are at least 3 major systems for identifiers (e.g. Ensembl, Refseq, Entrez). There are countless concurrent efforts in ontology development, dataset harmonization, and public knowledge base creation. Together, they provide the context that scientists use to&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Building a GPS for disease biology&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:11993470,&quot;name&quot;:&quot;Christopher Li&quot;,&quot;bio&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/867b2052-523e-47d1-8400-da653823576f_500x500.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-05-28T13:55:18.259Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/building-a-gps-for-disease-biology&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:144948744,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>Here is where we differentiate from a lot of knowledge graph providers, the foundation of the custom graph curated for each team is quantitive sequencing data such as Single Cell or Whole Genome Sequencing data. In addition to this, we provide a wide variety of curated, versioned and well annotated consortium data such as OpenTargets, Reactome and NIH clinical trials. </p><p>Nothing is worse than picking up a new software tool and having to put in countless hours importing data and setting up the new system. Our graph curators worked with the client to make the set up process as simple as possible. </p><ol><li><p>Our client described important biological relationships and questions they would like to answer. </p></li><li><p>The client provided us with access to the data they would like to upload to the platform. </p></li><li><p>We took care of the heavy lifting, tailoring their data graph schema and uploading their proprietary data. We ensured the teams were provided with adequate documentation, UI, and API support should they choose to do any data graph management on their own. </p></li></ol><p>Within 48 hours the team had a custom data graph ready for exploration. </p><p><strong>Problems solved</strong> </p><ul><li><p><strong>Creating and managing a data graph</strong>. We provided the tools and infrastructure to ensure that the graph could be easily updated and maintained.</p></li><li><p><strong>Data wrangling and harmonization</strong>. We took care of the tedious data wrangling and harmonization to ensure the client&#8217;s data was seamlessly integrated. </p></li><li><p><strong>Knowledge base versioning and maintenance.</strong> We maintain versioned data packages and ontologies, making third party knowledge base integrations effortless. </p></li></ul><h2>Challenge: Identifying Serendipitous Data Connections</h2><h4>Traversing the graph</h4><p>Once the data graph was set up, the team used the graph explorer to traverse their data. They were able to build compelling narrative around targets, diseases, and cell types. Graph explorer sessions were saved and shared with colleagues for real time collaboration. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZcY1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZcY1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 424w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 848w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 1272w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZcY1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png" width="1456" height="927" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:927,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1534057,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZcY1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 424w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 848w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 1272w, https://substackcdn.com/image/fetch/$s_!ZcY1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e5211f8-483f-49c6-a448-4d644075f1d7_3750x2388.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The team was able to execute graph algorithms like page rank to identify the relative importance of data points and all shortest paths to reveal serendipitous data connections.</p><p><strong>Problems solved</strong> </p><ul><li><p>Identifying critical data points that would have been overlooked had the data remained fragmented.</p></li><li><p>Building compelling narratives surrounding targets of interest. </p></li></ul><h2>Challenge: Building Evidence Ranking and Prioritization Systems</h2><p>The BioBox platform does not autonomously predict targets or tell teams which indications they should pursue. It enables scientific teams to use their domain expertise to curate custom evidence ranking and prioritization systems by leveraging their fully connected data graph.  </p><p>The team configured several graph models tailored to specific use cases:</p><ul><li><p><strong>Variant prioritization</strong>. This graph model provided a ranked list of variants identified across public GWAS studies and proprietary WGS studies that would put patients at risk for diseases of interest. Variants were ranked according to the team&#8217;s scientific criteria. </p></li><li><p><strong>Indication prioritization.</strong> This graph model provided a ranked list of indications that would be suitable for an asset that they have in phase 1 clinical trials. Indications were ranked according to the team&#8217;s scientific criteria.</p></li><li><p><strong>Pathway prioritization.</strong> This graph model provided a ranked list of pathways that were perturbed within a disease of interest. Pathways were ranked according to the team&#8217;s scientific criteria. </p></li></ul><ul><li><p><strong>Target prioritization.</strong> This graph model provided a ranked list of genes for diseases of interest according to the scientific criteria that they value. </p></li></ul><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;3641a49d-8461-4d37-a8f3-503c4afa3a6a&quot;,&quot;caption&quot;:&quot;Step 1: Defining Hypotheses and Organizing Data Before utilizing the BioBox platform, the company addressed the following key questions to establish a clear direction: What is our therapeutic approach? What is our biological hypothesis? What internal data will we work with?&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Target Prioritization: A case study on the BioBox Platform &quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:170284250,&quot;name&quot;:&quot;Lauren Phillips&quot;,&quot;bio&quot;:&quot;Chief Product Officer @ BioBox Analytics&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f315c3e5-16f3-485a-a3fd-d2eab2ed61e2_2790x2060.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-05-29T15:02:03.644Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/target-prioritization-a-case-study&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:145068216,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><strong>Problems solved</strong> </p><ul><li><p><strong>Team domain expertise was siloed from quantitative data.</strong> Through the curation of a graph model, teams were able to develop a framework of domain expertise for a variety of use cases. </p></li><li><p><strong>Prior to the use of BioBox countless hours were spent updating/ rerunning reports and analyses.</strong> All graph model reports autoupdate upon the injection of new data. </p></li><li><p><strong>Managing data from an active discovery pipeline</strong>. All reports are timestamped and versioned. This provided the team with a comprehensive understanding of how data points were changing in priority in as new data was injected. </p></li></ul><h2>Challenge: Complex Multi-hop Data Lookups </h2><h4>Intuitive Query Language</h4><p>Knowledge graphs are notoriously difficult to query. No one wants to write in cypher or SPARQL. </p><p>The team used our intuitive query language to traverse their ontology and ask complex multi-hop questions without unnecessary table joins. Multi-hop questions pulled information from multiple data sources within the graph.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xCYW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xCYW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 424w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 848w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 1272w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xCYW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png" width="1456" height="927" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:927,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1485126,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xCYW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 424w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 848w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 1272w, https://substackcdn.com/image/fetch/$s_!xCYW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a51ec8-a801-4de6-b461-27ea727de992_3750x2388.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Questions such as such as &#8220; Which genes participate in the RNA polymerase II Transcription Pathway and are upregulated in internal tumor samples "? could be answered in seconds. </p><p>Important queries were saved so that they could be executed at any time.</p><p><strong>Problems solved</strong> </p><ul><li><p><strong>Unnecessary table wrangling and table joins</strong>. Countless hours are spent consolidating data before important value generating analyses can begin. Using the query language teams were able to obtain answers within seconds rather than hours. </p></li><li><p><strong>Steep coding learning curve typically associated with knowledge graphs. </strong>All team members were able to retrieve information from the knowledge graph without the use of code. </p></li></ul><h2>Challenge: Fact checking AI solutions </h2><h4>Natural language - Multi-omic GraphRAG</h4><p>New AI tools are hitting the market each day, many of which fall victim to hallucination. The team was working towards an AI strategy and wanted a ground source of truth derived from data they could trust. </p><p>The team leveraged our natural language GraphRAG to converse with their sequencing data. </p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d54c90bc-e1d5-4f3c-b0cc-a2b3d87088e0&quot;,&quot;caption&quot;:&quot;As new AI solutions hit the market every day, data scientists are forced to discern the truth from hallucinations. Knowledge Graphs It is imperative to have a central source of truth to fact check the information provided by LLMs. One way in which data teams are tackling this challenging is through the use of knowledge graph. In short, a knowledge graph i&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Fact Check your AI : Multi-Omic GraphRAG&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:170284250,&quot;name&quot;:&quot;Lauren Phillips&quot;,&quot;bio&quot;:&quot;Chief Product Officer @ BioBox Analytics&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f315c3e5-16f3-485a-a3fd-d2eab2ed61e2_2790x2060.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-07-08T17:37:12.474Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/fact-check-your-ai-multi-omic-graphrag&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:146399069,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:2,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><strong>Problems Solved</strong> </p><ul><li><p><strong>Develop a ground source of truth to fact check their AI.</strong> Instead of relying solely on new AI tools that are prone to hallucination, the team was able to use our platform to fact check information against quantitative sequencing data within their graph.  </p></li></ul><p>By using the BioBox Platform our client was able to curate a custom data graph without the tedious pain points typically associated with knowledge graph management. </p><p>This enabled them to  </p><ul><li><p>Build a proprietary evidence ranking and prioritization system that led to the identification of 10 novel targets.</p></li><li><p>Prioritize 2 new indications for existing assets.</p></li><li><p>Curate a custom representation of disease biology.</p></li><li><p>Leverage machine learning algorithms to identify serendipitous data connections.</p></li><li><p>Fact check their internal AI solutions that often fall victim to hallucinations.</p><p></p></li></ul><blockquote><p><em>The BioBox data graph has given us a 360 view of our data. Data points that could have previously slipped through the cracks are now front and center. This has resulted in us spending significantly less time wondering if we made the right decisions in our discovery pipeline because we are leaving no stones unturned. </em></p><p>Director of Translational Biology </p></blockquote><div><hr></div><p>Interested in fact checking your AI solutions and conversing with your data? Send us an email at sales@biobox.io, we enjoy complex data challenges.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Interested in drug discovery, AI, and knowledge graphs? Subscribe to the BioBox Blog</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Fact Check your AI : Multi-Omic GraphRAG]]></title><description><![CDATA[If you could converse with your sequencing data, what would you ask it ?]]></description><link>https://blog.biobox.io/p/fact-check-your-ai-multi-omic-graphrag</link><guid isPermaLink="false">https://blog.biobox.io/p/fact-check-your-ai-multi-omic-graphrag</guid><dc:creator><![CDATA[Lauren Phillips]]></dc:creator><pubDate>Mon, 08 Jul 2024 17:37:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!2D9e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2D9e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2D9e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2D9e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:517484,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2D9e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!2D9e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4f4902-4fa2-41b0-8243-315655e65edd_1792x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As new AI solutions hit the market every day, data scientists are forced to discern the truth from hallucinations.</p><h2>Knowledge Graphs</h2><p>It is imperative to have a central source of truth to fact check the information provided by LLMs. One way in which data teams are tackling this challenging is through the use of knowledge graph. In short, a knowledge graph is a semantic representation of data points and their relationships. When it comes to drug discovery, think of it as a GPS for understanding disease biology. </p><p>At BioBox, we provide therapeutic teams with the tools to manage and curate a custom knowledge graph composed of private multi-omic sequencing data and public knowledge bases.</p><p>For a deep dive on how we leverage them to understand complex biological relationships check out this article below.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;76929a5e-78f0-43ef-b64c-90af8ae4cb0d&quot;,&quot;caption&quot;:&quot;We are now living in the era of biological big data. The cost of sequencing is rapidly decreasing. Simultaneously, we are witnessing the rise of bio-computing tools and platforms that enable scientists to process sequencing data at remarkable speed and scale. This abundance of data, when harnessed effectively, holds the key to making informed decisions &#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Beyond Data: The Rise of Knowledge Graphs in Accelerating Drug Discovery&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:11993470,&quot;name&quot;:&quot;Christopher Li&quot;,&quot;bio&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/867b2052-523e-47d1-8400-da653823576f_500x500.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2023-09-25T16:08:49.698Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/beyond-data-the-rise-of-knowledge&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:137154017,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2>Fact Checking LLMs with GraphRAG</h2><p>It&#8217;s no secret that knowledge graphs are typically hard to traverse - no one likes an ontology hairball. Not everyone wants to write in Cypher or memorize their entire graph schema. At BioBox, we have built a <a href="https://blog.biobox.io/p/biobox-biodata-intelligence-platform?r=2tds62&amp;utm_campaign=post&amp;utm_medium=web">suite of tools to make extracting valuable information</a> from the graph as easy as possible, one of which is GraphRAG. </p><p>RAG (Retrieval-Agumented Generation) is a framework used to improve the accuracy of information provided by LLMs. It improves LLM accuracy by retrieving information from external sources. This provides the LLM with up to date information grounded on the basis of relevant data rather than solely relying on the data the LLM was trained with.  GraphRAG take this a step further, and retrieves information from a knowledge graph.  GraphRAG enables teams to use natural language and obtain information directly from their knowledge graph.</p><p>Last week, <a href="https://www.microsoft.com/en-us/research/blog/graphrag-new-tool-for-complex-data-discovery-now-on-github/">data scientists at Microsoft published GraphRAG to github,</a> enabling users to leverage an LLM to extract knowledge from a collection of proprietary text documents. </p><p><strong>At BioBox we are excited to announce GraphRAG support for our multi-omic sequencing based graphs.</strong> Instead of spending hours wrangling data and fighting table.joins teams can use natural language and obtain information directly from their graph within seconds. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!af2G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!af2G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 424w, https://substackcdn.com/image/fetch/$s_!af2G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 848w, https://substackcdn.com/image/fetch/$s_!af2G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 1272w, https://substackcdn.com/image/fetch/$s_!af2G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!af2G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png" width="936" height="397" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2323d048-b861-4b66-b0d0-380549be89cf_936x397.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:397,&quot;width&quot;:936,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:66816,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!af2G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 424w, https://substackcdn.com/image/fetch/$s_!af2G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 848w, https://substackcdn.com/image/fetch/$s_!af2G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 1272w, https://substackcdn.com/image/fetch/$s_!af2G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2323d048-b861-4b66-b0d0-380549be89cf_936x397.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">GraphRAG on the BioBox platform. Diagram inspired by Neo4J https://neo4j.com/developer-blog/knowledge-graph-rag-application/</figcaption></figure></div><p>Check out the demo below. </p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;cdab6a01-1a37-45f9-b48d-fd93039555a9&quot;,&quot;duration&quot;:null}"></div><p>Our bread and butter is heterogeneous sequencing based data, however we support a wide variety of knowledge bases including NIH clinical trials, Reactome, Alliance Genome, OpenTargets and more. </p><p>This allows for questions such as </p><ul><li><p>What are the genes targeted by drugs used in clinical trials that study the disease Renal cell carcinoma and have an overall status of terminated?</p></li><li><p>What are the biological processes that genes that drugs act on that have a clinical precedence for the disease Renal cell carcinoma are involved in?&nbsp;</p></li><li><p>Which variants put you at risk for Renal cell carcinoma?</p><p></p></li></ul><p>We have been developing a closed beta with several therapeutic teams. Over the next few months we will be continuing to improve our GraphRAG to support more complex multi-hop queries and will be releasing it to all users. </p><p><br>Interested in fact checking your AI solutions and conversing with your data? Send us an email at support@biobox.io, we enjoy complex data challenges.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Interested in drug discovery, AI, and knowledge graphs? Subscribe to the BioBox Blog</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div><hr></div>]]></content:encoded></item><item><title><![CDATA[From Technology to Therapy: FDA unlocks new opportunities & challenges for platform therapeutic companies ]]></title><description><![CDATA[How new FDA guidelines are unlocking market opportunities for platform technology companies.]]></description><link>https://blog.biobox.io/p/from-technology-to-therapy-fda-unlocks</link><guid isPermaLink="false">https://blog.biobox.io/p/from-technology-to-therapy-fda-unlocks</guid><dc:creator><![CDATA[Lauren Phillips]]></dc:creator><pubDate>Fri, 07 Jun 2024 13:36:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!BME1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The FDA has released a new set of guidelines that may encourage platform therapeutic companies to pursue new drug targets and indications. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BME1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BME1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!BME1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!BME1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!BME1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BME1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:718468,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BME1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!BME1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!BME1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!BME1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed45935c-3b04-4197-b8ba-3e9aa38bb7bf_1792x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Platform therapeutic companies can be divided into two primary categories; those who actively pursue novel targets and those who prioritize licensing their platform technology often opting for well characterized targets. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Interested in learning more about drug discovery, knowledge graphs and AI? Subscribe to the BioBox Blog</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>According to the FDA, platform companies utilize a well understood and reproducible technology that includes a molecular structure, mechanism of action, delivery method, vector, nucleic acid sequence or any combination of the aforementioned. </p><p>The technology must: </p><ul><li><p>Be essential to the structure or function of a drug. </p></li><li><p>Can be adapted for, or used by more than one drug sharing similar structural elements.</p></li><li><p>Facilitate the development of more than one drug through a standardized process.</p></li></ul><p>A significant value proposition for platform companies lies not only the success of a single program but how many drug programs and indications it can facilitate, thus the desire to be a &#8220;designated platform technology&#8221;. </p><p>To be considered a &#8220;designated platform technology&#8221; the platform must: </p><ul><li><p>Be used by a drug approved under section 505 of the <a href="https://www.fda.gov/regulatory-information/laws-enforced-fda/federal-food-drug-and-cosmetic-act-fdc-act">FD&amp;C Act</a> or section 351 of the PHS Act. </p></li><li><p>Have preliminary evidence that demonstrates the platform technology can be used by more than one drug without adverse effects on quality, manufacturing or safety.</p></li><li><p>Data indicates that the technology has a reasonable likelihood to bring significant efficiencies to the drug development, manufacturing or review processes. </p></li></ul><p>Historically, the process for reusing platform technologies in new applications has been long and inefficient. However, new guidelines aim to operationalize this process. </p><h2>Guidance for Industry : Platform Technology Designation Program for Drug Development </h2><h4>What has changed? </h4><p>If a platform has an approved <a href="https://www.fda.gov/drugs/types-applications/abbreviated-new-drug-application-anda">ANDA</a>, <a href="https://www.fda.gov/drugs/types-applications/new-drug-application-nda">NDA</a> or <a href="https://www.fda.gov/drugs/types-applications/therapeutic-biologics-applications-bla">BLA</a> companies can request designation of that platform technology to enable leveraging of the technology in new or future applications.</p><h4>What are the benefits? </h4><p><strong>Shorter and more efficient timelines</strong></p><ul><li><p>Engaging early with the FDA enables team to receive timely advice. </p></li></ul><p><strong>FDA prioritization</strong></p><ul><li><p>If there is significant public health benefit the FDA <em>may </em>prioritize additional engagements leveraging the designated platform technology. </p></li></ul><p><strong>Have an arsenal of data for subsequent programs</strong> </p><ul><li><p>No need to repeat work that was already done, evidence that was used to support the efficacy of a platform technology for a previous submission can be used for subsequent applications. This includes batch, stability and non-clinical safety data.</p></li></ul><h4>What is required? </h4><p>Submissions require the following </p><ul><li><p>Description of the platform technology and how it meets FDA standards. </p></li><li><p>The approved ANDA, NDA or BLA for the technology. </p></li><li><p>Identification of shared structural elements between drug products and how the element facilitates the use of the platform technology. </p></li><li><p>Scientific support for the use of the platform technology across multiple drugs and how this would not affect safety, quality or manufacturing. </p></li><li><p>Risk assessment to evaluate differences between previous and proposed drug product.</p></li><li><p>Information to justify why the platform technology would bring significant efficiencies to the drug development , manufacturing or review process.</p></li></ul><p>Read the <a href="https://www.fda.gov/media/178938/download">full document here.</a></p><p>The guidelines are currently in review and the FDA is <a href="https://www.federalregister.gov/documents/2024/05/29/2024-11686/platform-technology-designation-program-draft-guidance-for-industry-availability-agency-information">accepting comments</a> until July 29th. </p><h3>The road to platform designation became a little easier, now what? </h3><p>Companies that were traditionally focused solely on licensing their technology might start looking towards developing their own target pipelines. Companies that already have a target pipeline may expand their approach to additional indications and targets.</p><blockquote><p>These processes involve complex decision making and requires a deep understanding of disease biology, patient population, and market needs. </p></blockquote><h2>Target selection and indication prioritization challenges </h2><p>Target selection and indication prioritization are inherently difficult.  These challenges become heightened for platform companies who have dedicated the majority of their resources to developing their technology. </p><p>Some of these challenges include</p><ol><li><p><strong>Complexity of Disease Biology</strong> </p><ul><li><p>Understanding the intricate mechanisms of diseases and identifying relevant molecular targets is a significant challenge. Diseases often involve multiple pathways and interactions, making it difficult to pinpoint the most effective targets for therapeutic intervention.</p></li></ul></li><li><p><strong>Multi-Modal Data</strong> <strong>Integration</strong></p><ul><li><p>Integrating multi-omic public and private data to identify actionable insights  requires countless hours of data wrangling and database maintenance. </p></li></ul></li><li><p><strong>Cross functional Collaboration</strong> </p><ul><li><p>Computational and Translational teams need a medium to share hypothesis, data and collaborate in real time. </p></li></ul></li><li><p><strong>Designing the right experiments to evaluate a target</strong></p><ul><li><p>Identifying the in vivo and in vitro experiments that will best evaluate a target is notoriously complex.   </p></li></ul></li><li><p><strong>Having enough evidence to support a target or indication</strong></p><ul><li><p>Combing through thousands of data points to find enough high conviction evidence to support a target often seems like a never ending task.</p></li></ul><p></p></li></ol><h2><strong>The BioBox Data Intelligence Platform: A Solution</strong></h2><p>The BioBox data intelligence platform can help platform therapeutic companies overcome these challenges. Here's how:</p><p><strong>Build a GPS to understand the complexity of disease biology: </strong>The backbone of target selection is understanding the biology of the disease. Through a fully customizable knowledge graph teams can map out the important biological concepts and relationships that the care about in the context of a disease.  Enabling them to identify serendipitous relationships and novel connections. We break down why this is important here. </p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;60150169-d527-48cb-8532-c9d6a7befd2c&quot;,&quot;caption&quot;:&quot;Introduction Biomedical data is highly fragmented. Just for describing genes alone there are at least 3 major systems for identifiers (e.g. Ensembl, Refseq, Entrez). There are countless concurrent efforts in ontology development, dataset harmonization, and public knowledge base creation. Together, they provide the context that scientists use to&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Building a GPS for disease biology&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:11993470,&quot;name&quot;:&quot;Christopher Li&quot;,&quot;bio&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/867b2052-523e-47d1-8400-da653823576f_500x500.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-05-28T13:55:18.259Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/building-a-gps-for-disease-biology&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:144948744,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><strong>Seamless Data Integration</strong>: With our API and SDK we make multi-omic data integration as easy as possible so you can leave the data wrangling and Table.Joins behind. </p><p><strong>Curate custom target and indication prioritization reports: </strong>Teams can leverage the knowledge graph to curate custom target and indication prioritization reports. Prioritize data points on the basis of scientific criteria that matters to the team.  Check out our case study on how we enable a research team to develop a target prioritization system for renal cell carcinoma here </p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;21604044-065c-4cf7-a694-86b86bc618ea&quot;,&quot;caption&quot;:&quot;Step 1: Defining Hypotheses and Organizing Data Before utilizing the BioBox platform, the company addressed the following key questions to establish a clear direction: What is our therapeutic approach? What is our biological hypothesis? What internal data will we work with?&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Target Prioritization: A case study on the BioBox Platform &quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:170284250,&quot;name&quot;:&quot;Lauren Phillips&quot;,&quot;bio&quot;:&quot;Chief Product Officer @ BioBox Analytics&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f315c3e5-16f3-485a-a3fd-d2eab2ed61e2_2790x2060.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2024-05-29T15:02:03.644Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.biobox.io/p/target-prioritization-a-case-study&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:145068216,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;BioBox Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04e14b78-1bd5-46df-afbb-26da6dfaa2b7_260x260.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><strong>Searchable and accessible data in one place.</strong> Keep track of all of the data supporting a program or a target in a single unified resource that is accessible by all team members. Search for specific relationships and data points across the graph. </p><p><strong>Develop an arsenal of targets. </strong>It&#8217;s no secret that not all targets make it through the validation phase. After substantial wet lab interrogation many targets get cut from the pipeline. Instead of having to start from ground zero, stockpile prospective targets and have backups for your backups. </p><p><strong>Scaleable Resource.</strong> The data intelligence platform is designed to grow and scale with your team and data. As new data is streamed into the platform reports and analyses automatically update. </p><p>If you would like to learn how we enable therapeutic research teams to save time and resources while creating a collaborative ecosystem for target prioritization<a href="https://biobox.io/book-a-demo"> book a demo with us here. </a></p><div><hr></div><h1>About BioBox</h1><p><a href="https://biobox.io/">BioBox</a> is a data intelligence platform for drug discovery. Rapidly build a custom data graph and deploy graph ML models for target prioritization, indication selection, and MOA analysis. Teams using BioBox make faster and better data driven decisions to de-risk drug programs and get assets into clinical studies quicker.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Interested in learning more about drug discovery, knowledge graphs and AI? Subscribe to the BioBox Blog</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Target Prioritization: A case study on the BioBox Platform ]]></title><description><![CDATA[How an early stage therapeutic company built a custom target prioritization system for renal cell carcinoma on the BioBox Platform.]]></description><link>https://blog.biobox.io/p/target-prioritization-a-case-study</link><guid isPermaLink="false">https://blog.biobox.io/p/target-prioritization-a-case-study</guid><dc:creator><![CDATA[Lauren Phillips]]></dc:creator><pubDate>Wed, 29 May 2024 15:02:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SQSF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2><strong>Step 1: Defining Hypotheses and Organizing Data</strong></h2><p>Before utilizing the BioBox platform, the company addressed the following key questions to establish a clear direction:</p><ol><li><p>What is our therapeutic approach?</p></li><li><p>What is our biological hypothesis? </p></li><li><p>What internal data will we work with?</p></li><li><p>What external data sources are relevant?</p></li></ol><p>Answering these questions provided a foundation for the structure of their knowledge graph.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Interested in Knowledge Graphs, Machine Learning or Drug Discovery? Subscribe to learn more.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>Step 2: Configuring a Custom Knowledge Graph</strong></h2><p>The core of the BioBox platform is a fully customizable knowledge graph. Think of the knowledge graph as a GPS for disease biology where the roads are relationships between destinations such as genes, pathways, variants etc. Explicit directions (data sources) tell you how you can get from point A to B. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SQSF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SQSF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SQSF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1694229,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SQSF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!SQSF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9a58c00-96ab-41bd-94a9-01b10ee66b2b_3420x2328.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Creating and maintaining a knowledge graph and the associated ontologies, is often a complex and time-consuming task. BioBox simplified this process, allowing therapeutic teams to focus on research rather than data wrangling.</p><p>Translational and computational teams mapped out the important biological relationships on the BioBox platform. The team started by loading preconfigured data packs from BioBox, which provided a robust initial framework. These data packs contained versioned ontologies and observations from consortiums such as OpenTargets, Alliance Genome, Clinical Trials and more. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6-Rj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6-Rj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6-Rj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1162507,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6-Rj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!6-Rj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e59be76-6f72-41fa-90c1-ef9bee7a4a0e_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>They then uploaded their internal data, integrating all biological metadata with predefined concepts and relationships within the knowledge graph.</p><blockquote><p>Wrangling ontologies, consolidating public knowledge bases and integrating internal data are processes that often take weeks to accomplish. Through the use of the intuitive UI and API we were able to accomplish this in a few hours. </p><p>- Director of computational biology</p></blockquote><p>For more information on knowledge graphs and their role in drug discovery, refer to our detailed explanation <a href="https://blog.biobox.io/p/beyond-data-the-rise-of-knowledge">here</a>.</p><h2><strong>Step 3: Curating a Graph Model</strong></h2><p>A graph model maps important relationships between two concepts within the knowledge graph. For target prioritization, the concepts used were Genes and Diseases.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V6v6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V6v6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 424w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 848w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 1272w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V6v6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png" width="1456" height="979" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:979,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1003907,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V6v6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 424w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 848w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 1272w, https://substackcdn.com/image/fetch/$s_!V6v6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdcc70b6d-4b0f-4167-9732-3bc6f8c126c8_3332x2240.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K4Jj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K4Jj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K4Jj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1155260,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!K4Jj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!K4Jj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aee358d-f5a6-4934-bc8d-a2736763101f_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Each relationship, or "line of evidence," received a weighted score based on its importance. The team defined scientific criteria linking genes to diseases and categorized lines of evidence. Here are a few examples: </p><ul><li><p><strong>Gene Expression:</strong></p><ul><li><p>Upregulated genes from internal bulk RNAseq and scRNAseq data</p></li><li><p>Upregulated genes from TCGA bulk RNAseq data</p></li></ul></li><li><p><strong>Genomic Observations:</strong></p><ul><li><p>Variant frequencies in diseased populations (Internal data)</p></li><li><p>Variants that contribute to increase gene expression (OpenTargets)</p></li><li><p>Variants that are at risk/ protective for a disease (OpenTargets)</p></li></ul></li><li><p><strong>Epigenetic Information</strong></p><ul><li><p>Promoter regions that are hypomethylated in disease populations (methylation array (Illumina 850k)</p></li><li><p>Promoter regions there are acetylated in disease populations (H3K27ac ChIPseq)</p></li></ul></li><li><p><strong>Pathways and Biological Function:</strong></p><ul><li><p>Genes associated with specific pathways (Internal data + Reactome) </p></li><li><p>Genes that are markers for disease (Internal data + Alliance of Genome Resources)</p></li></ul></li><li><p><strong>Safety and Efficacy:</strong></p><ul><li><p>Clinical trial information, including trials with drug failures (ClinicalTrials)</p></li><li><p>Safety liabilities associated with genes (OpenTargets)</p></li><li><p>Drugs approved to target gene of interest (Chembl)</p></li></ul></li></ul><p>Negative scores were assigned to lines of evidence indicating poor target suitability, such as genes targeted by drugs that failed clinical trials. These scores were used to compute an overall weighted score for each gene, ranking them according to the team&#8217;s scientific criteria. </p><h2><strong>Step 4: Reporting &amp; Ranking Targets</strong></h2><p>Once the graph model was curated, the team generated reports to evaluate targets. They selected "Renal Cell Carcinoma EFO:0000681" as the disease of interest and left Genes unbiased. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0y7I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0y7I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0y7I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1071697,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0y7I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!0y7I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631acdc-c28f-4000-a929-443c38e9581d_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Within seconds, a report of genes ranked according to the scientific criteria that they curated was generated. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-PVm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-PVm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-PVm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/abea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:849915,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-PVm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!-PVm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabea17f9-8a72-4677-8af4-b557c9e7e366_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>By selecting a target and a line of evidence scientists could see the supporting data identified within their knowledge graph.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_d87!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_d87!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!_d87!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!_d87!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!_d87!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_d87!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:907936,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_d87!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!_d87!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!_d87!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!_d87!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71739ca3-3247-4d47-813d-c44059bbc100_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The initial report highlighted well-characterized targets like <a href="https://jeccr.biomedcentral.com/articles/10.1186/s13046-021-02026-1">BRIC5, AKT, and HIFa</a>. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c2rU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c2rU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c2rU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png" width="1456" height="991" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:991,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:785995,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c2rU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 424w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 848w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 1272w, https://substackcdn.com/image/fetch/$s_!c2rU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F257996c7-5f9d-4fff-92d8-5ba8527e2a61_3420x2328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Using similarity and community detection algorithms, natively enabled in the graph model, the team was able to <strong>explore novel targets</strong>. Running a Jaccard similarity algorithm yielded targets that scored <strong>similar to well-characterized targets</strong> along the same lines of evidence. In other words, these targets might behave very closely to well-studied targets like AKT etc. This represented just one example of how the BioBox platform enabled principled and data-driven hypothesis generation through the exploitation of the graph data. </p><p>Based on the evaluation, the team adjusted weights and added additional lines of evidence to refine their prioritization system. </p><p>As new data was integrated into the knowledge graph, reports and target scores automatically updated, saving the computational team valuable time. </p><p>The impact of proprietary data on target prioritization was evident as the team compared reports from graph models with and without their internal data. </p><p>To further characterize a target the team explored all of its relationships within the graph explorer. A full break down on our graph explorer can be <a href="https://blog.biobox.io/p/building-a-gps-for-disease-biology">found here</a>. </p><h2><strong>Conclusion</strong></h2><p>By leveraging the BioBox platform, the company efficiently built a custom target prioritization system for renal cell carcinoma. The comprehensive integration of internal and external data sources, coupled with BioBox's customizable knowledge graph and robust analytical tools, provided several key benefits:</p><ul><li><p><strong>Time Savings</strong>: The streamlined ontology management and automated data integration allowed the team to quickly get started and maintain momentum, reducing the time typically spent on data wrangling.</p></li><li><p><strong>Cost Efficiency</strong>: By efficiently organizing and analyzing data, the company was able to focus resources on the most promising targets, reducing unnecessary expenditures on less viable candidates.</p></li><li><p><strong>Enhanced Confidence</strong>: The weighted scoring system and robust evidence framework gave the team greater confidence in their target selection, supported by clear, quantifiable data.</p></li><li><p><strong>Comprehensive Evidence</strong>: The inclusion of diverse lines of evidence, such as gene expression, genomic observations, and clinical trial data, provided a thorough understanding of each target's potential, ensuring well-informed decision-making.</p></li><li><p><strong>Cross-Functional Collaboration</strong>: The platform&#8217;s collaborative features enabled seamless communication and data sharing among team members, fostering a cohesive and efficient working environment.</p><p></p></li></ul><p><strong>Note:</strong> Proprietary data has been redacted for privacy. Sequencing-based observations in images above have been derived from TCGA data.</p><div><hr></div><h1>About BioBox</h1><p><a href="https://biobox.io/">BioBox</a> is a data intelligence platform for drug discovery. Rapidly build a custom data graph and deploy graph ML models for target prioritization, indication selection, and MOA analysis. Teams using BioBox make faster and better data driven decisions to de-risk drug programs and get assets into clinical studies quicker.</p><p>To learn more about how BioBox can supercharge your discovery pipeline, <a href="https://biobox.io/book-a-demo">book a demo</a> with one of our team members!</p><div><hr></div><p></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading the BioBox Blog! Subscribe to learn more about knowledge graphs, machine learning and drug discovery.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Building a GPS for disease biology]]></title><description><![CDATA[Unravel and discover hidden connections using the BioBox Graph Explorer]]></description><link>https://blog.biobox.io/p/building-a-gps-for-disease-biology</link><guid isPermaLink="false">https://blog.biobox.io/p/building-a-gps-for-disease-biology</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Tue, 28 May 2024 13:55:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1><p>Biomedical data is highly fragmented. Just for describing genes alone there are at least 3 major systems for identifiers (e.g. Ensembl, Refseq, Entrez). There are countless concurrent efforts in ontology development, dataset harmonization, and public knowledge base creation. Together, they provide the context that scientists use to <strong>interpret</strong> data. However, in practice, whether you start with literature, or go right to the source data, at some point in your journey you&#8217;re going to end up with 100 browser tabs open trying to connect the dots together.</p><p><strong>It&#8217;s not you, its the data.</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>To solve this, <a href="https://biobox.io">BioBox</a> developed the Graph Explorer. An interactive application to traverse your custom knowledge graph, just like a map, and develop a GPS for understanding disease biology.</p><h1>BioBox Graph Explorer</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tLOy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tLOy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 424w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 848w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 1272w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tLOy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png" width="1456" height="921" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:921,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1653345,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tLOy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 424w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 848w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 1272w, https://substackcdn.com/image/fetch/$s_!tLOy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F679109b9-f791-40f4-9cec-d872929b3d5c_4320x2734.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Graph Explorer is an ontology-aware application that allows scientists to:</p><ol><li><p><strong>Discover Hidden Connections</strong>: Dive deep into the interrelationships between real-world data objects. Whether you're linking genes to diseases or phenotypes to pathways, the Graph Explorer reveals patterns that are crucial for breakthrough discoveries.</p></li><li><p><strong>Enhanced Data Mining Capabilities</strong>: Leverage advanced algorithms such as A*, Dijkstra&#8217;s, PageRank, and k-means to navigate and dissect complex data structures. Whether you're finding the shortest paths between two indications of interest or identifying significant gene regulators within your interaction network, our tool simplifies these tasks, making them more accessible and actionable.</p></li><li><p><strong>Collaborative Exploration</strong>: Share your findings effortlessly with colleagues. The Graph Explorer allows you to save and share exploration sessions, ensuring that your team builds on collective knowledge and biological insights, fostering a collaborative research environment.</p></li></ol><p></p><h2>Navigate Through Information Quickly</h2><p>Hours of valuable research time are wasted trying to figure out how data is connected and sifting through endless webpages and databases. The graph explorer centralizes all these disparate data sources into a singular view.</p><p>Load in an object from your data graph and instantly see all the connections associated with it. Importantly, relationships have semantic value and convey a meaningful association between two real-world objects.</p><p>For example, in a gene-centric view, starting with <em><strong>KLF4</strong>:</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AWDL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AWDL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 424w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 848w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 1272w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AWDL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png" width="1456" height="834" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:834,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:829862,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AWDL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 424w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 848w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 1272w, https://substackcdn.com/image/fetch/$s_!AWDL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4fdd8f-6d37-45a0-9073-cbc46f1b0bf0_3496x2002.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You can select one or more nodes by clicking or brush selecting. Active nodes in your selection will display a data card that lists out all data properties annotated on the node. The <strong>relationships panel </strong>lists the known edges connecting to KLF4 where:</p><ul><li><p>the direction of the arrow indicating the orientation of the connection</p></li><li><p>the badge label representing the Concept associated with the connected object through that relationship, and</p></li><li><p>the number in parentheses indicating the quantity of distinct edges of that type.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R0g6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R0g6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 424w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 848w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 1272w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R0g6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png" width="1344" height="130" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:130,&quot;width&quot;:1344,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:19232,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!R0g6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 424w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 848w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 1272w, https://substackcdn.com/image/fetch/$s_!R0g6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd34f821-ec27-48d3-92c2-cdf084ef9ae4_1344x130.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Ex: There are 11 Regulatory Features that are predicted to regulate KLF4</figcaption></figure></div><h2>Exploring Data Connections</h2><p>Each relationship is backed by either a curated assertion (e.g. from a known ontology) or extracted from a data source. These data sources can be internal proprietary datasets or from literature and public data. In this graph, we&#8217;ve loaded in Locus2Gene scores from <a href="https://www.opentargets.org/">Open Targets</a> Genetics portal and transformed them into <strong>VariantAssociation </strong>hyper nodes. By expanding the &#8220;has association&#8221; edges, we can instantly load in GWAS inferred associations between traits/diseases and KLF4.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SbFW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SbFW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SbFW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1472882,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SbFW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!SbFW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5f7e253-91b9-4ace-81b9-4961d1003008_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Through the directionality and differences in edge types, we can model biological complexity directly in the graph topology. Adding in additional styling helps to visually distinguish these patterns.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-7oN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-7oN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-7oN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1274778,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-7oN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!-7oN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd2c0042-5576-4e2d-b800-8234e4103e43_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Zoom out to see the bigger picture</h2><p>As we start to walk through the data graph, we can ramp up the data exploration space by expanding all relationships at a Concept level. In our KLF4 example, now that we&#8217;ve found some Disease connections through GWAS linkages, we can load in all known disease markers into the canvas.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Qd7E!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qd7E!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qd7E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1558255,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Qd7E!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!Qd7E!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05475dfb-8c25-47b5-8650-3dccf390f533_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Load in disease markers in one click</figcaption></figure></div><p>Next, we can connect the Gene nodes in the canvas through the &#8220;activates&#8221; edge to reveal a sub-network of gene regulatory interactions. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7b0u!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7b0u!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7b0u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2119348,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7b0u!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!7b0u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe809fa70-0b0a-447b-8558-df06854905d1_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Run graph algorithms</h2><p>At this point, the graph is dense, and what we&#8217;ve systematically done is the following logic:</p><ul><li><p>Starting with KLF4</p></li><li><p>Find some GWAS associations between KLF4 and diseases</p></li><li><p>For those diseases, find all the gene markers</p></li><li><p>For all gene markers, connect their activation regulatory patterns</p></li></ul><p>While it is difficult for us to reason over the graph visually, they are the perfect substrate for graph algorithms that can yield interesting data-driven insights. For example, now that we&#8217;ve loaded in a gene subnetwork from a given context, we can run community detection algorithms like <a href="https://en.wikipedia.org/wiki/PageRank">PageRank</a> to score and detect highly influential genes, with the click of a button!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lac8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lac8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!lac8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!lac8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!lac8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lac8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1646843,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lac8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!lac8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!lac8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!lac8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d1790d4-f09c-42ea-bf4b-27a2cfa47b31_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Sharing a biological narrative</h2><p>You can take a snapshot of your current session that captures the data, styling, and results of your analysis so that you can continue working on it later, or perhaps, share it with a colleague. There are no limits to how many sessions can be saved, and that means you have an infinite canvas and the tools to paint your biological narrative and share it with your team.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1qIa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1qIa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1qIa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png" width="1456" height="849" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/be407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:849,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1800908,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1qIa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 424w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 848w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 1272w, https://substackcdn.com/image/fetch/$s_!1qIa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe407063-7efe-42b6-a948-5d1c606d109f_3584x2090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h1>About BioBox</h1><p><a href="https://biobox.io">BioBox</a> is a data intelligence platform for drug discovery. Rapidly build a custom data graph and deploy graph ML models for target prioritization, indication selection, and MOA analysis. Teams using BioBox make faster and better data driven decisions to de-risk drug programs and get assets into clinical studies quicker.</p><p>To learn more about how BioBox can supercharge your discovery pipeline, <a href="https://biobox.io/book-a-demo">book a demo</a> with one of our team members!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[BioBox: BioData Intelligence Platform ]]></title><description><![CDATA[Build and deploy custom reasoning engines to make better data-driven decisions in drug discovery]]></description><link>https://blog.biobox.io/p/biobox-biodata-intelligence-platform</link><guid isPermaLink="false">https://blog.biobox.io/p/biobox-biodata-intelligence-platform</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Wed, 01 May 2024 19:54:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Xwro!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1><p>If you want to have a successful drug program, target selection is the most important thing you need to get right. The reality is that if you have a bad target, nothing else really matters, it is never going to work. This is why scientists spend a lot of time and resources meticulously analyzing as much data as possible to develop a compelling scientific narrative. We need to better understand the biology of the disease. The more evidence we can find, the more confident we can be in our decision-making. In this post we present our framework for accelerating scientific reasoning in drug discovery and introduce BioBox: The BioData Intelligence Platform.</p><h1>BioData Intelligence Platform</h1><p>We envision a platform that acts as the <strong>central repository of collective knowledge</strong> that integrates multiple data sources to weave together a <strong>cohesive data graph</strong>. Biologists can interact with and explore this graph in a simple and intuitive interface. Complex scientific logic can be modeled and tuned to assist scientists in making strong, well-reasoned, data-driven decisions.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xwro!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xwro!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xwro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png" width="728" height="482" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:381132,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Xwro!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!Xwro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79dd2c3b-3c58-4205-a5b0-cf61769d4a62_2206x1460.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The platform is divided into 3 layers:</p><ol><li><p><strong>Ontology - </strong>Defining what things mean in your organization</p></li><li><p><strong>Model </strong>- Build custom graph ML models that scores things your way</p></li><li><p><strong>Application - </strong>Put models to use in purpose built apps designed for biology</p></li></ol><p></p><h1>L1-Ontology: Define your data language</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!a_0d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!a_0d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!a_0d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png" width="1456" height="964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:612774,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!a_0d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!a_0d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896a0159-dcb1-4440-9653-7252bc072bd3_2206x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Speed and decisiveness is determined by the clarity of communication between different stakeholders when making a target selection decision. In practice, this boils down into getting 3 things right:</p><ol><li><p>Getting everyone on your team to agree on what things mean in your organization</p></li><li><p>Improve the readability and interpretability of critical business objects</p></li><li><p>Capture how data enters and gets transformed in your research practices</p></li></ol><p>Your ontology is where you <strong>create a single source of truth for your team</strong>. Every concept is unambiguously defined and their relatedness explicitly declared. As you begin building and populating your data graph, it <strong>automatically</strong> keeps track of its history to ensure proper <strong>data lineage, governance, and provenance</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YJGR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YJGR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YJGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png" width="1456" height="964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:543544,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YJGR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!YJGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ba28829-4c82-46c2-91b1-374333ff27eb_2206x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>L2-Model: Declare your scientific logic</h1><p>Part of good scientific communication is being able to justify your rationale with data. Certain observations support our hypothesis and some may give us reason to question our assumptions. The strength of the influence depends on the type of evidence we are given. For example, you&#8217;d probably give more relevance to sequencing analyses of patient-derived tumor cells than to assays done on immortalized cells. In other words, evidence are valued with different <strong>weights. </strong>We use these considerations in our mental models that, at its fundamental level, is trying to explain the relatedness between 2 concepts.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kCE4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kCE4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kCE4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png" width="1456" height="964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:582104,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kCE4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!kCE4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59818373-5b5b-4d78-9d74-85ca6aff1a00_2206x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>In BioBox, we capture this logic into discrete graph models that are expertly tuned by your scientists to automatically compute a score based on <strong>all data</strong> in your graph. As new information is added to your graph, these graph models recompute to ensure your scores are always up to date. What this means for research teams is that you have a tractable and explainable way to justify scientific reasoning, that is <strong>unique to your science</strong>.</p><p>Because these graph models can be built against any two concepts in your graph, you could use them to score Gene &#8594; Phenotype, Drug &#8594; Disease, Gene &#8594; Biological Process, etc.</p><p>Graph models allows your scientists to <strong>transparently and explicitly describe data interpretation strategies</strong> and <strong>facilitates strong data-driven decision making</strong>.</p><p></p><h1>L3-Application: Putting the models and graph to work</h1><p>The platform makes building the data graph and the models easy, but to get real business value, computational and translational scientists must be able to interact with them. The application layer of the BioBox platform refers to the collection of ontology and graph-aware software tools that enable your scientists to interactively mine the graph for novel insights, generate data-driven reports powered by your graph models, and provides API solutions to integrate your data graph into your internal ML models.</p><h2>Automatic Report Generation From Graph Models</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!liP8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!liP8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!liP8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!liP8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!liP8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!liP8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png" width="1456" height="964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:530000,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!liP8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!liP8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!liP8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!liP8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffcfd4469-66cf-43bb-a680-7026ea9b04ea_2206x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Using your graph models, scientists can rapidly <strong>generate scored target prioritization reports in seconds</strong>. These reports allows you to:</p><ul><li><p>generate a scaled score of associations</p></li><li><p>get a full trace of every data point used to generate scores</p></li><li><p>one-click data snapshot and share findings with cross-functional teams</p></li></ul><h2>Interactive Data Graph Exploration</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1mCU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1mCU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1mCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png" width="1456" height="964" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:603891,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1mCU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 424w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 848w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!1mCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd93e094b-8100-4dc2-87ac-e91f7c33c55f_2206x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Put the power of your data moat in the hands of your scientists by allowing them to explore it interactively. The BioBox Graph Explorer is a ontology-aware way to navigate the rich connectivity and context of your data graph. Scientists can <strong>run graph algorithms on the fly</strong>, including community detection and path finding, to discover new insights embedded inside your proprietary and/or publicly available data. Sessions can be saved and shared with cross-functional teams to collaboratively build a scientific narrative.</p><h1>Who is this for?</h1><p>This platform caters to teams who  </p><ul><li><p>Are looking to obtain an in-depth understanding of the biology of a disease. </p></li><li><p>Need answers to complex biological questions. </p></li><li><p>Have multi-modal data (NGS, low throughput, clinical metadata, etc).</p></li><li><p>Need a flexible and customizable system that will scale with additional data and team members. </p></li><li><p>Have an internal AI strategy. </p></li></ul><h1>Who is this not for?</h1><p>This platform is not for </p><ul><li><p>Teams looking to strictly mine literature. </p></li><li><p>Teams without multi-modal data. </p></li><li><p>Teams who do not have specific biological questions they are trying to answer. </p></li><li><p>Teams looking for one size fits all knowledge graphs. </p></li></ul><h1>Bottom Line</h1><p>The BioBox Platform is a fully integrated system for research teams to answer complex biological questions using multi-modal data. It helps you solve three things:</p><ol><li><p><strong>Clarify Communication:</strong> By establishing a shared vocabulary and a clear understanding of data relationships through the ontology layer, BioBox ensures that all team members are aligned, enhancing collaboration and efficiency in decision-making processes.</p></li><li><p><strong>Enhance Decision-Making:</strong> Through the model layer, BioBox empowers scientists to quantify their hypotheses and test their theories with precision. This structured approach to scientific reasoning enables you to make data-driven decisions with confidence, backed by a dynamic scoring system that adapts to new data.</p></li><li><p><strong>Drive Innovation:</strong> The application layer transforms complex data interactions into actionable insights, enabling scientists to discover novel therapeutic targets and strategies. This not only accelerates the research and development process but also opens up new avenues for innovation within drug discovery.</p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Beyond Data: The Rise of Knowledge Graphs in Accelerating Drug Discovery]]></title><description><![CDATA[We are now living in the era of biological big data.]]></description><link>https://blog.biobox.io/p/beyond-data-the-rise-of-knowledge</link><guid isPermaLink="false">https://blog.biobox.io/p/beyond-data-the-rise-of-knowledge</guid><dc:creator><![CDATA[Christopher Li]]></dc:creator><pubDate>Mon, 25 Sep 2023 16:08:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rzqF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We are now living in the era of biological big data. The cost of sequencing is rapidly decreasing. Simultaneously, we are witnessing the rise of bio-computing tools and platforms that enable scientists to process sequencing data at remarkable speed and scale. This abundance of data, when harnessed effectively, holds the key to making informed decisions around <a href="https://bitsinbio.substack.com/p/the-backbone-of-drug-discovery">target identification</a> and validation stages in drug discovery. Selecting a good target to drug is one of the earliest and most consequential decisions made when developing new therapeutics.</p><p>Today, the challenge often lies not in the acquisition of data, but in its interpretation and utilization. Simply having data is no longer enough. To fully exploit the volume and variety of data available, we need systems to capture the relationships and patterns from observations to form a more holistic view. This is where Knowledge Graphs (KG) truly shine, and when properly used, will completely transform that way data is used in drug discovery. In this article, we describe at a high level, what a Knowledge Graph is and how to get started crafting one.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rzqF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rzqF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 424w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 848w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 1272w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rzqF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp" width="602" height="602" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:852,&quot;width&quot;:852,&quot;resizeWidth&quot;:602,&quot;bytes&quot;:43742,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rzqF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 424w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 848w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 1272w, https://substackcdn.com/image/fetch/$s_!rzqF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16a9357b-9e0e-4614-b934-3b0f1e2a3fe1_852x852.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Data to Impact Journey - Source: Gaping Void</figcaption></figure></div><p></p><h1>What is a Knowledge Graph?</h1><p>At its core, a Knowledge Graph (KG) is a formal structure designed to represent information as a set of entities and the intricate relationships between them. In simple terms, we can think of individuals like <em>BRCA1</em> or <em>Breast Cancer</em> as nodes, and relationships like <em>mutated_in</em> as descriptive edges between nodes.</p><p>When developed properly, it can enable automated <strong>reasoning </strong>and <strong>inference</strong> to unearth hidden implicit connections that often belie breakthroughs and discoveries. Many leading biotech and pharmaceutical companies have been ingesting data and constructing internal knowledge graphs to use in a variety of applications from drug repurposing to target identification.</p><div><hr></div><p><strong>DeepLink</strong>, Janssen Pharmaceuticals: Using DeepLink, an internal knowledge graph platform, researchers successfully identified two hallmark targets (one of them is now in their portfolio) for pulmonary hypertension.</p><p><strong>ARCH, </strong>AbbVie: ARCH is the name of AbbVie&#8217;s internal knowledge graph. In a case study, AbbVie scientists used the embedded logic to discover a putative therapeutic in their portfolio that could be used to treat Carney Complex, a rare and deadly disease with no approved treatments.</p><div><hr></div><p>The essential and most important component of a KG is the underlying ontology. Ontologies are semantic data models which determine the <em>types</em> of concepts that exist in the KG. Importantly, they are distinct from individual (data points) in the KG because they represent entire categories of concepts, not a specific named concept. For example, instead of describing the gene SOX2 and specific properties about SOX2, the ontology focuses on defining the concept of <em>Gene</em> and capturing the characteristics that a <em>Gene</em> should have. Some of these characteristics can be relationships to other concepts in the ontology. For example, we can describe the relationship between a <em>Gene </em>and <em>Pathway</em> with <em>participates_in</em>. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_4Ec!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_4Ec!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 424w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 848w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 1272w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_4Ec!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png" width="470" height="108" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:108,&quot;width&quot;:470,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_4Ec!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 424w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 848w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 1272w, https://substackcdn.com/image/fetch/$s_!_4Ec!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6239255c-72bb-4a92-a320-01dde88eb42d_470x108.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Ontology Representation</figcaption></figure></div><p>Using this ontology, data can be structured in an interpretable way, and forms the KG. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d9wu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d9wu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 424w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 848w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 1272w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d9wu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png" width="653" height="108" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:108,&quot;width&quot;:653,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d9wu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 424w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 848w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 1272w, https://substackcdn.com/image/fetch/$s_!d9wu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F986dddb2-61ae-4fa1-b0c9-f90dde7a29d2_653x108.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Instance-level Representation</figcaption></figure></div><h2>Tangible Benefits: Drug Repurposing</h2><p>For biotechnology or pharmaceutical companies that have existing assets in their portfolio with a defined target and mechanism, we can expand the market value of the asset if we can find alternate use-cases for it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B2U9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B2U9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 424w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 848w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 1272w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B2U9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png" width="1299" height="724" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:1299,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B2U9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 424w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 848w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 1272w, https://substackcdn.com/image/fetch/$s_!B2U9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1480122c-a434-4d7a-9057-53dd0a385844_1299x724.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Suppose your asset <em>BX123</em> selectively inhibits the <em>G1</em> gene and is an approved treatment for <em>Disease Y </em>by rescuing an overactive pathway <em>P1</em>. When enough data is modeled into a knowledge graph, we can start to make some inferences and uncover new insights. For example, it is also known that <em>P1</em> activates transcription of <em>G2</em>, which in turn, activates <em>P2</em> that is implicated in the etiology of <em>Disease X</em>. Through walking the KG, it can be inferred that the asset <em>BX123</em> could be a viable treatment for <em>Disease X</em>, expanding the use-cases for the asset and ultimately the market value.</p><h2>Tangible Benefits: Target Identification</h2><p>KGs can also help researchers make informed decisions when selecting a target to drug. Using sequencing datasets, we can load in observations such as gene expression, differential expression, mutations etc. to enrich the knowledge graph. For example suppose we have normal tissue expression data and disease-specific patient tumor sequencing data.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2-vk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2-vk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 424w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 848w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 1272w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2-vk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png" width="931" height="705" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:705,&quot;width&quot;:931,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2-vk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 424w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 848w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 1272w, https://substackcdn.com/image/fetch/$s_!2-vk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F428eeaab-b859-44b4-823c-3ef75cf06ee0_931x705.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>At first glance, <em>G1 </em>and <em>G2</em> appear to be viable targets as they both participate in an overactive disease-causing pathway, but after considering information loaded from sequencing data, <em>G2 </em>seems to be most viable. Here&#8217;s the reasoning:</p><ul><li><p><em>G2 </em>is selectively upregulated in disease</p><ul><li><p><em>G2 </em>is poorly expressed in normal tissue</p></li><li><p><em>G2</em> is significantly upregulated in <em>Disease X </em>compared to healthy normal controls</p></li></ul></li><li><p><em>G1 </em>is not contributing to overactive pathway</p><ul><li><p><em>G1</em> is abundantly expressed in normal tissues and not found to be differentially expressed in Disease X. It stands to reason that if <em>G1</em> is implicated in <em>Disease X</em>, all brains would have <em>Disease X</em>.</p></li></ul></li></ul><p>In this hypothetical scenario, it illustrates how the incorporation of observations from sequencing datasets into the knowledge graph enables researchers to conduct evidence-based reasoning quickly and effectively.</p><h1>Challenges and Considerations</h1><p>Despite the tangible benefits that KGs can bring into a biotech research team to accelerate drug discovery, there are significant challenges to making the KG effective. Here are some considerations to make if you are thinking of starting a KG initiative.</p><h2>Build a shared dictionary</h2><p>Don&#8217;t be an ontology goblin.</p><p>The goal is to have shared knowledge. The ontology should be built with all stakeholders, from computational biologists to business teams, in mind. Successful companies implementing KGs have a common understanding of how things are defined using a shared vocabulary in their organization. This massively reduces communication errors and makes data ingestion, harmonization, and reporting a breeze. It matters not where this is done, but that it is visible and available to everyone.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Byd0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Byd0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 424w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 848w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 1272w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Byd0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png" width="1456" height="866" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:866,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:249652,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Byd0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 424w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 848w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 1272w, https://substackcdn.com/image/fetch/$s_!Byd0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0c9ad89-4072-4ec3-bf53-700bc9b1498b_2156x1282.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">BioBox Library - Shared Dictionary</figcaption></figure></div><p></p><h2>Connect data streams</h2><p>Before committing to building a KG, be sure to plan out where the data comes from and if it can be reliably and efficiently connected to the KG. Building the perfect ontology and infrastructure is meaningless if you cannot consistently add data to the graph. While they can vary by use-case, the most common ones would be LIMS metadata, bioinformatic workflows and outputs, ELNs etc.</p><h1>Bottom Line</h1><p>Knowledge Graphs are tools that can drive massive acceleration in the drug discovery process. There is growing adoption and use of knowledge graphs in big pharma and biotech. When harnessed effectively, it enables research teams to reason over the data at scale and unearth hidden patterns.</p><p>However, effective KGs require significant time and capital resources, which may limit adoption in early-stage startups and biotech companies, despite the immense value KGs hold. This is where purpose-built platforms like BioBox can help. We empower smaller research teams with dedicated tools to:</p><ul><li><p>collaboratively build ontologies, with access controls and versioning</p></li><li><p>integrate with data streams to power your knowledge graphs</p></li><li><p>explore and mine knowledge graphs for insights</p></li></ul><p></p><p>Ready to get started? <a href="https://biobox.io/book-a-demo">Book a Demo</a></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.biobox.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading BioBox Blog! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>