What Is the Medical Entity Relationship Extraction and Search Problem?

RELATED WORK (tx)
Our PubMed entity relation extraction and search problem is related to two fields of study: medical entity relation mining and entity-related search systems. In the medical text-mining domain, there is prior work on the relationships among medical entities recorded in knowledge databases [1,2]. The most popular is the Comparative Toxicogenomics Database (CTD), whose data include Chemical-Gene, Chemical-Disease and Gene-Disease relations. Unlike the specific, well-defined relations between Chemical and Gene curated by medical professionals, the relations between Disease and Chemical are very general and have only a few relation types, such as “therapeutic” and “mechanism”, rather than relation phrases. Another similar …
However, because our system must support online search queries, time-consuming NLP techniques such as parse trees are not applicable in our case. Thus, we use POS tag sequences to classify our patterns, which have three types of components: Entity, Verb Phrase and Entity Modifier. The entity modifier is a new idea, introduced to specify a sub-level of an entity or to describe a relation that holds under a certain condition. Note that by adding entity modifiers, our system can not only refine entity relations from general types (e.g. the only pattern in OpenIE: E VP E) into more specific ones, but also help users better understand the relation phrases. In Figure XX, the entity modifier is used to explain the occurrence of opposite relation phrases for the same …
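As a rough illustration of matching patterns on POS tag sequences without a parse tree, the sketch below maps each tag to a one-letter code and runs a regular expression over the resulting string. Everything specific here is an assumption made for the example, not the system's actual design: the Penn Treebank-style tags, the `ENT` tag for an already-recognized entity, the `TAG_CODES` table, and the rule that an entity modifier is a preposition followed by a noun phrase.

```python
import re

# Hypothetical mapping from POS tags to one-letter codes (one code per token).
# "ENT" is an assumed tag for a token already recognized as a medical entity.
TAG_CODES = {
    "ENT": "E",
    "VB": "V", "VBD": "V", "VBZ": "V", "VBP": "V", "VBN": "V", "VBG": "V",
    "RB": "R", "IN": "P", "TO": "P",
    "NN": "N", "NNS": "N", "JJ": "J", "DT": "D",
}

# Entity, optional modifier, verb phrase, entity, optional modifier.
# Without the optional groups this reduces to the general "E VP E" pattern.
PATTERN = re.compile(
    r"(?P<e1>E)(?P<m1>P[DJ]*N+)?"
    r"(?P<vp>R*V+R*P?)"
    r"(?P<e2>E)(?P<m2>P[DJ]*N+)?"
)

def extract(tagged):
    """Return the matched pattern components of a tagged sentence, or None.

    `tagged` is a list of (token, tag) pairs. Unknown tags map to "O",
    which never matches any component.
    """
    codes = "".join(TAG_CODES.get(tag, "O") for _, tag in tagged)
    match = PATTERN.search(codes)
    if match is None:
        return None

    def words(group):
        # Code positions equal token positions because each token has one code.
        start, end = match.span(group)
        return " ".join(token for token, _ in tagged[start:end])

    return {
        name: words(name)
        for name in ("e1", "m1", "vp", "e2", "m2")
        if match.group(name)  # skip optional groups that did not participate
    }

sentence = [
    ("aspirin", "ENT"), ("significantly", "RB"), ("reduces", "VBZ"),
    ("stroke", "ENT"), ("in", "IN"), ("elderly", "JJ"), ("patients", "NNS"),
]
print(extract(sentence))
```

Here the trailing "in elderly patients" is captured as the modifier of the second entity, so the relation is more specific than a bare E VP E match. A single regular-expression scan over the tag string is cheap compared with full parsing, which is the property the online search setting needs.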
Most entity pairs co-occur only a few times in the PubMed corpus, and there are often diverse ways to describe the same relation between an entity pair. To address this issue and take advantage of the redundancy of the corpus, we cluster synonymous relation phrases.
Relation vector based clustering:
Following the traditional approach to this task, we run k-means clustering on relation phrases represented by relation vectors. A relation vector is essentially a bag-of-words model in which each term's value is its TF-IDF weight multiplied by its occurrence frequency. In addition, we observe that the meaning of a relation phrase becomes ambiguous without entity type information. For example, “prevent” and “treat” are similar relations for a chemical and a disease, but should be viewed as different for a gene and a chemical. Thus, we add the query entity pair’s type information to the relation vector. Last but not least, we take relation phrase polarity into account so that phrases with a similar semantic sense are clustered together. To make use of the result, we choose the mass center vector (centroid) of each cluster to represent such …
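The clustering step can be sketched in a few lines of standard-library Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the smoothed IDF formula, the one-hot encoding of the entity type pair, the encoding of polarity as +1 or -1, the deterministic farthest-point initialization, and the toy `records` are all choices made for the example.

```python
import math
from collections import Counter

def relation_vectors(records):
    """Build one vector per record (phrase, frequency, type_pair, polarity).

    Term features hold TF-IDF multiplied by the phrase's occurrence frequency;
    the entity type pair is one-hot encoded and polarity is a single feature.
    """
    docs = [phrase.split() for phrase, _, _, _ in records]
    df = Counter(term for doc in docs for term in set(doc))
    n = len(docs)
    type_features = sorted({f"TYPE={t}" for _, _, t, _ in records})
    features = sorted(df) + type_features + ["POLARITY"]
    index = {name: i for i, name in enumerate(features)}
    vectors = []
    for doc, (_, freq, type_pair, polarity) in zip(docs, records):
        vec = [0.0] * len(features)
        for term, count in Counter(doc).items():
            tf = count / len(doc)
            idf = math.log((1 + n) / (1 + df[term])) + 1.0  # smoothed IDF (assumed)
            vec[index[term]] = tf * idf * freq
        vec[index[f"TYPE={type_pair}"]] = 1.0
        vec[index["POLARITY"]] = float(polarity)
        vectors.append(vec)
    return vectors

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iterations=20):
    """Plain k-means; returns (labels, centroids)."""
    # Deterministic farthest-point initialization (an assumption of this sketch).
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    labels = []
    for _ in range(iterations):
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [v for v, label in zip(vectors, labels) if label == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

# Toy data: (relation phrase, occurrence frequency, entity type pair, polarity).
records = [
    ("treats", 3, "chemical-disease", +1),
    ("effectively treats", 2, "chemical-disease", +1),
    ("induces", 3, "chemical-disease", -1),
    ("strongly induces", 2, "chemical-disease", -1),
]
labels, centroids = kmeans(relation_vectors(records), k=2)
print(labels)
```

Each entry of `centroids` is the mass center vector of one cluster and can stand in for all of the phrases assigned to it. Because term weights are scaled by raw frequency, very frequent phrases can dominate the Euclidean distance; whether and how to rescale the type and polarity features is left open here.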
