A domain-specific lemmatization tool for the morphological analysis of biomedical literature. The BioLemmatizer is tailored to the biological domain through the integration of several published lexical resources related to molecular biology. It focuses on the inflectional morphology of English, including the plural forms of nouns, the conjugations of verbs, and the comparative and superlative forms of adjectives and adverbs.
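To make the idea of inflectional lemmatization concrete, here is a minimal Python sketch that combines an exception lexicon with ordered suffix rules. It is not the BioLemmatizer's algorithm or API: the function name `lemmatize`, the lexicon entries, and the rules are all illustrative assumptions.

```python
# Toy inflectional lemmatizer: exception lexicon first, then suffix rules.
# Every entry and rule below is an illustrative assumption, not taken from
# the BioLemmatizer's actual lexical resources.

# Hypothetical exception lexicon keyed by (word, part of speech).
EXCEPTIONS = {
    ("mice", "noun"): "mouse",
    ("nuclei", "noun"): "nucleus",
    ("bound", "verb"): "bind",
    ("better", "adj"): "good",
}

# Ordered (suffix, replacement) rules per part of speech; first match wins.
SUFFIX_RULES = {
    "noun": [("ies", "y"), ("sses", "ss"), ("ss", "ss"),
             ("is", "is"), ("us", "us"), ("s", "")],
    "verb": [("ies", "y"), ("ing", ""), ("ed", ""), ("s", "")],
    "adj": [("iest", "y"), ("ier", "y"), ("est", ""), ("er", "")],
}


def lemmatize(word: str, pos: str) -> str:
    """Return a lemma for `word` given a coarse part of speech."""
    token = word.lower()
    if (token, pos) in EXCEPTIONS:
        return EXCEPTIONS[(token, pos)]
    for suffix, replacement in SUFFIX_RULES.get(pos, []):
        # Require a stem of at least three characters before rewriting.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: len(token) - len(suffix)] + replacement
    return token


print(lemmatize("antibodies", "noun"))  # antibody
print(lemmatize("expressed", "verb"))   # express
```

Pure suffix stripping over-strips many forms (for example, it would turn "activated" into "activat"), which is one reason a tool in this space relies on lexical resources rather than rules alone.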
Provides simplified text to enhance the performance of natural language processing (NLP) systems and text mining (TM) applications. iSimp records the simplified sentences in a corpus file, along with annotations of the simplification constructs found in the original sentence. It uses shallow parsing and recursive transition networks to detect these constructs. The tool detects six types of simplification constructs: coordination, relative clause, apposition, introductory phrase, subordinate clause and parenthetical element.
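As a rough illustration of one of the six construct types, the Python sketch below detects parenthetical elements and returns both the simplified sentence and the removed spans. It uses a plain regular expression, not the shallow parsing and recursive transition networks that iSimp uses, and the function name `strip_parentheticals` is hypothetical.

```python
import re

# Matches one innermost parenthesized span plus any whitespace before it.
PAREN = re.compile(r"\s*\(([^()]*)\)")


def strip_parentheticals(sentence: str):
    """Return (simplified sentence, list of removed parenthetical texts)."""
    removed = []
    simplified = sentence
    # Repeat so that nested parentheses are removed from the inside out.
    while True:
        match = PAREN.search(simplified)
        if match is None:
            break
        removed.append(match.group(1))
        simplified = simplified[: match.start()] + simplified[match.end():]
    return simplified, removed


simplified, removed = strip_parentheticals(
    "The protein p53 (also known as TP53) regulates apoptosis "
    "(programmed cell death)."
)
print(simplified)  # The protein p53 regulates apoptosis.
print(removed)     # ['also known as TP53', 'programmed cell death']
```

Keeping the removed spans alongside the simplified sentence mirrors the idea of annotating the constructs in the original sentence rather than discarding them.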
Aims to extract all types of abbreviations, together with their expansions, from a target paper on the fly. ALICE not only facilitates recognition of an undefined abbreviation in a paper by constructing an abbreviation database or dictionary, but also makes biomedical literature retrieval more accurate.
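The description above does not say how ALICE pairs abbreviations with expansions, so the following Python sketch swaps in a simple first-letter heuristic: it looks for a parenthesized short form and checks whether the initials of the preceding words spell it. The function name `extract_abbreviations` and the pattern are assumptions for illustration only.

```python
import re

# A short form: 2-10 letters or digits in parentheses, starting with a capital.
SHORT_FORM = re.compile(r"\(([A-Z][A-Za-z0-9]{1,9})\)")


def extract_abbreviations(text: str) -> dict:
    """Map each short form to the preceding words whose initials spell it."""
    pairs = {}
    for match in SHORT_FORM.finditer(text):
        short = match.group(1)
        # Words that appear before the opening parenthesis.
        words = re.findall(r"[A-Za-z0-9]+", text[: match.start()])
        candidate = words[-len(short):]
        if len(candidate) < len(short):
            continue
        initials = "".join(word[0] for word in candidate)
        if initials.lower() == short.lower():
            pairs[short] = " ".join(candidate)
    return pairs


text = ("Natural language processing (NLP) and text mining (TM) are common. "
        "The polymerase chain reaction (PCR) was used.")
print(extract_abbreviations(text))
# {'NLP': 'Natural language processing', 'TM': 'text mining',
#  'PCR': 'polymerase chain reaction'}
```

This heuristic misses abbreviations that skip words or use inner letters, which is why it should be read as a starting point rather than a description of any particular tool.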
Existing terminological resources and scientific databases cannot keep up to date with the growth of neologisms. A domain-independent method for term recognition is therefore useful for automatically recognizing terms in documents. The TerMine demonstrator integrates C-Value multiword term extraction and AcroMine acronym recognition.
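The C-value measure mentioned above can be sketched in a few lines of Python. This follows the formula as it is commonly stated: a candidate's score is log2 of its length in words times its frequency, with the frequency reduced by the mean frequency of the longer candidates that contain it. The function names and the toy frequencies are assumptions, and the sketch omits the linguistic filtering and thresholds a full pipeline would apply.

```python
import math


def contains(longer: tuple, shorter: tuple) -> bool:
    """True if `shorter` occurs as a contiguous word sequence in `longer`."""
    n, m = len(longer), len(shorter)
    return n > m and any(longer[i:i + m] == shorter for i in range(n - m + 1))


def c_values(freq: dict) -> dict:
    """Score candidate terms (tuples of words) given their frequencies.

    Not nested:  log2(|a|) * f(a)
    Nested:      log2(|a|) * (f(a) - mean of f(b) over longer terms b
                              that contain a)
    """
    scores = {}
    for term, f in freq.items():
        nesting = [f_b for other, f_b in freq.items() if contains(other, term)]
        adjusted = f - sum(nesting) / len(nesting) if nesting else f
        scores[term] = math.log2(len(term)) * adjusted
    return scores


# Toy frequencies, invented for illustration.
freq = {
    ("adenoid", "cystic", "basal", "cell", "carcinoma"): 2,
    ("basal", "cell", "carcinoma"): 5,
    ("cell", "carcinoma"): 8,
}
for term, score in c_values(freq).items():
    print(" ".join(term), round(score, 3))
```

Because log2(1) is 0, single-word candidates always score zero under this formula, which reflects the measure's focus on multiword terms.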
Identifies Environment Ontology (ENVO) terms in literature. ENVIRONMENTS detects environment-descriptive terms, based on the taxon corpus of the Encyclopedia of Life (EOL), a web resource on biodiversity. The software handles orthographic variation such as plural forms, spacing and hyphenation. It can also be adapted to large text sources other than EOL, and can be used together with the SPECIES tagger to extract species-environment pairs.
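Dictionary-based tagging that tolerates plural forms, spacing and hyphenation can be sketched in Python by normalizing both the dictionary entries and the candidate text spans to a common key. This is not the ENVIRONMENTS implementation: the mini-dictionary, the placeholder identifiers (which are not real ENVO accessions), and the function names are all assumptions.

```python
import re

# Hypothetical mini-dictionary; identifiers are placeholders, not ENVO IDs.
DICTIONARY = {
    "salt marsh": "ID_SALT_MARSH",
    "hot spring": "ID_HOT_SPRING",
    "rainforest": "ID_RAINFOREST",
}


def normalize(text: str) -> str:
    """Lowercase, drop spaces and hyphens, and strip a simple plural ending."""
    key = re.sub(r"[\s-]+", "", text.lower())
    if key.endswith(("shes", "ches", "xes", "sses")):
        return key[:-2]
    if key.endswith("s") and not key.endswith("ss"):
        return key[:-1]
    return key


INDEX = {normalize(term): identifier for term, identifier in DICTIONARY.items()}


def tag(text: str, max_tokens: int = 3) -> list:
    """Return (matched text, identifier) pairs, preferring longer matches."""
    tokens = list(re.finditer(r"[A-Za-z]+", text))
    hits = []
    i = 0
    while i < len(tokens):
        for n in range(min(max_tokens, len(tokens) - i), 0, -1):
            start, end = tokens[i].start(), tokens[i + n - 1].end()
            identifier = INDEX.get(normalize(text[start:end]))
            if identifier:
                hits.append((text[start:end], identifier))
                i += n
                break
        else:
            i += 1  # no match starting at this token
    return hits


print(tag("Isolated from salt-marsh sediment, hot springs and rain forests."))
# [('salt-marsh', 'ID_SALT_MARSH'), ('hot springs', 'ID_HOT_SPRING'),
#  ('rain forests', 'ID_RAINFOREST')]
```

Normalizing to a shared key keeps the dictionary small, at the cost of occasional false matches when unrelated words collapse to the same key.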
Permits users to search, download data from, and otherwise interact with NCBI databases. Rentrez is an R package that provides an interface to the NCBI's 'EUtils' API, allowing users to search databases such as GenBank and PubMed. Researchers can also process the results of those searches and pull data into their R sessions. Some functions allow listing the databases available from the NCBI or the search fields available for a given database.
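Rentrez itself is an R package, so the Python sketch below does not use it. Instead it shows the kind of E-utilities request such a client builds: an `esearch` URL for searching a database, and an `einfo` URL for listing databases or the search fields of one database. The helper names `esearch_url` and `einfo_url` are hypothetical, and the code only constructs URLs without contacting the NCBI.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities service.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def esearch_url(db: str, term: str, retmax: int = 20) -> str:
    """Build a search request URL for one database (for example 'pubmed')."""
    query = urlencode(
        {"db": db, "term": term, "retmax": retmax, "retmode": "json"}
    )
    return f"{EUTILS_BASE}/esearch.fcgi?{query}"


def einfo_url(db: str = "") -> str:
    """Build an info request URL.

    Without `db` the service lists available databases; with `db` it
    describes that database, including its search fields.
    """
    params = {"retmode": "json"}
    if db:
        params["db"] = db
    return f"{EUTILS_BASE}/einfo.fcgi?{urlencode(params)}"


print(esearch_url("pubmed", "lemmatization", retmax=5))
print(einfo_url())
print(einfo_url("pubmed"))

# To actually send a request (network access required), something like:
#   import json, urllib.request
#   with urllib.request.urlopen(esearch_url("pubmed", "lemmatization")) as r:
#       result = json.load(r)
```

Separating URL construction from the network call makes the request logic easy to inspect and test offline.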
Assists users in making decisions and provides access to accurate and authoritative global regulations. Vistaar is a regulatory artificial intelligence platform that extracts regulations, changes in guidance, new legislation, and changes in regulatory policy and procedures from several health authority sources. It also classifies and clusters regulatory updates into applicable categories (clinical, marketing, lifecycle, manufacturing, labeling, safety and key areas) based on its built-in proprietary regulatory insights.