Jump to content

Apache cTAKES

From Wikipedia, the free encyclopedia
(Redirected fromCTAKES)
Apache cTAKES
Developer(s)Apache Software Foundation
Stable release
5.1.0 / May 16, 2024;3 months ago(2024-05-16)
RepositorycTakes Repository
Written inJava,Scala,Python
Operating systemCross-platform
TypeNatural language processing,Bioinformatics,Text mining,Information Extraction
LicenseApache License 2.0
WebsiteOfficial websiteEdit this at Wikidata

Apache cTAKES: clinical Text Analysis and Knowledge Extraction Systemis an open-sourceNatural Language Processing(NLP) system that extracts clinical information fromelectronic health recordunstructured text.It processes clinical notes, identifying types of clinical named entities — drugs, diseases/disorders, signs/symptoms, anatomical sites and procedures. Each named entity has attributes for the text span, the ontology mapping code, context (family history of, current, unrelated to patient), and negated/not negated.[1]

cTAKES was built using theUIMA Unstructured Information Management Architecture frameworkandOpenNLPnatural language processing toolkit.[2][3]

Components

[edit]

Components of cTAKES are specifically trained for the clinical domain, and create rich linguistic and semantic annotations that can be utilized by clinical decision support systems and clinical research.[4]

These components include:

  • Named Section identifier
  • Sentence boundary detector
  • Rule-based tokenizer
  • Formatted list identifier
  • Normalizer
  • Context dependent tokenizer
  • Part-of-speech tagger
  • Phrasal chunker
  • Dictionary lookup annotator
  • Context annotator
  • Negation detector
  • Uncertainty detector
  • Subject detector
  • Dependency parser
  • patient smoking status identifier
  • Drug mention annotator

History

[edit]

Development of cTAKES began at theMayo Clinicin 2006. The development team, led by Dr. Guergana Savova and Dr.Christopher Chute,included physicians, computer scientists and software engineers. After its deployment, cTAKES became an integral part of Mayo's clinical data management infrastructure, processing more than 80 million clinical notes.[5]

When Dr. Savova's moved toBoston Children's Hospitalin early 2010, the core development team grew to include members there. Further external collaborations include:[5]

Such collaborations have extended cTAKES' capabilities into other areas such as Temporal Reasoning, Clinical Question Answering, and coreference resolution for the clinical domain.[5]

In 2010, cTAKES was adopted by thei2b2program and is a central component of theSHARP Area 4.[5]

In 2013, cTAKES released their first release as anApache Software Foundationincubator project:cTAKES 3.0.[citation needed]

In March 2013, cTAKES became anApache Software FoundationTop Level Project (TLP).[5]

See also

[edit]

References

[edit]
  1. ^Denecke, Kerstin (2015-08-31)."Tools and Resources for Information Extraction".Health Web Science: Social Media Data for Healthcare.Springer. p.67.ISBN978-3-319-20582-3– via Google Books.
  2. ^Khalifa, Abdulrahman; Meystre, Stéphane (2015-12-01)."Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes".Journal of Biomedical Informatics.Proceedings of the 2014 i2b2/UTHealth Shared-Tasks and Workshop on Challenges in Natural Language Processing for Clinical Data.58(Supplement): S128–S132.doi:10.1016/j.jbi.2015.08.002.PMC4983192.PMID26318122.
  3. ^Khudairi, Sally (2017-04-25)."The Apache Software Foundation Announces Apache® cTAKES™ v4.0"(Press release). Forest Hill, MD: The Apache Software Foundation. Globe Newswire.Retrieved2017-09-20.
  4. ^Savova, Guergana K; Masanz, James J; Ogren, Philip V; Zheng, Jiaping; Sohn, Sunghwan; Kipper-Schuler, Karin C; Chute, Christopher G (2010)."Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications".Journal of the American Medical Informatics Association.17(5): 507–513.doi:10.1136/jamia.2009.001560.ISSN1067-5027.PMC2995668.PMID20819853.
  5. ^abcde"History".Apache cTAKES™ - clinical Text Analysis Knowledge Extraction System.2015-06-22.Retrieved2018-01-11.
[edit]