An Algebraic Approach for High-level Text Analytics

Xiuwen Zheng, Amarnath Gupta

Text analytics tasks such as word embedding, phrase mining, and topic modeling place increasing demands on existing database management systems and pose new challenges for them. In this paper, we present a novel algebraic approach based on associative arrays. Our data model and algebra bring together relational operators and text operators, which opens up optimization opportunities for hybrid data sources that contain both relational and textual data. We demonstrate the expressive power of the approach for text analytics using several real-world tasks.
