PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and Cellphone.

This work points out the challenges when analyzing polarity within a specific domain and when dealing with user-generated textual resources. Two comprehensively annotated corpora (English and German) constisting of user-generated data were made publicly available as gold standard data sets for experiments and evaluations.

C. Hänig, A. Niekler und C. Wünsch: PACE Corpus: a Multilingual Corpus of Polarity-Annotated Textual Data from the Domains Automotive and Cellphone. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), European Language Resources Association (ELRA), 2014

Resource Efficient Document Search

This patent application describes how we can efficiently search huge document collections in a resource constrained system like a mobile phone or tablet. Certainly, the applied techniques also improve the performance in server-based implementations. (2014)