Publications

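Generating Evidence-Backed Recommendation Sentences for Abstract Requests Using Reviews from a Travel Information Site (abstract)
叶内晨, 根石将人, 林部祐太, 岡崎直観 (Tokyo Institute of Technology) – Annual Meeting of the Association for Natural Language Processing (NLP) 2020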

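Revision of UD Japanese GSD and Annotation of Named Entity Information (abstract)
松田寛, 若狭絢 (NINJAL), 山下華代, 大村舞 (NINJAL), 浅原正幸 (NINJAL) – Annual Meeting of the Association for Natural Language Processing (NLP) 2020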

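Building a Corpus of Entailment Relations Between Natural-Language Sentences with Evidence, for Organizing Knowledge (abstract)
林部祐太 – Annual Meeting of the Association for Natural Language Processing (NLP) 2020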

End-to-end Dialog Systems with Numerical Slot Filling (abstract)
史宏杰 – Annual Meeting of the Association for Natural Language Processing (NLP) 2020

Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization (PDF)
Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan – AAAI 2020

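Getting Started with Japanese Dependency Parsing Using GiNZA (slides)
松田 寛 – Universal Dependencies Symposium 2019, National Institute for Japanese Language and Linguistics (NINJAL)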

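Joint Learning of Usage Disambiguation for Short-Unit-Word Part-of-Speech Tags and Dependency Labeling (abstract)
松田 寛, 大村 舞 (NINJAL), 浅原 正幸 (NINJAL) – Annual Meeting of the Association for Natural Language Processing (NLP) 2019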

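Extracting Positive Facts and Recommendation Targets from Accommodation Reviews (abstract)
林部 祐太 – Annual Meeting of the Association for Natural Language Processing (NLP) 2019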

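Expanding a Sentiment Polarity Dictionary Using Difference Vectors of Antonym Pairs (abstract)
川島 寛乃, 松田 寛, 毛利 研 – Annual Meeting of the Association for Natural Language Processing (NLP) 2019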

Happiness Entailment: Automating Suggestions for Well-Being (PDF)
Sara Evensen, Yoshihiko Suhara, Alon Halevy, Wang-Chiew Tan, Saran Mumick – Affective Computing & Intelligent Interaction (ACII) 2019

Building a Hotel Concierge Bot: an industrial case study (PDF)
Behzad Golshan, George Mihaila, Chen Chen, Jonathan Engel, Alon Halevy, Yoshihiko Suhara, Wang-Chiew Tan, Michael Matuschek (TrustYou) – CAST 2019

Subjective Databases (PDF)
Yuliang Li, Aaron Feng, Jinfeng Li, Saran Mumick, Alon Halevy, Vivian Li, and Wang-Chiew Tan – VLDB 2019

Semantic Cross-lingual Sentence Embedding 
Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan – RepL4NLP@ACL 2019

Open Information Extraction from Question-Answer Pairs (PDF)
Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish – NAACL-HLT 2019

Voyageur: An Experiential Travel Search Engine (PDF)
Sara Evensen, Aaron Feng, Alon Halevy, Jinfeng Li, Vivian Li, Yuliang Li, Huining Liu, George Mihaila, John Morales, Natalie Nuno, Ekaterina Pavlovic, Wang-Chiew Tan, Xiaolan Wang – WWW 2019 (Demonstration)

FrameIt: Ontology Discovery for Noisy User-Generated Text (PDF)
Dan Iter, Alon Y. Halevy, Wang-Chiew Tan – NUT@EMNLP 2018: 173-183

Scalable Semantic Querying of Text (PDF) 
Xiaolan Wang, Aaron Feng, Behzad Golshan, Alon Y. Halevy, George A. Mihaila, Hidekazu Oiwa, Wang-Chiew Tan – PVLDB 11(9): 961-974 (2018)

– arXiv version

Koko: A System for Scalable Semantic Querying of Text (PDF)
Xiaolan Wang, Jiyu Komiya, Yoshihiko Suhara, Aaron Feng, Behzad Golshan, Alon Y. Halevy, Wang-Chiew Tan – PVLDB 11(12): 2018-2021 (2018) (Demonstration)

BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration (PDF)
Chen Chen, Behzad Golshan, Alon Y. Halevy, Wang-Chiew Tan, AnHai Doan – IEEE Data Eng. Bull. 41(2): 10-22 (2018)

HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments (PDF)
Akari Asai, Sara Evensen, Behzad Golshan, Alon Y. Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan, Yinzhan Xu – LREC 2018

– HappyDB corpus: 100,000 crowd-sourced happy moments

– HappyDB Kaggle task

Data Integration: After the Teenage Years (PDF)
Alon Halevy, Wang-Chiew Tan, George Mihaila, Behzad Golshan – SIGMOD/PODS Conference 2017

CoFE: A Collaborative Feature Engineering Framework for Data Science
Yoshihiko Suhara (MIT Media Lab), Hideki Awashima (Recruit Institute of Technology), Hidekazu Oiwa (Recruit Institute of Technology), Alex Pentland (MIT Media Lab) – HCOMP 2016

Managing Google’s data lake: an overview of the Goods system (PDF)
Alon Halevy, Flip Korn, Natalya F. Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang – IEEE Data Eng. Bull. 39(3) (work done at Google)

Goods: Organizing Google’s Datasets (PDF)
Alon Halevy, Flip Korn, Natalya F. Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang – SIGMOD Conference 2016 (work done at Google)

Discovering Structure in the Universe of Attribute Names (PDF)
Alon Halevy, Flip Korn, Natalya F. Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang – WWW 2016: 939-949 (work done at Google)