supported by the National Natural Science Foundation of China(No.61001178,No.61172053,No.61202266);National Soft Science Research Program(No.2010GXQ5D317);Beijing Natural Science Foundation(No.4102012,No.4112009);Scientific Research Common Program of Beijing Municipal Commission of Education(No.KM201210005024);the National High Technology Research and Development Program of China(863 Program)(No.2012AA011706)
With the rapid development of information technology, short texts arising from socialized human interaction are gradually predominant in network information streams. Accelerating demands are requiring the industry to ...
Project(60763001) supported by the National Natural Science Foundation of China;Project(2010GZS0072) supported by the Natural Science Foundation of Jiangxi Province,China;Project(GJJ12271) supported by the Science and Technology Foundation of Provincial Education Department of Jiangxi Province,China
Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with g...