Authors: Chao Cheng (巢成), Pu Feifan (蒲非凡), Xu Jianqiu (许建秋) [1], Gao Yunjun (高云君) [2]
Affiliations: [1] College of Computer Science and Technology / College of Artificial Intelligence / College of Software, Nanjing University of Aeronautics and Astronautics, Nanjing 211100; [2] College of Computer Science and Technology, Zhejiang University, Hangzhou 310058
Source: Journal of Computer Research and Development (《计算机研究与发展》), 2024, No. 7, pp. 1771-1790 (20 pages)
Funding: National Natural Science Foundation of China (U23A20296).
Abstract: With the rapid development of new information technologies, society is at a critical stage of digital and information transformation, and demand across industries for information systems built on database technology is increasingly prominent. Location-based services rely on massive volumes of trajectory data generated in real time. When processing hundreds of millions of trajectory records that change continuously over time, dimensionality reduction algorithms and query techniques have long been key research topics: reducing the scale of trajectory data shortens the time spent processing data during query operations and thereby improves query performance, and whether high-quality, efficient queries can be achieved is critical for a database. This paper proposes a uniform grid code (UGC) for trajectory data and, after further optimization, a non-uniform grid dimensionality reduction algorithm (NGDR). The approach converts the coordinates of trajectory data into one-dimensional strings for storage, merges grids that do not meet the requirements, preserves the complex interrelationships among trajectory data through spatial position mapping, and evaluates the reduced data with range queries and nearest neighbor queries. Experiments use real trajectory data from different cities together with synthetically generated trajectory data, and compare the proposed uniform grid and non-uniform grid algorithms with three baseline methods. The results show that, after NGDR, the spatial position relationship similarity of the data reaches up to 82.50%, range query time improves by at least 73.86% relative to the other query times, and nearest neighbor query time improves by at least 52.26%, outperforming the baseline methods.
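Keywords: trajectory data; dimensionality reduction algorithm; non-uniform grid; spatial position relationship; query technology
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]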
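Illustration: the abstract describes converting trajectory coordinates into one-dimensional strings over a grid and then running range queries on the reduced data. The Python sketch below shows one simple way such an encoding could look. It is not the paper's UGC or NGDR scheme, whose details are not given in this record. The row-major cell numbering, the fixed-width decimal codes, the "-" separator, the collapsing of consecutive duplicate cells, and all function names (`cell_index`, `encode_trajectory`, `hits_range`) are assumptions made for illustration. The non-uniform merging step is omitted because the merge criterion is not stated here.

```python
from typing import List, Set, Tuple

Point = Tuple[float, float]                # (x, y), e.g. longitude and latitude
BBox = Tuple[float, float, float, float]   # (min_x, min_y, max_x, max_y)


def cell_index(p: Point, bbox: BBox, n: int) -> Tuple[int, int]:
    """Return the (row, col) of the uniform n x n grid cell containing p."""
    min_x, min_y, max_x, max_y = bbox
    col = int((p[0] - min_x) / (max_x - min_x) * n)
    row = int((p[1] - min_y) / (max_y - min_y) * n)
    # Clamp so points on the bounding-box edge stay inside the grid.
    return max(0, min(row, n - 1)), max(0, min(col, n - 1))


def cell_code(row: int, col: int, n: int) -> str:
    """Hypothetical code format: zero-padded row-major cell number."""
    width = len(str(n * n - 1))
    return str(row * n + col).zfill(width)


def encode_trajectory(points: List[Point], bbox: BBox, n: int) -> str:
    """Encode a 2-D trajectory as a 1-D string of cell codes."""
    codes: List[str] = []
    for p in points:
        code = cell_code(*cell_index(p, bbox, n), n)
        if not codes or codes[-1] != code:  # drop consecutive repeats
            codes.append(code)
    return "-".join(codes)


def hits_range(encoded: str, query: BBox, bbox: BBox, n: int) -> bool:
    """Cell-level range filter: does the trajectory visit any cell that
    overlaps the query rectangle? (Approximate; no exact refinement step.)"""
    r0, c0 = cell_index((query[0], query[1]), bbox, n)
    r1, c1 = cell_index((query[2], query[3]), bbox, n)
    wanted: Set[str] = {
        cell_code(r, c, n)
        for r in range(r0, r1 + 1)
        for c in range(c0, c1 + 1)
    }
    return any(code in wanted for code in encoded.split("-"))


if __name__ == "__main__":
    area = (0.0, 0.0, 10.0, 10.0)
    traj = [(1.0, 1.0), (1.5, 2.0), (3.0, 1.0), (9.0, 9.0), (10.0, 10.0)]
    s = encode_trajectory(traj, area, 4)
    print(s)                                           # 00-01-15
    print(hits_range(s, (2.6, 0.0, 4.9, 2.4), area, 4))  # True
    print(hits_range(s, (5.1, 0.0, 7.4, 2.4), area, 4))  # False
```

In this sketch the query compares short strings instead of coordinates, which is the general idea behind grid-based reduction. A coarser grid gives shorter strings but a less precise filter.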