多源异构的网络感知信息的元数据组织方法研究

摘要
随着网络感知能力的增强,网络中产生了海量的、多源异构的网络感知信息,各种感知信息将给网络创新和应用创新带来强大的推动力,如何高效地组织管理已获取的感知信息,以便提高感知信息的有效利用成了必须要解决的问题。
数据虚拟化针对多源异构的数据集,基于元数据实现对数据的统一访问与集成,因此也适用于多源异构的网络感知信息的集成管理。目前数据虚拟化采用传统的树型结构对元数据组织,未考虑数据源之间同类网络感知信息的元数据关系。随着网络感知信息的不断增长,元数据树的高度也不断增长,查询效率会明显将低。本文在数据虚拟化基础上研究一种元数据分簇组织方法。论文主要研究工作及贡献如下:首先,针对网络感知信息的多源异构性问题,研究设计了感知信息的统一描述方式。用XML语言设计网络感知信息的标签格式与结构,标签<information></ information>之间表示一条感知信息,共包括感知信息的类别名称、来源、存储位置、具体描述及相关参数五个部分。
其次,针对数据虚拟化系统中元数据组织所采用的树结构存在的查询效率低的问题,考虑到对同类数据的元数据可以进行统一的组织,研究提出了网络感知信息的元数据分簇组织方法,能够有效缩短树结构的高度,提高查询效率。在元数据分簇组织结构中,每个簇对应一个网络感知信息类。为了有效获取源
数据集中的分类簇及数目,本文采用经典的层次聚类算法对不同源中不同类的感知信息经过封装形成的XML文档进行聚类,然后将聚类生成的簇上传到数据虚拟化系统,利用系统自动提取数据源的元数据的优势,实现相应网络感知信息类对应元数据的分簇组织。
大容量数据存储最后,针对从WAP网关获取的移动互联网感知信息数据源,设计构建了一个数据虚拟化原型系统,能够实现对多源异构数据的统一访问和集成。另外,重点对本文提出的网络感知信息的元数据分簇组织方法进行性能测试,与数据虚拟化系统目前采用的传统的树型结构进行对比,测试结果表明本文提出的方法在保证查全率的同时,查准率平均提高了10.3%,查询效率也有了一定的提高。
关键词:网络感知信息,数据虚拟化,元数据组织,信息查询
Abstract
With the increasing use of network awareness,there is a massive and multi-source heterogeneous network-aware information,the perceived data resources and decentralized data sources are also more and more,they will bring a strong impetus to network innovation and application innovation,How to efficiently organize and manage the acquirednetwork-aware information to improve the effective use has become the problems to be solved.
For multi-source and heterogeneous data sets,data virtualization has achieved a unified data accessand integration based on metadata,therefore,It is also applicable to the integrated management of multi-source heterogeneous network-aware information.At present,Data virtualization system organizes the metadata using the traditional tree structure,it does not consider the relationships of metadata among different data sources.With the continuous increasing of network-aware information,the height of the metadata tree will continue to grow,and the query efficiency will be getting lower obviously.Based on data virtualization,this paper studies a metadata clustering organization method.The main work and contribution of this paper are listed below: Firstly,aiming to solve the problem of heterogeneity from multi-source network-aware information,a unified description method of network-aware information is studied and designed.The label format and structure of network-aware information is designed by the XML language,and the label<information></information>represents a network-aware information,which includes the five parts of its category names,sources, storage locations,specific descriptions and related parameters.
Secondly,the query efficiency of the tree structure used for the metadata organization in the current data virtualization system is low,considering that the metadata belonging to same categorycan be organized uniformly,a method of metadata clustering organization of network-aware information is pr
oposed,which can effectively shorten the height of tree structure and improve the query efficiency.In the metadata clustering organization,each cluster corresponds to a network-aware information class.In order to effectively obtain the cluster and its number of the data sources,In this paper,the classical hierarchical clustering algorithm is used to cluster the XML documents of different types of network-aware informationfrom different sources.Then,the clustering results are
uploaded to the data virtualization system whichcan automatically extract the metadata of the data source to realize the clustering organization of the metadata based on network-aware information class.
Finally,for the mobile Internet-aware information obtained from the WAP gateway, a data virtualization prototype system is designed and built,and it could achieve the unified access and integration of multi-source and heterogeneous data.In addition,the performance test of the metadata clustering organization method of network-aware information proposed in this paper is carried out emphatically and compared with the traditional tree structure currently used in the data virtualization system.The results show that the method proposed in this paper has an average rate of precision inc
reased by 10.3%and also has a certain improvement in the efficiency of the query while ensures the recall ratio of the query.
Keywords:network-aware information,data virtualization,metadata organiza tion,information query
目录
图录.................................................................................................................................VI 表录..............................................................................................................................VIII 第1章绪论 (1)
1.1选题背景及研究意义 (1)
1.2研究现状 (3)
1.2.1网络感知信息的研究现状 (3)
1.2.2元数据组织方法的研究现状 (3)
1.3论文主要工作 (5)
1.4论文的组织结构 (6)
第2章网络感知信息及数据虚拟化系统分析 (8)
2.1网络感知信息分析 (8)
2.2数据虚拟化系统概述及架构分析 (9)
2.2.1数据虚拟化系统概述 (9)
2.2.2数据虚拟化系统架构分析 (10)
2.3元数据在数据虚拟化系统中的作用及对查询效率的影响 (12)
2.3.1元数据在数据虚拟化系统中的作用 (12)
2.3.2元数据组织对查询效率的影响 (13)
2.4树型元数据组织方法存在的问题 (15)
2.5总体解决思路 (17)
2.6本章小结 (17)
第3章网络感知信息的统一描述 (19)
3.1问题分析 (19)
3.2解决思路 (20)
3.3相关技术分析 (21)
3.4网络感知信息统一描述方式的设计 (22)
3.5本章小结 (23)
第4章网络感知信息的元数据分簇组织方法 (24)
4.1问题分析 (24)
天津百货大楼肺炎4.2解决思路 (24)
4.3网络感知信息的XML文档聚类 (25)
4.3.1基于层次的XML文档聚类算法 (25)
4.3.2聚类的设计与实现 (27)
4.3.3聚类算法复杂度分析 (30)
4.4元数据分簇组织方法 (30)
家园通信息平台4.5元数据分簇组织方法的分析 (32)
4.6本章小结 (33)
第5章实验测试与结果分析 (34)
5.1测试工具及环境分析 (34)
5.1.1Eclipse与MySQL介绍 (34)
5.1.2Teiid分析 (35)
5.1.3Jboss eap分析 (35)
5.1.4Modeshape分析 (36)
5.2测试思路 (37)
1999年虎门大桥事故
5.2.1整体思路 (37)
5.2.2性能测试指标分析 (37)
5.3测试及结果分析 (38)
5.3.1测试数据分析 (39)
5.3.2测试数据处理 (40)
5.3.3功能测试 (42)
5.3.4性能测试 (48)
5.4本章小结 (54)
协查通报格式第6章总结与未来的工作 (55)
6.1论文工作总结 (55)
6.2未来的工作 (56)
参考文献 (58)
致谢 (62)
立体裁剪攻读硕士学位期间从事的科研工作及取得的研究成果 (63)

本文发布于:2024-09-20 23:36:47,感谢您对本站的认可!

本文链接:https://www.17tex.com/xueshu/12510.html

版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系,我们将在24小时内删除。

标签:数据   信息   感知   网络
留言与评论(共有 0 条评论)
   
验证码:
Copyright ©2019-2024 Comsenz Inc.Powered by © 易纺专利技术学习网 豫ICP备2022007602号 豫公网安备41160202000603 站长QQ:729038198 关于我们 投诉建议