We use cookies to improve your experience with our site.
李振华, 刘刚, 嵇智源, . 基于腾讯大数据的高成本效益的云下载系统设计[J]. 计算机科学技术学报, 2015, 30(6): 1163-1174. DOI: 10.1007/s11390-015-1591-5
引用本文: 李振华, 刘刚, 嵇智源, . 基于腾讯大数据的高成本效益的云下载系统设计[J]. 计算机科学技术学报, 2015, 30(6): 1163-1174. DOI: 10.1007/s11390-015-1591-5
Zhen-Hua Li, Gang Liu, Zhi-Yuan Ji, Roger Zimmermann. Towards Cost-Effective Cloud Downloading with Tencent Big Data[J]. Journal of Computer Science and Technology, 2015, 30(6): 1163-1174. DOI: 10.1007/s11390-015-1591-5
Citation: Zhen-Hua Li, Gang Liu, Zhi-Yuan Ji, Roger Zimmermann. Towards Cost-Effective Cloud Downloading with Tencent Big Data[J]. Journal of Computer Science and Technology, 2015, 30(6): 1163-1174. DOI: 10.1007/s11390-015-1591-5

基于腾讯大数据的高成本效益的云下载系统设计

Towards Cost-Effective Cloud Downloading with Tencent Big Data

  • 摘要: 我们于2011年提出的“云下载”模式已经有效优化了数亿用户的下载体验。此外, 人们开始基于云下载构建各种各样的有用的互联网服务。简单地说, 通过使用云计算基础设施、代替用户从“仅提供最大努力服务”的互联网中下载(并缓存)文件, 云下载保证了数据可用性、并且显著提升了数据传输速度。虽然看上去简单、直接, 但设计一个真实世界的、服务大量用户的云下载系统涉及到复杂而微妙的(部署开销和用户体验之间的)权衡, 比如: (1)如何规划云缓存容量, 以获得较高并且可支付的缓存命中率, (2)如何加速云端到用户的数据传输, (3)如何处理用户对于热门文件的密集请求, (4)如何判断云端潜在的下载失败情形。本文基于腾讯QQ旋风离线下载系统的大数据、从实践的角度探索了这些下载权衡。系统运行日志帮助我们寻找到合理的设计权衡和参数, 而系统真实性能则验证了设计的功效。我们的研究为相似、相关系统的开发者提供了坚实的经验和有价值的启发。

     

    Abstract: The cloud downloading scheme, first proposed by us in 2011, has effectively optimized hundreds of millions of users' downloading experiences. Also, people start to build a variety of useful Internet services on top of cloud downloading. In brief, by using cloud facilities to download (and cache) the requested file from the “best-effort” Internet on behalf of the user, cloud downloading ensures the data availability and remarkably enhances the data delivery speed. Although this scheme seems simple and straightforward, designing a real-world cloud downloading system involves complicated and subtle trade-offs (between deployment cost and user experience) when serving a large number of users: 1) how to plan the cloud cache capacity to achieve a high and affordable cache hit ratio, 2) how to accelerate the data delivery from the cloud to numerous users, 3) how to handle the dense user requests for highly popular files, and 4) how to judge a potential downloading failure of the cloud. This paper addresses these design trade-offs from a practical perspective, based on big data from a nationwide commercial cloud downloading system, i.e., Tencent QQXuanfeng. Its running traces help us find reasonable design strategies and parameters, and its real performances confirm the efficacy of our design. Our study provides solid experiences and valuable heuristics for the developers of similar and relevant systems.

     

/

返回文章
返回