We use cookies to improve your experience with our site.

Harmonia:数据中心中使ECN和信用预约流量收敛的拥塞控制

Harmonia: Explicit Congestion Notification and Credit-Reservation Transport Converged Congestion Control in Datacenters

  • 摘要: 在数据中心网络(DCNs)中,突发流量和并发流量会不可避免地造成网络拥塞,进而影响整体性能。为了缓解网络拥塞,各种传输协议被开发出来,包括反应型协议和主动型协议。反应型协议使用不同的拥塞信号,如显式拥塞通知(ECN)和往返时间(RTT),来在网络拥塞发生后处理拥塞。然而,随着数据中心规模和链路速度的增长,反应型协议面临着对拥塞响应缓慢的严重问题。相反,主动型协议(如信用预约协议)设计的出发点就是为了在拥塞发生之前就规避拥塞,它们具有零丢包、收敛速度快、缓冲区占用率低等优点。但是信用预约协议在目前的DCNs (如微软、亚马逊)中并没有广泛的部署,它们主要部署的是基于ECN的协议,如DCTCP、DCQCN。在实际的部署场景中,很难保证一种协议一次性就部署到全部服务器上。因此在信用预约协议被逐步部署到DCNs里时,网络会进入一种多协议共存的状态,并将面临以下基本挑战:(i)带宽分配不公平,(ii)缓冲区占用率高,(iii)高尾延迟。因此,我们提出了Harmonia,旨在以最少的修改代价将基于ECN的协议和信用预留协议收敛到公平。据我们所知,Harmonia是第一个解决与被动拥塞控制共存时的协调性问题的主动型协议。针对那些使用广泛的基于ECN的协议,DCTCP和DCQCN,Harmonia利用ECN和RTT来提供实时拥塞信息,并重新定义了拥塞控制机制。实验结果表明,该算法有效地解决了链路分配不公平的问题,消除了超时现象并解决了缓冲区溢出的问题。

     

    Abstract: Bursty traffic and thousands of concurrent flows incur inevitable network congestion in datacenter networks (DCNs) and then affect the overall performance. Various transport protocols are developed to mitigate the network congestion, including reactive and proactive protocols. Reactive schemes use different congestion signals, such as explicit congestion notification (ECN) and round trip time (RTT), to handle the network congestion after congestion arises. However, with the growth of scale and link speed in datacenters, reactive schemes encounter a significant problem of slow responding to congestion. On the contrary, proactive protocols (e.g., credit-reservation protocols) are designed to avoid congestion before it occurs, and they have the advantages of zero data loss, fast convergence and low buffer occupancy. But credit-reservation protocols have not been widely deployed in current DCNs (e.g., Microsoft, Amazon), which mainly deploy ECN-based protocols, such as data center transport control protocol (DCTCP) and data center quantized congestion notification (DCQCN). And in an actual deployment scenario, it is hard to guarantee one protocol to be deployed in every server at one time. When credit-reservation protocol is deployed to DCNs step by step, the network will be converted to multi-protocol state and will face the following fundamental challenges:1) unfairness, 2) high buffer occupancy, and 3) heavy tail latency. Therefore, we propose Harmonia, aiming for converging ECN-based and credit-reservation protocols to fairness with minimal modification. To the best of our knowledge, Harmonia is the first to address the trouble of harmonizing proactive and reactive congestion control. Targeting the common ECN-based protocols-DCTCP and DCQCN, Harmonia leverages forward ECN and RTT to deliver real-time congestion information and redefines feedback control. After the evaluation, the results show that Harmonia effectively solves the unfair link allocation, eliminating the timeouts and addressing the buffer overflow.

     

/

返回文章
返回