14 posts / 0 new
最后一篇
Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
连接稳定性问题

Hi Dialog,

I am developing a pair of BLE device, both server and slave, base on DA14580. After connected to each other, even I leave these processor to be idle(no BLE read/write/notify, no i2c, gpio... active ) but they still disconnect randomly. It looks like one of them corrupts in link layer, because after disconnect they could not reconnect again. Below are the parameters I set for this connection and some wireshark captures.

msg-> nb_peers = 1;
memcpy((void *) &msg->peers[0].addr, (void *)peer_addr, BD_ADDR_LEN);
msg->peers[0].addr_type = peer_addr_type;
msg->con_intv_max = 10;
msg-> con_intv_min = 10;
msg->ce_len_max = 0x20;
msg->ce_len_min = 0x20;
msg->con_latency = 0;
msg-> superv_to = 100;
msg-> scan_interval = 0x180;
msg-> scan_window = 0x160;
msg-> op.code = gapm_connection_direct;

wireshark capture:

9796 61.273854000 Master Slave BLE Data 26 Empty Data PDU
9797 61.275295000 Slave Master BLE Data 26 Empty Data PDU
9798 61.286603000 Master Slave BLE Data 26 Empty Data PDU
9799 61.287569000从站主机BLE数据26空数据PDU
9800 61.298855000主从电池数据26空数据PDU
9801 61.299789000 Slave Master BLE Data 26 Empty Data PDU
9802年62.320227000 ab_mat5 BLE ADV 48 ADV_IND
9803 62.321788000 ab_mat5 BLE ADV 48 ADV_IND

Server and client keep the connection for 60s and could not connect to each other anymore.

I could still see slave side advertising, but even use an Iphone with LightBlue application, I could not connect to the slave anymore.
Wireshark capture:
25 6.273182000 ab_mat4 BLE ADV 48 ADV_IND
26 6.274736000 50:0b:f8:18:ac:29 80:ea:ca:00:00:41 BLE ADV 60 CONNECT_REQ
27 6.276461000 Master Slave BLE Data 26 Empty Data PDU
28 6.303327000主从机BLE数据26空数据PDU
29 6.333342000 Master Slave BLE Data 26 Empty Data PDU
30 6.363329000主从机BLE数据26空数据PDU
31 6.393384000 Master Slave BLE Data 26 Empty Data PDU
32 6.423327000主从机BLE数据26空数据PDU
33 6.453367000主从机BLE数据26空数据PDU
34 6.483385000 Master Slave BLE Data 26 Empty Data PDU
35 6.513421000 Master Slave BLE Data 26 Empty Data PDU
36 6.543378000 Master Slave BLE Data 26 Empty Data PDU
37 6.573448000 Master Slave BLE Data 26 Empty Data PDU
38 6.603361000 Master Slave BLE Data 26 Empty Data PDU
39 6.633361000 Master Slave BLE Data 26 Empty Data PDU
40 7.640482000 AB_MAT4 BLE ADV 48 ADV_IND
41 7.642505000 ab_mat4 BLE ADV 48 ADV_IND

In this test, I set the connection interval to be 12.5 ms. In this case, sever and client will disconnect randomly after 60 s to 45 mins they connect.

在后一种测试中,我将连接间隔更改为30ms。他们可以保持连接更长(超过10小时),但仍然脱离了。

I would assume if I leave the processor in idle state, everything should handle by BLE stack automatically and sending empty to keep the connection should have nothing to do with my application code. Also, I not have any application activity at the time they disconnect.

请帮我找到断开连接和如何提高连接稳定性的原因。

非常感谢。

设备:
Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
Some update,

Some update,

From my overnight test, it looks like the ble timer is not working appropriately.
首先,它将OKEY工作约3小时(可能更多)。我们可以看到主机将空的PDU发送到从每30ms设置为从Alave作为我设置的值。
726216 10926.699285000主从池数据26空数据PDU
726217 10926.699861000从站主板数据26空数据PDU
726218 10926.729275000主从机BLE数据26空数据PDU
726219 10926.729815000 Slave Master BLE Data 26 Empty Data PDU
726220 10926.759274000 Master Slave BLE Data 26 Empty Data PDU
726221 10926.760253000 Slave Master BLE Data 26 Empty Data PDU
726222 10926.789250000 Master Slave BLE Data 26 Empty Data PDU
726223 10926.790219000从机主BLE数据26空数据PDU
726224 10926.818524000 Master Slave BLE Data 26 Empty Data PDU
我猜嗅探器丢失了一个捕获。
726225 10926.848719000 Master Slave BLE Data 26 Empty Data PDU
726226 10926.850223000从机主BLE数据26空数据PDU
出于某种原因,我丢失了大约10个小时的捕获。幸运的是,就在他们断开之前,嗅探器再次获得信号。但是此时,主侧似乎表现出疯狂,它每次约2ms向客户端发送一个空的PDU,或者一旦收到来自从机的空的PDU,就会向客户端发送。
726227 47816.865216000主从机BLE数据26空数据PDU
726228 47816.866244000从站主板数据26空数据PDU
726229 47816.867143000 Master Slave BLE Data 26 Empty Data PDU
726230 47816.868015000 Slave Master BLE Data 26 Empty Data PDU
726231 47816.868887000 Master Slave BLE Data 26 Empty Data PDU
726232 47816.869742000 Slave Master BLE Data 26 Empty Data PDU
726233 47816.8706106070主从电池数据26空数据PDU
726234 47816.871492000 Slave Master BLE Data 26 Empty Data PDU
.......
726560 47817.196839000主从机BLE数据26空数据PDU
726561 47817197000从机主BLE数据26空数据PDU
726562 47817.19878000主从机BLE数据26空数据PDU
726563 47817.199657000 Slave Master BLE Data 26 Empty Data PDU
726564年47817.201497000 ab_mat5 BLE副词48 ADV_IND
726565 47817.202433000 AB_MAT5 BLE ADV 48 ADV_IND

I hope these information give you guys more sense about this problem, thanks a lot.

谢谢!

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
Hi zwang308,

Hi zwang308,

Which SDK you are using in your current implementation and what is the development kit you have ? As far as i can tell the connection parameters you are using are the almost same with the DSPS project. Your assumptions are correct, when you are connected with a device the stack will keep the connection alive by exchanging empty packets without demanding from the application to do anything.

从第一个日志中,我可以看到您的主站停止发送空数据包,以便保持连接,因此对从侧没有响应,发生超时,并且当您应该检查主机时,连接丢失。

从第2个日志似乎从主机响应从主机的连接请求,检查如何在外围代码中处理连接请求,并检查设备是否达到连接确认。

About the 3rd log, this seems very odd, i cant imagine why the central would start sending empty packets in such a high rate especially if has no data to send, the central itself or the device.

谢谢mt_dialog.

Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
我正在使用SDK 3.0.10.1。

我正在使用SDK 3.0.10.1。

问题是连接随机打破。

For 12ms connection interval, sometimes the connection goes well for more than an hour; but sometimes they just disconnect after 2 minutes.

After I change the connection interval to 30ms, the average connected time gets much longer. But they still disconnect randomly,

当他们断开连接时,实际上没有应用程序活动。我假设BLE堆栈将处理连接,在此期间自动发送/ ACK空PDU。
Also, even they disconnected, they should be able to reconnect. But once they disconnect, I could not connect to the slave anymore, no matter I use my master application or use some standard device e.g. IPhone.

To me, it looks like part of slave's BLE stack is screwed up: slave could receive the connection request from master(it stops advertising) but it could not send an ack. So, the both side just disconnect after 1s. Afert that, slave begin to advertise again.

These are everything I got from my test.

非常感谢您的帮助!

Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
By the way, I use the same

By the way, I use the same code base for all tests. I am sure the slave side send the connect request confirmation, otherwise they would not connect. Again, the problem is the connect-----keep the connection for a random time---disconnect and never able to reconnect anymore.

Thanks,

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
Hi zwang308,

Hi zwang308,

I VE在SDK 3.0.10.1上实现了一个中心和外围设备(包括配置文件)I VE将连接参数设置为与您的主机相同的主机,其中连接间隔等于Pro和基本套件之间的12.5ms。到目前为止,一切似乎都是正确的,并且只要我能观看(超过两个小时),即可持续的连接也能够重新连接到主机,(尚未扩展睡眠模式)。您是否实施了任何配置文件或任何其他功能,以便尝试和重现您的问题?

谢谢mt_dialog.

Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
嗨mt_dialog,

嗨mt_dialog,

What I did is pretty much the same as your DSPS project.

I develop an I2C driver and put data into nvram when I receive a package. In app_asynch_trim function, I pull the buffer and send data through BLE.

But in my test, I disable the I2C driver and the pull function in app_asynch_trim. In the test, the two DA14580s are really doing nothing when they disconnect.

From my observation, they keep sending empty PDU for a while and suddenly the master stop to send empty PDU. Do we have method to know why master stop sending empty PDU? Could we get a message from the kernel to know why they disconnect.

To me, it seems like they disconnect randomly. We did lots of test last week, sometimes the connection is good for more than 16 hour; sometimes they disconnect after 2 minutes.

It really confused me.

非常感谢!

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
Hi zwang308,

Hi zwang308,

Can you run your master form keil and check when this issue occurs, what is the state of your master ? Also are your handlers indicate that there is a disconnection either by the master or at least by your slave when he stops receiving data by your host. You can place a break point to your device and check what you get as a disconnection reason. There is no way to check why the master doesn't send any data, an error indication or something like that since this should not happen the stack should always keep the connection.

谢谢mt_dialog.

Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
Hi MT_Dialog,

Hi MT_Dialog,

For the first time they disconnect. The error code on peripheral is 0x13(REMOTE USER TERMINATED CONNECTION) while the error code on central side is 0x08(CONNECTION TIMEOUT).

之后,外设侧的错误代码始终为0x08,中心侧为0x3e(连接无法建立)。

我认为代码实际上停止在某种程度上,因为这两个方面都在广告/扫描和连接状态之间保持切换。

Do we have any idea why that happens?

Do we have a method to soft reset the chip(run firmware from the beginning without reload the firmware or just reset BLE stack)?

非常感谢。

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
Hi zwang308,

Hi zwang308,

We haven't observed anything similar in tests we performed (i.e. the central to just stop sending packets) although we managed to track a cornercase where the peripheral isn't able to connect under specific circumstances, the patch for the specific cornercase isn't yet released. For this kind of situations we propose either to perform a sw reset to the 580 or reset the stack.

您可以尝试使用sys_ctrl_reg的sw_reset字段发出重置,并将启动地址设置为sysram(请检查数据表)虽然它不是一个好主意,并优先从原始图像重新启动(下载代码580使用引导加载程序)也可以尝试发送GapM_Reset命令以重置堆栈。

Also is it possible to share some code (central and device) in order to have a look since we couldn't replicate the problem on our side?

谢谢mt_dialog.

Zwang308.
Offline
最后一次露面:4 years 4 months ago
掌握
加入:2014-07-02 14:15
HI MT_dialog,

HI MT_dialog,

您能否向我提供发送代码的电子邮件地址?你可以通过电子邮件亲自到达我。

非常感谢,

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
Hi zwang308,

Hi zwang308,

Problem was due to an obsolete SDK.

谢谢mt_dialog.

孟森
Offline
最后一次露面:3年3天前
加入:2018-01-16 08:09
Hi, MT_dialog.

Hi, MT_dialog.
I am suffering the same error as Mr zwang308's. I am using the sdk version V3.0.8. You said it was due to an obsolete SDK, please tell me which sdk is ok. Thanks.

mt_dialog.
Offline
最后一次露面:2 months 1 week ago
职员
加入:2015-06-08 11:34
嗨menson,

嗨menson,

The latest SDK is the 5.0.4 for the 580 device.

谢谢mt_dialog.