|
@ -2,72 +2,75 @@ |
|
|
|
|
|
|
|
|
```mermaid |
|
|
```mermaid |
|
|
graph LR |
|
|
graph LR |
|
|
subgraph crawler |
|
|
|
|
|
source_1([108shu.com]) |
|
|
subgraph crawler |
|
|
source_2([aidusk.com]) |
|
|
source_1([108shu.com]) |
|
|
source_3([ixsw.la]) |
|
|
source_2([aidusk.com]) |
|
|
source_4([m.wxsy.net]) |
|
|
source_3([ixsw.la]) |
|
|
source_5([wxsy.net]) |
|
|
source_4([m.wxsy.net]) |
|
|
source_6([xswang.com]) |
|
|
source_5([wxsy.net]) |
|
|
source_7([zhihu.com]) |
|
|
source_6([xswang.com]) |
|
|
|
|
|
source_7([zhihu.com]) |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
subgraph sample |
|
|
|
|
|
subgraph raw |
|
|
|
|
|
s1a_raw{{sample_1-a}} |
|
|
|
|
|
s1b_raw{{sample_1-b}} |
|
|
|
|
|
s2a_raw{{sample_2-a}} |
|
|
|
|
|
s2b_raw{{sample_2-b}} |
|
|
|
|
|
s3_raw{{sample_3}} |
|
|
end |
|
|
end |
|
|
|
|
|
|
|
|
subgraph sample |
|
|
subgraph combine |
|
|
subgraph raw |
|
|
s1_combine[sample_1] |
|
|
s1a_raw{{sample_1-a}} |
|
|
s2_combine[sample_2] |
|
|
s1b_raw{{sample_1-b}} |
|
|
s3_combine[sample_3] |
|
|
s2a_raw{{sample_2-a}} |
|
|
|
|
|
s2b_raw{{sample_2-b}} |
|
|
|
|
|
s3_raw{{sample_3}} |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
subgraph combine |
|
|
|
|
|
s1_combine[sample_1] |
|
|
|
|
|
s2_combine[sample_2] |
|
|
|
|
|
s3_combine[sample_3] |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
subgraph fixed |
|
|
|
|
|
s1_fixed(sample_1) |
|
|
|
|
|
s2_fixed(sample_2) |
|
|
|
|
|
s3_fixed(sample_3) |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
subgraph replenish |
|
|
|
|
|
sa{{sample_a}} |
|
|
|
|
|
sb{{sample_b}} |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
source_1 ==> s1a_raw |
|
|
|
|
|
source_2 ==> s1b_raw |
|
|
|
|
|
source_3 ==> s1b_raw |
|
|
|
|
|
source_4 ==> s2a_raw |
|
|
|
|
|
source_5 ==> s2a_raw |
|
|
|
|
|
source_6 ==> s2b_raw |
|
|
|
|
|
source_7 ==> s3_raw |
|
|
|
|
|
|
|
|
|
|
|
s1a_raw -- replenish --> s1_combine |
|
|
|
|
|
s1b_raw --> s1_combine |
|
|
|
|
|
s2a_raw -- replenish --> s2_combine |
|
|
|
|
|
s2b_raw -- replenish --> s2_combine |
|
|
|
|
|
s3_raw -- clean up --> s3_combine |
|
|
|
|
|
|
|
|
|
|
|
s1_combine -- fix --> s1_fixed |
|
|
|
|
|
s2_combine -- fix --> s2_fixed |
|
|
|
|
|
s3_combine -- fix --> s3_fixed |
|
|
|
|
|
|
|
|
|
|
|
s1_fixed --> sa |
|
|
|
|
|
s2_fixed -- replenish --> sa |
|
|
|
|
|
s2_fixed -. restore .-> sb |
|
|
|
|
|
s3_fixed -- replenish --> sb |
|
|
|
|
|
end |
|
|
end |
|
|
|
|
|
|
|
|
subgraph release |
|
|
subgraph fixed |
|
|
rc-1([rc-1]) |
|
|
s1_fixed(sample_1) |
|
|
|
|
|
s2_fixed(sample_2) |
|
|
|
|
|
s3_fixed(sample_3) |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
sa --> rc-1 |
|
|
subgraph replenish |
|
|
sb -- fix --> rc-1 |
|
|
sa{{sample_a}} |
|
|
|
|
|
sb{{sample_b}} |
|
|
end |
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
source_1 ==> s1a_raw |
|
|
|
|
|
source_2 ==> s1b_raw |
|
|
|
|
|
source_3 ==> s1b_raw |
|
|
|
|
|
source_4 ==> s2a_raw |
|
|
|
|
|
source_5 ==> s2a_raw |
|
|
|
|
|
source_6 ==> s2b_raw |
|
|
|
|
|
source_7 ==> s3_raw |
|
|
|
|
|
|
|
|
|
|
|
s1a_raw -- replenish --> s1_combine |
|
|
|
|
|
s1b_raw --> s1_combine |
|
|
|
|
|
s2a_raw -- replenish --> s2_combine |
|
|
|
|
|
s2b_raw -- replenish --> s2_combine |
|
|
|
|
|
s3_raw -- clean up --> s3_combine |
|
|
|
|
|
|
|
|
|
|
|
s1_combine -- fix --> s1_fixed |
|
|
|
|
|
s2_combine -- fix --> s2_fixed |
|
|
|
|
|
s3_combine -- fix --> s3_fixed |
|
|
|
|
|
|
|
|
|
|
|
s1_fixed --> sa |
|
|
|
|
|
s2_fixed -- replenish --> sa |
|
|
|
|
|
s2_fixed -. restore .-> sb |
|
|
|
|
|
s3_fixed -- replenish --> sb |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
|
|
|
subgraph release |
|
|
|
|
|
rc-1([RC-1]) |
|
|
|
|
|
rc-n([RC-2/3/...]) |
|
|
|
|
|
sa --> rc-1 |
|
|
|
|
|
sb -- fix --> rc-1 |
|
|
|
|
|
rc-1 ==> rc-n |
|
|
|
|
|
end |
|
|
|
|
|
|
|
|
``` |
|
|
``` |
|
|
|
|
|
|
|
|
## 数据来源 |
|
|
## 数据来源 |
|
@ -117,3 +120,5 @@ |
|
|
+ `RC-1` :初始合并版本 |
|
|
+ `RC-1` :初始合并版本 |
|
|
|
|
|
|
|
|
+ `RC-2` :修复部分屏蔽词与语法错误 |
|
|
+ `RC-2` :修复部分屏蔽词与语法错误 |
|
|
|
|
|
|
|
|
|
|
|
+ `RC-3` :修复部分繁体字错误 |
|
|