IT討論區(57) 大佬呀又唔開post

1001 回覆
2 Like 0 Dislike
2019-02-13 16:46:19
2019-02-13 17:06:22
2019-02-13 17:07:30
2019-02-13 17:09:36
2019-02-13 17:40:15
IT狗思考方法
2019-02-13 17:42:10
一係BA/PM

一係 network

一係做看更
2019-02-13 17:48:24
做看更啦
反正做IT到中年都係做看更
2019-02-13 18:05:18
2019-02-13 18:05:47
打坐好工
2019-02-13 18:11:36
打code 打多幾年都係做看更
不如早啲做看更拎經驗
之後升做高級看更搵多啲
2019-02-13 18:14:04
IFC樓下個D看更要識好多language
2019-02-13 18:47:57
係咪尖吵咀個間
2019-02-13 19:26:20
召喚強手

假設有10000句phase, eg "coffee workshop", "cat owner meeting" 要搵相似既phase做cluster

Step 1: English normalisation, 將meeting, met 變做meet. ..

Step2: for each unique normalized word, get a dictionary of occurrence. "meet":6%, "coffee" 0.5%, workshop 5%....

‌Step3: build a co-exist-probability matrix for the phases.
"Cat ownership talk" vs "cat owner meeting" = prob of "cat" * prob of "own"
..........

Step4: build a graph, each phase is a node, if the co-exist-probability of two node is smaller than x%, build an edge between the two node.

Step 5: use graph algorithm to find all cluster.


會唔會做多左,定係已經有library做緊呢D
2019-02-13 19:31:05
可能已經有人做左🤔你入返你d training data 就okay
搵下有冇open source
2019-02-13 19:51:15
我堆text 好多specific noun , 驚用library太loose

有冇人知build 個matrix 係咪一定要 o(n^2)
2019-02-13 20:09:56
睇你能力
2019-02-13 20:13:41
想做Quant Dev搵真銀

求教路
2019-02-13 20:24:13
buzz word, 入假quant扮真quant
2019-02-13 20:25:34
你同手黏好似就係Quant Dev / Data Engine
吹水台自選台熱 門最 新手機台時事台政事台World體育台娛樂台動漫台Apps台遊戲台影視台講故台健康台感情台家庭台潮流台美容台上班台財經台房屋台飲食台旅遊台學術台校園台汽車台音樂台創意台硬件台電器台攝影台玩具台寵物台軟件台活動台電訊台直播台站務台黑 洞