[JAVA高手入] 幾億個item要點先fit到入2gb heap space?

213 回覆
5 Like 26 Dislike
2022-08-11 22:45:37
個output 係指d咩 從一個50000 x 50000 搵到邊個position 既值係唔等於0?
2022-08-11 22:46:31
樓主試下呢個方法
2022-08-11 22:47:06
2022-08-11 22:49:05
big data project 用返 big data 方法
試下切開啲 data 做唔同 partition 逐個 block 做
條問題本身都係預係 hadoop 個類 distributed system run
2022-08-11 22:49:27
條題目寫到明啦, 係無可能load晒20gb

咁樣樣則係要用disk
你只有2gb memory.
所以你要唸點樣將個matrix拆做十份
2022-08-11 22:50:20
2022-08-11 22:51:47
agger
用hadoop map reduce 或者 spark
2022-08-11 22:52:00
2022-08-11 22:53:33
LM
2022-08-11 22:54:36
2022-08-11 22:57:23
2022-08-11 22:57:32
2022-08-11 22:59:43
用Java google "download more ram"
2022-08-11 22:59:55
唔關事, big data 講究係 divide and conquer
input data 可以去到幾 tb, 無可能俾你寫入 single hard disk
你要分散啲 data 等可以派到去俾 worker
hadoop / spark 係俾你一堆 worker, 單機係 single worker, 做法基本一樣
2022-08-11 23:00:31
2022-08-11 23:01:15
2022-08-11 23:03:01
2022-08-11 23:03:02
2022-08-11 23:03:02
有冇試過2d array,會比hashmap用少好多memory
2022-08-11 23:04:13
2022-08-11 23:04:57
咁樣不如寫落SQLite
之後落返index
2022-08-11 23:05:23
你個input matrix咩樣嫁?plot 個histogram出黎睇下?如果成個feature column係constant 就成欄skip 得
2022-08-11 23:05:54
唔洗寫落 file, 直接開住個 input stream read
個 40gb 係邊度黎 ? http file ?
2022-08-11 23:06:45
2022-08-11 23:08:00
上面個sample就係memory mapped file, 將hdd map做memory
以前用c++&win32 api玩過
後來應該c# java vb.net 之流都有原生library支援, 而且cross platform

利申9up
吹水台自選台熱 門最 新手機台時事台政事台World體育台娛樂台動漫台Apps台遊戲台影視台講故台健康台感情台家庭台潮流台美容台上班台財經台房屋台飲食台旅遊台學術台校園台汽車台音樂台創意台硬件台電器台攝影台玩具台寵物台軟件台活動台電訊台直播台站務台黑 洞