先科普一下大名鼎鼎的Two Sigma吧系洛。2001年創(chuàng)立俊性,現(xiàn)在管理多達(dá)500億美金的資產(chǎn),排名對沖基金公司的全球第四C璩丁6ㄒ场!做為一家知名的對沖基金绽诚,為何他們給Apache Spark社區(qū)堅(jiān)持不斷地做貢獻(xiàn)典徊?
這也許是很多人困惑的事情。其實(shí)恩够,現(xiàn)在的對沖基金的投資決策是基于大量數(shù)據(jù)分析卒落,結(jié)合人工智能技術(shù)而做出的。蜂桶。儡毕。因此,他們也是重度的Spark用戶屎飘!還專門為社區(qū)開發(fā)了時(shí)序處理的庫Flint【此為大財(cái)閥Two Sigma的Flint 非阿里的Flink】妥曲!他們的網(wǎng)站https://opensource.twosigma.com?如是說,
We depend on Apache Spark to scale our data-heavy analyses and we build tools on top of it (see our Flint project above). The Spark Summits each year are a calendar highlight.
WHY WE CONTRIBUTE
At Two Sigma, we use science and technology to tackle the world’s most complex problems. We balance IP concerns with the drive to give back to the community – wherever possible, we believe in open sourcing the tools we’ve developed to help others discover value in the world’s data.
實(shí)際上钦购,他們也到處宣講他們的開源精神檐盟!見Slides:Why Two Sigma Contributes to Open Source?
#1: Leveraging other people’s work?
#2: Shaping the ecosystem
#3: Avoiding isolation
#4: Being cool
#5: Building your legacy
#6: Making the world a better place
做為重度使用Spark的公司,他們做為投資界的表率押桃,孜孜不倦地為社區(qū)做著各種貢獻(xiàn)葵萎,下面兩篇blog介紹了其中的主要貢獻(xiàn)!
- 基于Spark的時(shí)序library:Introducing Flint: A time-series library for Apache Spark
- 基于Arrow的Pandas UDF:Introducing Pandas UDF for PySpark?