Similar repositories to gbif/spark-duplicate-detection:
gbif/spark-duplicate-detection
elbaulp/DPASF
awslabs/ml-io
databricks/spark-tfocs
Sotera/correlation-approximation
lightcopy/parquet-index
vagmcs/Optimus
acrosa/scala-redis
hortonworks/data-tutorials
spotify/big-data-rosetta-code
awesome-spark/spark-gotchas
usethesource/capsule
propensive/fury
UdashFramework/udash-core
amplab/keystone
dibbhatt/kafka-spark-consumer
unfiltered/unfiltered
sbt/sbteclipse
vegas-viz/Vegas
ray-project/tutorial
databricks/tensorframes
akka/akka-samples
sequenceiq/docker-spark
linkedin/photon-ml
playframework/play-slick
big-data-europe/docker-hive
eaplatanios/tensorflow_scala
BIDData/BIDMach
microsoft/Mobius
twitter/elephant-bird
tensorflow/ecosystem
scopt/scopt
pathikrit/better-files
OryxProject/oryx
datastax/spark-cassandra-connector
twitter/algebird
milessabin/shapeless
addthis/stream-lib
twitter/scalding
oracle/visualvm