Red Data Tools
Data processing with Ruby!
About Red Data Tools
Red Data Tools is a project that provides data processing tools for Ruby.
Our policies:
- Collaborate across the Ruby community
  - We collaborate with the Ruby community and with other communities. For example, we use Apache Arrow, which is shared across many languages, and we take part in its development so that the benefits are shared.
- Acting rather than blaming
  - We spend our time writing code, writing tests, writing documentation, introducing our work to others, giving feedback to other projects and so on, rather than blaming the current situation (Python may have many more good tools than Ruby) or particular library implementations.
- Continuous, iterative progress rather than a short, big project
  - We may need a diverse set of components to handle many data processing tasks with Ruby. Building a complete set of data processing tools takes continuous work, so steady progress matters more to us than a short sprint.
- The current lack of knowledge doesn't matter
  - Implementing fast tools may also require knowledge of mathematics, statistics, linear algebra and other areas. But we don't require that knowledge when someone joins us. We can learn while we work, and we can both use and learn from existing fast implementations.
- Ignore criticism from outsiders
  - It may take us a long time to achieve a complete set of tools for data processing. Some outsiders may criticize us until then. We should ignore that criticism. We don't have time to worry about it. :-)
- Fun!
  - Because we use Ruby!
Community
Products
In alphabetical order:
- Arrow packages: Apache Arrow related packages for Debian GNU/Linux, Ubuntu and CentOS.
- Charty: Ruby library for visualizing your data in a simple way.
- Jekyll Jupyter Notebook plugin: Jekyll plugin for using Jupyter notebooks.
- Parquet GLib: Wrapper library around Apache Parquet C++, used as the basis for language bindings.
- Red Arrow GSL: Ruby library that provides conversion methods between Red Arrow and Ruby/GSL.
- Red Arrow NMatrix: Ruby library that provides conversion methods between Red Arrow and NMatrix.
- Red Arrow Numo::NArray: Ruby library that provides conversion methods between Red Arrow and Numo::NArray.
- Red Arrow PyCall: Ruby library that provides conversion methods between Red Arrow and pyarrow in PyCall.
- Red Arrow: Ruby bindings for Apache Arrow.
- Red Chainer: A flexible framework for neural networks, ported from Python's Chainer to Ruby.
- Red Datasets: Open datasets commonly used for machine learning, collected in a Ruby gem for easy use in projects.