104. Alluxio
Formerly known as Tachyon, Alluxio describes itself as "a memory-centric distributed storage system enabling reliable data sharing at memory-speed across cluster frameworks." It works with tools like Spark and Hadoop to speed performance on big data queries. Operating System: Linux, OS X
105. Ambari
Part of the Hadoop ecosystem, this Apache project offers an intuitive Web-based interface for provisioning, managing, and monitoring Hadoop clusters. It also provides RESTful APIs for developers who want to integrate Ambari's capabilities into their own applications. Operating System: Windows, Linux, OS X.
106. Avro
This Apache project provides a data serialization system with rich data structures and a compact format. Schemas are defined with JSON and it integrates easily with dynamic languages. Operating System: OS Independent.
107. Cascading
Cascading is an application development platform based on Hadoop. Commercial support and training are available. Operating System: OS Independent.
108. Chukwa
Based on Hadoop, Chukwa collects data from large distributed systems for monitoring purposes. It also includes tools for analyzing and displaying the data. Operating System: Linux, OS X.
109. Data Torrent RTS
Data Torrent has been around a while, but it first open sourced its Core RTS technology in June of this year. It claims to be "the industry's only open source enterprise-grade unified stream and batch platform." It comes in community, standard and enterprise versions. Operating System: Linux
110. Disco
Originally developed by Nokia, Disco is a distributed computing framework that, like Hadoop, is based on MapReduce. It includes a distributed filesystem and a database that supports billions of keys and values. Operating System: Linux, OS X.
111. Flume
Flume collects log data from other applications and delivers them into Hadoop. The website boasts, "It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms." Operating System: Linux, OS X.
112. Genie
Created by Netflix, Genie allows IT administrators to manage Hadoop jobs running on cloud computing services. Netflix uses it to run many thousands of Hadoop jobs every day. Operating System: Windows, Linux, OS X
113. Hadoop
This Apache-sponsored project is the best-known big data tool available. Numerous companies, including Amazon Web Services, Cloudera, Hortonworks, IBM, Pivotal, SyncSort and VMware, offer related products or commercial support for Hadoop. Well-known users include Alibaba, AOL, eBay, Facebook, Google, Hulu, LinkedIn, Spotify, Twitter and Yahoo. Operating System: Windows, Linux, OS X
114. Hadoop Distributed File System
HDFS is the file system for Hadoop, but it can also be used as a standalone distributed file system. It's Java-based, fault-tolerant, highly scalable and highly configurable. Operating System: Windows, Linux, OS X.
115. HPCC
This alternative to Hadoop also offers massive parallel processing and storage of big data workloads. Paid enterprise services are available. Operating System: Linux
116. Hypertable
Very popular with Web companies, Hypertable was developed by Google as a way to make databases more scalable. Its users include Baidu, eBay, Groupon and Yelp. It is compatible with Hadoop, and commercial support and training are available. Operating System: Linux, OS X
117. Ignite
This Apache project describes itself as "a high-performance, integrated and distributed in-memory platform for computing and transacting on large-scale data sets in real-time, orders of magnitude faster than possible with traditional disk-based or flash technologies." The platform includes data grid, compute grid, service grid, streaming, Hadoop acceleration, advanced clustering, file system, messaging, events and data structure capabilities. Operating System: OS Independent.
118. Kudu
Currently in beta trials, Kudu is an Apache project that is part of the Hadoop ecosystem. It combines a simple data model with columnar storage, low latency and distributed architecture. Operating System: Windows, Linux, OS X
119. Lipstick
This Netflix project provides an easy-to-understand graphical representation of Hadoop Pig jobs. It updates as the job executes so that administrators and developers no longer need to sift through log data. Operating System: Windows, Linux, OS X
120. Lucene
Java-based Lucene performs full-text searches very quickly. According to the website, it can index more than 150GB per hour on modern hardware, and it includes powerful and efficient search algorithms. Development is sponsored by the Apache Software Foundation. Operating System: OS Independent.
121. Lumify
Created by a company called Altamira Technologies, Lumify describes itself as an "open source big data analysis and visualization platform." It makes it easy to create 2D or 3D graphs that show the relationship between entities or to overlay data on maps. For those who are interested in learning more about how it works, the website offers several videos that show Lumify in action, and it also has a demo site that allows users to upload their own data and try out the software. Operating System: Linux.
122. MapReduce
An integral part of Hadoop, MapReduce is a programming model that provides a way to process large distributed datasets. It was originally developed by Google, and it also used by several other big data tools on our list, including CouchDB, MongoDB and Riak. Operating System: OS Independent.
123. Mesos
Apache Mesos is a resource abstraction tool that makes it possible for enterprises to treat their entire data center as a single pool of resources, and it is popular with companies that are also running Hadoop, Spark and similar applications. Organizations that use it include Airbnb, CERN, Cisco, Coursera, Foursquare, Groupon, Netflix, Twitter and Uber. Operating System: Linux, OS X
124. Oozie
This workflow scheduler is specifically designed to manage Hadoop jobs. It can trigger jobs by time or by data availability, and it integrates with MapReduce, Pig, Hive, Sqoop and many other related tools. Operating System: Linux, OS X.
125. Pandas
The Pandas project includes data structures and data analysis tools based on the Python programming language. It allows organizations to use Python as an alternative to R for big data analysis projects. Operating System: Windows, Linux, OS X.
126. Pig
Apache Pig is a platform for distributed big data analysis. It relies on a programming language called Pig Latin, which boasts simplified parallel programming, optimization and extensibility. Operating System: OS Independent.
127. Pinot
Developed by LinkedIn and open sourced in June 2015, Pinto is a distributed OLAP datastore that enables real-time scalable analytics. Key features include column orientation, pluggable indexing technology, data ingestion from Kafka and Hadoop and support for a SQL-like language. Operating System: Windows, Linux, OS X
128. Pivotal GemFire, Greenplum Database and HAWQ
Pivotal, a spinoff of EMC and VMware, announced earlier this year that they would be open sourcing key pieces of their big data suite, including GemFire, Greenplum Database and HAWQ. Gemfire helps organizations build data-driven applications, Greenplum is an analytics-enabled database, and HAWQ is a SQL analytics engine for Hadoop. Operating System: Windows, Linux, OS X
129. Presto
Developed by Facebook, Presto describes itself as "an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes." Facebook says that it uses it for queries against a 300PB data warehouse, and other users include Airbnb and Dropbox. Operating System: Linux
130. Solr
This "blazing-fast" enterprise search platform claims to be highly reliable, scalable and fault tolerant. The companies using it include AT&T, Ticketmaster, Comcast, Instagram, Netflix, IBM, Adobe and SAP Hybris. Operating System: OS Independent
131. Spark
Apache Spark boasts that it can "run programs up to 100x faster than Hadoop MapReduce in memory, or 10x faster on disk." Organizations "powered by" Spark include Amazon, Baidu, Groupon, Hitachi Solutions, IBM, MyFitnessPal, Nokia and Yahoo. Operating System: Windows, Linux, OS X
132. Sqoop
Enterprises frequently need to transfer data between their relational databases and Hadoop, and Sqoop is one tool that gets the job done. It can import data to Hive or HBase and export from Hadoop to RDBMSes. Operating System: OS Independent.
133. Storm
Apache Storm offers to do for real-time data what Hadoop does for batch data. Its website lists The Weather Channel, Twitter, Yahoo, WebMD, Spotify, Verisign, Flipboard and Klout among its users. Operating System: Linux
134. Samza
Originally developed by LinkedIn, this Apache top-level project does distributed stream processing. It relies on the Kafka messaging system as well as Hadoop YARN distributed processing technology. Operating System: Windows, Linux, OS X
135. Terracotta
Calling its BigMemory technology "the world's premier in-memory data management platform," Terracotta boasts 2.1 million developers and 2.5 million deployments of its software. The company also offers commercial versions of its software, plus support, consulting and training services. Operating System: OS Independent.
136. Tez
Built on top of Apache Hadoop YARN, Tez is "an application framework which allows for a complex directed-acyclic-graph of tasks for processing data." It allows Hive and Pig to simplify complicated jobs that would otherwise take multiple steps. Operating System: Windows, Linux, OS X.
137. Zookeeper
This administrative big data tool describes itself as "a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services." It allows nodes within a Hadoop cluster to coordinate with each other. Operating System: Linux, Windows (development only), OS X (development only).
138. jBilling
Downloaded more than 120,000 times, jBilling claims to be "world's leading open source enterprise billing solution." It's particularly good for subscription-based billing, and notable customers include Deutsche Telecom, Constant Contact and Rackspace. In addition to the free version, paid support and services are available. Operating System: OS Independent
139. SimpleInvoices
This web-based invoicing system is easy to use, sending electronic invoices as PDF files. You can download the code and host it on your own server or use the hosting services provided by Smarter Invoices. Operating System: OS Independent.
140. Siwapp
Another Web-based invoicing option, Siwapp allows users to create professional-looking, secure invoices with a minimum amount of hassle. Check out the screenshots and demo on the site to get a feel for how easy it is to use. Operating System: OS Independent.
Bitcoin
141. OpenBazaar
Formerly known as DarkMarket, this project allows users to trade BitCoin for goods and services freely. Based on BitTorrent technology, it emphasizes privacy and requires no fees for trades. It is currently in beta trials. Operating System: Windows, Linux, OS X
Blogging
142. B2evolution
B2evolution can run a standalone blog, multiple blogs or an entire website. It also includes features for setting up a community and marketing your site, and lots of information about hosting is available on the website. Operating System: OS Independent
143. Ghost
Downloaded more than 340,000 times, Ghost is an open source blogging platform based on Node.js. You can run it on your own Web server for free or use the hosted service, with prices starting at $10 a month. Operating System: Windows, Linux, OS X.
144. WordPress
Used by more than 60 million people and organizations, WordPress is one of the most popular blogging platforms available. Users can host their own blogs using the open source software or use the service provided at WordPress.com. Microsoft has also partnered with the organization on Scalable WordPress, a Web application that runs on Azure. Operating System: Windows, Linux, OS X, iOS, Android, BlackBerry, Windows Phone, others
Browsers
145. Chromium
Chromium is the open source project behind Google's Chrome browser. It aims to be safer, faster and more stable than other browsers. Operating System: Windows, Linux, OS X, Chrome OS, iOS, Android
146. Dooble
Less well-known than some of the other open source browsers, Dooble was created in order to offer better privacy on the Web. It encrypts most of the information it saves, and it includes the option of automatically deleting cookies. Operating System: Windows, Linux, OS X
147. Firefox
According to NetMarketShare, more than 13 percent of people use Firefox as their Web browser. It's made by Mozilla, which has been named the "most trusted Internet company for privacy." Operating System: Windows, Linux, OS X, Android
148. Tor Browser
Designed for privacy protection, the Tor browser cloaks the identity of users by bouncing communications of various servers around the world. It can be installed on a system or run from a thumb drive. Operating System: Windows, Linux, OS X
Bugtrackers
149. BugTracker.NET
Boasting thousands of users, BugTracker.NET is easy to use and highly configurable. It supports Subversion, Git, and Mercurial, and the website includes a helpful demo and a link to comparisons of various bug tracking systems. Operating System: Windows
150. Bugzilla
A favorite in the open source community, Bugzilla is used by Mozilla, the Linux Foundation, GNOME, KDE, Apache, LibreOffice, Open Office, Eclipse, Red Hat and Novell, among others. Key features in this bugtracker include advanced search capabilities, email notifications, scheduled reports, time tracking, excellent security and more. Operating System: Windows, Linux, OS X
151. FlySpray
FlySpray describes itself as "an uncomplicated, Web-based bug tracking system written in PHP for assisting with software development." It supports multiple databases, including MySQL and PGSQL, and it's easy to use. Operating System: OS Independent
152. GNATS
GNATS is the Gnu project's bug tracking system. Gnatsweb provides a web interface for the command line tool, and several third-party interfaces are also available. Operating System: OS Independent
153. MantisBT
This Web-based bug tracker integrates with most SQL databases and can be accessed with any browser. It's easy to install and has an enormous list of features. Operating System: Windows, Linux, OS X
154. BIRT
BIRT isn't a complete BI suite; instead, it focuses on technology that can be "used to create data visualizations and reports that can be embedded into rich client and web applications." It is sponsored by OpenText, Innovent Solutions and IBM. Operating System: Windows, Linux, OS X
155. Jaspersoft
This popular business intelligence suite is available in a Cloud Analytics version that runs on Amazon Web Services. It also comes in both free and paid versions for on-premise deployment. Operating System: OS Independent.
156. JMagallanes
This Java-based reporting tool can create static or dynamic reports and charts. It can accept data from SQL, Excel, XML and other sources, and it can create PDF, XML or application-specific files. Operating System: OS Independent
157. KNIME
KNIME offers a free, open source Analytics Platform, with several commercial extensions available; these include Personal Productivity, Partner Productivity, TeamSpace, Server, Big Data Extension and Cluster Execution. It claims to be "quick to deploy, easy to scale, intuitive to use and open to all." Operating System: Windows, Linux, OS X
158. OpenI
OpenI strives to help enterprises go from "data to insights in 72 hours with open source." It's a reporting tool that integrates with other open source BI solutions and can be deployed to Amazon Web Services very easily. Operating System: OS Independent.
159. Pentaho
Now owned by Hitachi Data Systems, Pentaho describes itself as "a comprehensive data integration and business analytics platform." Its customers include Caterpillar, Halliburton, BR and Nasdaq. It allows organizations to integrate big data from a variety of sources, including Hadoop, NoSQL databases, analytic databases and relational databases. It then enables interactive analysis, reporting and visualizations, allowing users to create customized dashboards that suit their unique purposes.
The open source version of the Pentaho software is available on the Community website. In addition, the company offers a free thirty-day trial of its paid enterprise software, and it offers a variety of paid services like custom visualizations, training, enterprise support, consulting, certification and technical support. Operating System: Windows, Linux, OS X
160. SpagoBI
SpagoBI is an open source business intelligence and big data analytics platform. The software is completely free, but paid user support, maintenance, consulting and training are available for purchase. It includes tools for reporting, multidimensional analysis (OLAP), charts, location intelligence, data mining, ETL and more. It also integrates with popular in-memory processing engines and enables real-time processing. Operating System: OS Independent
Business Process Management (BPM)
161. Activiti
Designed for business people, developers and systems administrators, Activiti is a lightweight workflow and BPM solution. Paid enterprise support is available through Alfresco. Operating System: OS Independent
162. Bonita Open Solution
Bonita claims to be "the most powerful BPM-based application platform." It comes in community and subscription editions, and paid training and consulting are also available. Operating System: OS Independent
163. ProcessMaker
Award-winning ProcessMaker offers tools to help enterprises automate workflow processes as well as performing full-scale BPM. It comes in community, enterprise and cloud editions. Operating System: Windows, Linux.
CAD
164. Archimedes
Designed with architects in mind, Archimedes is a less full-featured CAD modeling tool. It is based on Eclipse's Rich Client Platform and is highly extensible. Operating System: Windows, Linux, OS X
165. BRL-CAD
Early development activity on this very mature project began way back in 1979, and the U.S. military has been using it to model weapons systems for more than twenty years. It includes more than 400 tools for constructive solid geometry modeling. Operating System: Windows, Linux, OS X, others
166. FreeCAD
Useful for hobbyists, educators and experienced CAD professionals, FreeCAD is a general purpose 3D modeling tool. It has been around since 2001 and has a very full set of features. Operating System: Windows, Linux, OS X, others
Charts
167. GraphViz
Another alternative for creating network diagrams, GraphViz lets you use simple text or GXL (a variant of XML) to describe charts, and then it does the drawing work for you. See the Gallery for examples of the types of graphics it can create. Operating System: Windows, Linux, OS X.
Chemistry
168. Avogadro
For more advanced students and professional chemists, Avogrado offers an intuitive interface for creating visualizations of molecules. The website also includes some tips for educators on integrating Avogadro into the classroom. Operating System: Windows, Linux
169. Jmol
This java-based app lets students create diagrams of atoms, molecules, macromolecules, crystals, and more. The site includes a handbook and tutorials for helping you learn how to use the software. Operating System: OS Independent.
170. Kalzium
Need help with introductory chemistry? This KDE app allows students to explore the periodic table, and it comes complete with a molecular weight calculator, an isotope table, a 3D molecule editor and an equation solver for stoichiometric problems. (Note that in order to use Kalzium on Windows, you'll have to download KDE for Windows.) Operating System: Windows, Linux
Cinema Camera
171. AXIOM Beta
Made by a company called apertus, the AXIOM Beta is the world's first open source cinema camera. Beta prototypes began shipping in August, and the company has a well-developed road map for eventually shipping a complete device based on a modular open source hardware concept. Operating System: Linux
Classroom Management
172. iTALC
Intelligent Teaching and Learning with Computers, aka iTALC, gives teachers the tools they need to manage a computer-based classroom without the high license fees of commercial software. Key features include remote control, demo viewing, overview mode, workstation locking and VPN access for off-site students. Operating System: Windows, Linux
173. AppScale
AppScale allows users to run applications based on Google App Engine on any infrastructure. It is used by organizations like the World Wildlife Federation, Chico's and Google itself. Operating System: Linux.
174. Cloud Foundry
Cloud Foundry offers open source tools for building a platform as a service. Claiming to be "built by industry leaders for industry leaders," it lists IBM, Pivotal, Hewlett Packard Enterprise, VMware, Intel, SAP and EMC as its supporters. Operating System: Linux
175. CloudStack
This turnkey IaaS solution forms the basis for many public and private clouds. It has an extremely long list of users that includes Alcatel-Lucent, Apple, Autodesk, BT, CA Technologies, Citrix, Cloudera, Dell, Fujitsu, SAP and Verizon. Operating System: OS Independent
176. Dasein Cloud
A Dell project, Dasein Cloud is "an open source cloud abstraction library for Java." It allows developers to write applications once that can be translated to any cloud provider's model. Operating System: Linux.
177. Eucalyptus
Now managed by Hewlett Packard Enterprise (HPE), Helion Eucalyptus enables hybrid cloud computing by allowing organizations to build private clouds that are compatible with Amazon Web Services (AWS), which. Paid support is available from HPE. Operating System: Linux.
178. FOSS-Cloud
With FOSS-Cloud, users can build their own public or private cloud computing environments. It includes tools for VDI, IaaS, PaaS, SaaS and cloud storage. Operating System: Windows, Linux.
179. ManageIQ
This cloud management solution is the open source project behind Red Hat CloudForms. It enables services like chargebacks, service orchestration, lifecycle management and automated workflows, as well as enabling hybrid cloud environments. Operating System: Linux, VMware
180. Oneye
Forked from eyeOS, which became closed source after version 2.0, Oneye allows users to set up their own servers for cloud desktop functionality. There is a demo available on the website. Operating System: Linux.
181. OpenNebula
OpenNebula promises "the simplest could deployment and management experience." Paid support, training, engineering and consulting are available through OpenNebula Systems. Operating System: Linux.
182. openQRM
Downloaded more than 400,000 times, the open source version of openQRM offers free cloud computing software that is suitable for small deployments. It also comes in multiple paid enterprise editions, and the company behind the project also offers paid hardware/software bundles and a variety of related services. Operating System: Linux.
183. OpenShift
This Red Hat project aims to make it easy for enterprises to develop, host and scale cloud-based applications. Red Hat offers a public PaaS under the OpenShift name, as well as the open source software and an enterprise version for organizations that want to build a private PaaS. Operating System: Linux.
184. OpenStack
This very popular cloud computing platform claims that "hundreds of the world's largest brands" rely on it every day. Its sponsors include AT&T, Ubuntu, Hewlett Packard Enterprise, IBM, Intel, Rackspace, Red Hat, SUSE, Cisco, Dell, EMC, Symantec and many other well-known technology firms. Operating System: OS Independent
185. ownCloud
Designed to help users set up their "own cloud," this project emphasizes file syncing and sharing, as well as collaboration capabilities. A paid enterprise version is available through OwnCloud.com. Operating System: Windows, Linux.
186. Roboconf
This tool makes it easier to deploy applications to the cloud or other distributed computing environments. It supports many public cloud services, including AWS, Microsoft and Vmware, as well as most private cloud environments. Operating System: OS Independent
187. Scalr
This cloud management platform has been highly ranked by market research firms, and it simplifies the process of managing multiple cloud environments. Its notable users include Expedia, Samsung, the NASA Jet Propulsion Laboratory, Accenture, Sony and Autodesk. Operating System: Linux
188. Synnefo
Co-financed by Greece and the European Union, Synnefo offers a complete open source cloud computing stack written in Python. It uses OpenStack APIs to simplify management, and it also relies on Google Ganeti for cluster management and Archipelago for storage management. Operating System: Linux.
Cloud Storage
189. Camlistore
Content-Addressable Multi-Layer Indexed Storage, or Camlistore for short, offers "formats, protocols, and software for modeling, storing, searching, sharing and synchronizing data in the post-PC era." Stored data can be accessed from any device. Note that it is for more technically minded users, not beginners. Operating System: Linux
190. CloudStore
An alternative to Dropbox, CloudStore synchronizes data between a system and online storage. Its features include strong encryption, password-less authentication, flexible synchronization, fast setup and auto-resumes for interrupted data transfers. Operating System: Linux
191. Cozy
Cozy invites individuals to "store, sync, and share your data just the way you want it" by setting up their own personal cloud. It hopes to evolve into "a universal tool that can help you organize your life and make you more productive by automating routine tasks." Operating System: Linux
192. DREBS
Designed for Amazon Web Services users, DREBS stands for "Disaster Recovery for Elastic Block Store." It runs on Amazon's EC2 services and takes snapshots of EBS volumes for disaster recovery purposes. Operating System: Linux
193. Duplicati
Duplicati works with cloud storage services like Amazon, Windows OneDrive, Google Drive and Rackspace to create encrypted backups. It does a full backup on first use and incremental backups after that. Operating System: Windows, Linux, OS X
194. DuraCloud
With DuraCloud, users can backup their files to multiple cloud storage providers at once or move their files from one cloud to another at will. In addition to the open source software, it is also offered as a paid service. Operating System: Windows, Linux, OS X.
195. FTPbox
This tool also allows users to set up their own cloud storage solution. And as you might guess from the name, it transfers files via FTP. Operating System: Windows
196. LipSync
Another option for setting up a DropBox-like service for yourself, LipSync is a very lightweight tool based on OpenSSH, rsync and lsyncd. The project was born as an idea in a blog post. Operating System: Linux
197. Pydio
Formerly known as AjaXplorer, Pydio offers enterprise-class cloud storage designed to meet the needs of developers and system administrators. Downloaded more than a million times, it claims to be scalable, secure, highly flexible and loved by end users. A paid enterprise distribution is available. Operating System: Windows, Linux (Android and iOS clients available)
198. Rockstor
Rockstor offers several open source storage solutions, including cloud storage servers for individuals and SMBs. It also offers an open source NAS solution. Paid support is available. Operating System: Linux
199. SeaFile
SeaFile is both an open source project for setting up cloud storage with client-side encryption and a paid cloud service with the same capabilities. It's enterprise-ready and promises fast performance Operating System: Windows, Linux, OS X, Android, iOS
200. Sheepdog
According to its website, Sheepdog is "a distributed object storage system for volume and container services and manages the disks and nodes intelligently." It supports snapshotting, cloning and thin provisioning, and it is compatible with OpenStack Swift and Amazon S3. Operating System: Linux
201. Skylable
Downloaded more than 285,000 times, Skylable allows users to create their own object storage solution that is compatible with Amazon's S3 service. A paid, supported version is also available. Operating System: Windows, Linux, OS X
202. SparkleShare
With SparkleShare, you create a special folder on your system that is automatically synchronized with a host folder. It's a good option for collaboration, but less than ideal for full system backups. Operating System: Windows, Linux, OS X
203. StackSync
This tool describes itself as "the open source personal cloud for organizations." Like Dropbox, it syncs files across multiple devices, and it also includes client-side encryption for security. Operating System: Windows, Linux
204. Syncany
Syncany can encrypt and sync files stored on any cloud storage service. It is still an alpha release, but is working. Operating System: Windows, Linux, OS X
205. Syncthing
Yet another DropBox replacement, Syncthing allows users to choose where their data is stored—on their own servers or a cloud service. It runs on any operating system and includes an easy-to-use Web GUI. Operating System: Windows, Linux, OS X
206. Unison
Another file synchronization tool, Unison can handle changes made to multiple copies of a file at the same time, making it a good option for distributed development teams or enterprise collaboration as well as for backup needs. It can work cross-platform and transfers files via SSH. Operating System: Windows, Linux, OS X,
Collaboration/Groupware
207. Collabtive
Similar to Basecamp, Collabtive offers web-based project management that includes time-tracking and reporting. Download the software and host it yourself, or purchase on a SaaS basis. Operating System: OS Independent.
208. cyn.in
This groupware tool includes wikis, blogs, file repositories, event calendars, bookmark directories, discussion boards and multimedia galleries. It comes in free and commercially supported versions, and it is also available as an on-demand cloud service. Operating System: Windows, Linux, OS X.
209. EGroupware
EGroupware emphasizes flexibility, offering configurable custom fields and making it easy to integrate data from other applications. In addition to the free community edition, it also comes in a full version for on-premise deployment and a cloud version with several pricing tiers. Operating System: OS Independent.
210. Feng Office
Feng Office boasts more than 2 million users at organizations like University of California Berkeley, Airbus, NASA, Porsche Design, Xerox and the NBA. It comes in both free and paid community editions, as well as professional and enterprise editions and the Feng Sky cloud-based option. Operating System: Windows, Linux, OS X.
211. Group-Office
Group-Office combines enterprise CRM and groupware features like calendar and email. The paid professional version adds helpdesk, mobile sync, time-tracking, projects and document editing. A hosted cloud version is also available. Operating System: OS Independent.
212. K-9
This is an alternative email client with features like search, multi-folder sync, flagging, filing, signatures, PGP and more. It supports Exchange, IMAP and POP3 mail servers. Operating System: Android
213. Open Atrium
Open Atrium describes itself as "a team collaboration tool with a kick of open source hotness." It includes blog, calendar, sharebox, notebook, case tracker and dashboard capabilities. Operating System: OS Independent
214. Seafile
This collaboration solution includes cloud storage, mobile document access, file syncing, messaging and other capabilities. It's available in free or paid private server versions or as a cloud-based service. Operating System: Linux.
215. TeamLab
TeamLab combines document editing, mail, CRM and project management into a single package. Download the open source version, pay for a supported enterprise version, or use the hosted cloud version. Unlike most other projects, TeamLab also offers a free cloud version for non-profits. Operating System: OS Independent.
216. TUTOS
Short for "The Ultimate Team Organization Software," includes group calendar, email, time tracking, document management and project management. In addition, it also offers special features for developers, like a bug tracker, test support and some Scrum tools . Operating System: Linux
217. Zarafa
A complete communication tool, Zarafa offers Web-based meetings and file management as well as standard calendar and email capabilities. It also offers mobile sync and secure mail. A variety of paid subscription plans are available; for the free version, click the community tab on the website. Operating System: Linux
Compression
218. 7-zip
This helpful utility can compress files 30-70 percent more than the well-known WinZip utility. It addition to its unique 7z file format, it can also pack and unpack a variety of other archive file formats. Operating System: Windows, Linux, OS X
219. ArchivConvert
This award-winning app can convert among a wide variety of archive file formats. The latest version can read 52 different formats and write to 20. Operating System: Windows
220. ArcThemALL!
With this tool, users can unpack nearly three dozen different types of compressed files and create UPX, ZIP or 7Z files. It includes encryption capabilities and the ability to create self-extracting archives. Operating System: Windows
221. Caesium
This helpful app compresses images up to 90 percent, making it easier to e-mail them or upload them to your favorite Web site. It supports most common image formats, including JPG, PNG, GIF, BMP, and WMF. Operating System: Windows
222. J7Z
This tool offers the same functionality at 7-Zip with a slightly different interface. It was designed to speed up the compression process for certain types of jobs. Operating System: Windows
223. Keka
This Mac-only archiving tool can extract from 13 different compression formats and create seven. The default compression option is a port of 7-zip. Operating System: OS X
224. KGB Archiver
KGB claims to be able to compress files even more than 7-zip. It also includes built-in AES-256 encryption. Operating System: Windows
225. PeaZip
One of the most versatile compression tools available, PeaZip can open more thn 150 different types of archive file formats. It also includes security features like encryption, two-factor authentication, secure delete and much more. Operating System: Windows, Linux, OS X.
226. cAdvisor
Short for "Container Advisor," cAdvisor is a Google project that monitors container performance and resource usage. It is intended for use with Docker. Operating System: Linux
227. Docker
Docker has quickly established itself as the dominant platform in the relatively new field of containerization. Many of the biggest names in technology, including Amazon, Microsoft, IBM, Hewlett Packard Enterprise, Red Hat, Rackspace and Canonical are building or offering products that extend or use Docker technology. Operating System: Windows, Linux, OS X
228. Kubernetes
Developed by Google, Kubernetes is an open source container management solution. It is highly scalable, running billions of containers in Google's data centers, as well as containers at Viacom, Ebay and Wikimedia. Note that in order to use it you will need Docker and a Google Cloud Platform Account. Operating System: Linux, OS X.
229. Linux Containers
This group oversees three separate containerization-related projects: LXC, a set of tools for containerization; LXD, a descendant of LXC which provides a more intuitive user experience; CG Manager container group manager daemon and the LXCFS filesystem. Its stated goal is "to offer a distro- and vendor-neutral environment for the development of Linux container technologies. Operating System: Linux
230. OpenVZ
While it's not nearly as well-known as Docker, OpenVZ also offers open source containerization technology. It provides the basis for a commercial product called Odin Virtuozzo. Operating System: Linux
231. Rkt
Part of the CoreOS operating system, Rkt describes itself as "the next-generation container manager for Linux clusters." It emphasizes security and simplicity, and it ships in ArchLinux, Fedora and several other Linux distributions. Operating System: Linux, OS X.
Content Management and Wikis
232. Alfresco
Similar to Microsoft SharePoint, Alfresco's community edition offers enterprise content management that "helps you regain control of critical business content, strengthen compliance, optimize processes and make collaboration easy." It also comes in a paid enterprise version called Alfresco One, and a cloud-based version which makes the software available on an SaaS basis. Operating System: Windows, Linux, OS X.
233. Armstrong
First released in 2011, Armstrong is a content management system specifically designed to meet the needs of news organizations. It was developed by The Bay Citizen and the Texas Tribune. Operating System: OS Independent
234. BIGACE
While BIGACE is used by many smaller organizations, it's multi-site, multi-language, and multi-user capabilities make it a possibility for larger enterprises looking for a Web CMS. Key features include a WYSIWIG editor, templates and stylesheets, and permission management. Operating System: Windows, Linux, Unix
235. Bitweaver
Bitweaver aims to make it easy for non-technical staff, like the marketing team, to add content to corporate or small business websites. It includes tools for adding articles, blogs, calendars and image galleries, and multiple add-ons are available. Operating System: OS Independent.
236. DNN
Previously known as DotNetNuke, this content management solution promises high ROI for less effort when building rich interactive websites. Its users include Canon, Time Warner Cable, Texas Instruments and Bank of America. Operating System: Windows
237. DokuWiki
This simple-to-use wiki was designed for developer teams and other small groups. Key features include unlimited revisions, recent changes viewing, section editing, anti-spam capabilities, multiple language support and more. Operating System: OS Independent
238. Drupal
Drupal claims that more than 98,000 developers are actively contributing to this extremely popular content management system. It counts Microsoft, Zend, Fastly and New Relic among its supporters, and it has a marketplace that features hundreds of companies which offer related products and services. Operating System: OS Independent
239. Family Connections
If you love scrapbooking, you might also love setting up a family Web site. This app makes it easy to create a private site for sharing photos, calendars, recipes and keeping in touch with family members. Operating System: OS Independent
240. FOSWiki
A split from the TWiki project, FOSWiki distinguishes itself with support of embedded macros to enhance content and allow end-users to create their own applications. The Web site includes considerable support for new users, as well as a number of extensions to the basic application. Operating System: OS Independent
241. Fraym
Launched in 2014, Fraym promises "a new experience for creating and editing websites" and includes features like multi-language support, SEO, integrated caching, themes and drag and drop editing. Operating System: Windows, Linux, OS X.
242. Get Simple
Designed for smaller websites, Get Simple's claim to fame is its extreme ease of use. It boasts excellent security, an intuitive interface, customization capabilities, integrated site backups and more. Operating System: Linux.
243. Hippo CMS
Hippo was the only Java-based open source CMS in Gartner's Magic Quadrant for 2015. For marketers, it offers the ability to measure "which content matters to their audiences and deliver optimal and personalized digital experiences based on this insight." It comes in open source and enterprise editions as well as the Hippo OnDemand platform as a service. Operating System: OS Independent.
244. Joomla
Joomla provides the platform for millions of websites, and it has been downloaded more than 50 million times. Among its many users are companies like eBay, Barnes & Noble, MTV and Peugeot. Operating System: OS Independent
245. MediaWiki
Best known as the software Wikipedia uses, MediaWiki also powers sites for Baidu, Vistaprint, Novell, Intel and NASA. It's a good option for creating editable web pages, and many organizations use it for their internal knowledgebases. Operating System: Windows, Linux/Unix, OS X
246. Mezzanine
Built using the Django framework, Mezzanine offers features like an intuitive interface, scheduled publishing, WYSIWYG or in-line page editing, SEO features, ecommerce, dashboard widgets, and a blog engine. Visit the live demo on the website to see it in action. Operating System: OS Independent.
247. MindTouch
This project allows enterprises to set up self-service and knowledge-base websites where their customers can quickly find information they need. The cloud-based version is now the primary version of the software, but you can still download the open source MindTouch Core from SourceForge. Operating System: Windows, Linux.
248. Okatea
Modular, PHP-based Okatea describes itself as "a compromise between simplicity and flexibility." It boasts a modular architecture which allows users to add features for news, contact forms, etc. Operating System: OS Independent.
249. Orchard
This ASP.NET-based CMS is modular and highly extensible, making it easy to add blogs, photos and much more. The project has several international communities, and Microsoft staffers are involved in the project. Operating System: Windows.
250. Plone
Calling itself "the ultimate enterprise CMS," Plone offers integration with many other enterprise tools including Salesforce and other CRM solutions, Oracle, continuous integration tools, and many Web services. Paid support and other services are available through third-party providers. Operating System: Windows, Linux, OS X
251. Pimcore
This award-winning CMS claims to be "the industry’s first integrated open-source e-commerce & product information management platform for delivering rich and compelling e-commerce (B2C/B2B) experiences across all available channels." Paid support, training and add-ons can be purchased through the site. Operating System: Linux, Unix
252. SEOToaster
This solution claims to be "the most advanced SEO CMS and ecommerce website builder." It offers hub and spoke marketing technology that allows users to manage localized websites from a centralized interface. Operating System: OS Independent
253. SilverStripe
Created by developers in New Zealand, SilverStripe is both an open source CMS and a development firm that provides a variety of related services. SwipeStripe adds ecommerce capabilities to SilverStripe sites. Operating System: OS Independent
254. TikiWiki
TikiWiki calls itself "Groupware" because it offers features like blogs, forums, an image gallery, map server, bug tracker and feed reader to standard wiki capabilities. The software is completely open source, but support, consulting, hosting and other services are available through third-party partners listed on the site. Operating System: OS Independent
255. TWiki
As a structured wiki, TWiki combines the benefits of a wiki with the benefits of a database. It can be used for project management, document management, as a knowledge base, or to collaborate on virtually any type of content for intranets or the Internet. Operating System: OS Independent.
256. TYPO3
Installed more than 500,000 times, TYPO3 is a widely used enterprise CMS with multisite functionality, excellent scalability, granular permissions, multi-channel content publishing and more. Paid support is available. Operating System: Windows, Linux, Unix
257. WebGUI
Used by thousands of organizations, WebGUI offers capabilities like wikis, online surveys, news feeds, event management, message boards, shopping carts, blogs and more. Several third-party partners offer related services. Operating System: Windows, Linux/Unix, OS X
258. Wolf CMS
This newer CMS prides itself on being lightweight, fast and easy to use. However, note that it is easiest to use if you have some PHP coding skills. Operating System: Windows, Linux
259. XOOPS
Another very popular content management system, XOOPS has won several awards. It boasts a modular architecture, SEO features, excellent security, dynamic editing, email notifications and more. Operating System: OS Independent
260. XWiki
Most wiki software falls a little short when it comes to meeting marketing needs, but XWiki is a very full-featured platform with more advanced capabilities than most other open source projects of its kind. It supports blogging, reporting and the creation of simple Web applications, all of which can be useful to marketing teams. It comes in both a free and a paid enterprise version. Operating System: OS Independent.
261. Yellow
Yellow's claim to fame is its simplicity: "Just files and folders...Not much to learn." It's best for simple blogs and wiki-style websites. Operating System: OS Independent.
Customer Relationship Management (CRM)
262. CiviCRM
Similar to Orange Leap, CiviCRM was particularly designed for advocacy groups, NGOs and non-profit organizations with similar needs. It includes modules for case management, fundraising, event management, membership management, e-mail communications and marketing, and it integrates with both Drupal and Joomla. Operating System: OS Independent
263. ConcourseSuite
In addition to its features for sales and customer service teams, this Web-based CRM solution includes multiple features for marketers, including lead tracking, targeted email campaigns and survey capabilities. In addition to the open source version, a paid, supported version is also available. Operating System: Windows, Linux, OS X.
264. openCRX
This full-featured CRM solution combines groupware functionality (email, calendar and contacts) with sales force automation, marketing automation, customer service and analytics features. Paid support and custom development are available through partners. Operating System: OS Independent
265. SplendidCRM
SplendidCRM calls itself "the clear and obvious choice for companies that prefer Microsoft IIS to Apache and SQL Server to MySQL." It's a Windows-based CRM solution that comes in multiple cloud and on-premise versions. Operating System: Windows.
266. SugarCRM
Used by more than 1.5 million people in 120 countries, SugarCRM is an extremely popular, award-winning open source CRM solution. The website link above is primarily devoted to selling cloud-based subscriptions, but you can find the open source version at SugarCRM.com/download. Operating System: Windows, Linux, OS X.
267. vTiger
Downloaded more than four million times, vTiger comes in both cloud and open source versions. Key features include contact management, opportunity management, email marketing, forecasting, collaboration, workflow automation, reporting and mobile apps. Operating System: Windows, Linux, iOS, Android.
268. X2Contacts
This CRM tool gives marketers the ability to capture Web leads, track Website visits, draft and track emails, and automate and manage campaigns. It comes in a cloud-based version that runs on AWS or in an open source version. Operating System: Windows, Linux, OS X
269. Blazegraph
Formerly known as "Bigdata," Blazegraph is a highly scalable, high-performance database. It is available under an open source or a commercial license. Operating System: OS Independent.
270. BlinkDB
Still an alpha release, BlinkDB is a "massively parallel, approximate query engine for running interactive SQL queries on large volumes of data." In some tests it performed up to 200 times faster than Hive. Operating System: Windows, Linux
271. Cassandra
Created by Facebook, this NoSQL database counts Apple, CERN, Comcast, eBay, GitHub, GoDaddy, Hulu, Instagram, Intuit, Netflix, Reddit and other tech companies among its users. It supports extremely large data sets and boasts very fast performance and excellent durability and elasticity. Support is available through third parties. Operating System: OS Independent
272. CockroachDB
The team behind this project is working to create a database that is just as hard to kill as a cockroach is—in other words, it's extremely resilient. It also spreads like cockroaches—in other words, it's highly scalable. Operating System: Docker
273. CouchDB
Built for the Web, CouchDB is a NoSQL database that stores data in JSON documents which can be queried via HTTP and manipulated with JavaScript. Cloudant, which is now owned by IBM, offers a professionally supported version of the software, which is used by Samsung, Akamai, Expedia, Microsoft Game Studios and other companies. Operating system: Windows, Linux, OS X, Android
274. Drill
Apache Drill allows users to use SQL queries for non-relational data storage systems. It supports a range of NoSQL and cloud-based data storage systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage and Swift. It also allows users to search through multiple datasets stored with different technologies using a single query. In addition, it supports many popular BI tools. Operating System: Windows, Linux, OS X.
275. Firebird
This mature database has been around since 1981. According to its website, it offers "excellent concurrency, high performance, and powerful language support for stored procedures and triggers." Operating System: Windows, Linux, Unix, OS X, Solaris
276. FlockDB
Developed by Twitter, FlockDB is a very fast, very scalable graph database that is good at storing social networking data. While it is still available for download, the open source version of this project has not been updated in quite a while. Operating System: OS Independent.
277. GridGain
Powered by Apache Ignite, GridGrain offers in-memory data fabric for fast processing of big data and a Hadoop Accelerator based on the same technology. It comes in a paid enterprise version and a free community edition, which includes free basic support. Operating System: Windows, Linux, OS X.
278. HBase
Designed for very large tables with billions of rows and millions of columns, HBase is a distributed database that provides random real-time read/write access to big data. It is somewhat similar to Google's Bigtable, but built on top of Hadoop and HDFS. Operating System: OS Independent.
279. Hibari
This Erlang-based project describes itself as "a distributed, ordered key-value store with strong consistency guarantee." It was first developed by Gemini Mobile Technologies and is used by several telecommunications carriers in Europe and Asia. Operating System: OS Independent.
280. Hive
Apache Hive is the data warehouse for the Hadoop ecosystem. It allows users to query and manage big data using HiveQL, a language that is similar to SQL. Operating System: OS Independent.
281. Hustle
Hustle describes itself as "A column-oriented, embarrassingly distributed relational event database." Based on Disco, it's designed to offer extremely fast queries for very large data sets. Operating System: Linux.
282. Hypertable
Used by eBay, Baidu, Groupon, Yelp and many other Internet companies, Hypertable is a Hadoop-compatible big data database that promises fast performance. Commercial support is available. Operating System: Linux, OS X.
283. Impala
Cloudera claims that its SQL-based Impala database is "the leading open source analytic database for Apache Hadoop." It can be downloaded as a standalone product and is also part of Cloudera's commercial big data products. Operating System: Linux, OS X.
284. Infinispan
A Red Hat JBoss project, Java-based Infinispan is a distributed in-memory data grid. It can be used as a cache, as a high-performance NoSQL database, or to add clustering capabilities to frameworks. Operating System: OS Independent.
285. InfluxDB
InfluxDB is a "distributed time series database with no external dependencies." That makes it ideal for collecting data from IoT sensors; in fact, it can track data from tens of thousands of sensors sampling more than once per second. Operating System: Linux, OS X
286. Kexi
Part of the Calligra office productivity suite, Kexi is a visual database application creator similar to Access and Filemaker Pro. Note that it offers better support for Linux than for Windows or OS X. Operating System: Windows, Linux, OS X
287. LucidDB
The LucidDB website claims that this database is "the first and only open-source RDBMS purpose-built entirely for data warehousing and business intelligence." It is written partially in Java and partially in C++ in order to combine high performance with ease of development. Operating System: Windows, Linux, OS X
288. MongoDB
Used by Foursquare, Forbes, Pebble, Adobe, LinkedIn, eHarmony and others, MongoDB is a NoSQL database that claims to be "optimized for mission-critical deployments." Paid professional and enterprise versions are available. Operating system: Windows, Linux, OS X, Solaris
289. MySQL
Beloved by Web companies like YouTube, PayPal, Google, Facebook, Twitter, eBay, LinkedIn, Uber and Amazon, MySQL calls itself "the world's most popular open source database." It comes in multiple paid version as well as the free community version, and the latest update claims to be three times faster than its predecessor. Operating System: Windows, Linux, Unix, OS X
290. Neo4j
The self-proclaimed "world's leading graph database," Neo4J is used for fraud detection, recommendation engines, social networking, master data management and more. It users include eBay, Walmart, Cisco, HP, Accenture, CrunchBase, eHarmony, Care.com and many other organizations. Operating System: Windows, Linux
291. OrientDB
This multi-model database combines some of the capabilities of a graph database with some of the capabilities of a document database. Paid support, training and consulting are available. Operating system: OS Independent.
292. PostgreSQL
PostgreSQL calls itself "the world's most advanced open source database" and boasts more than 15 years of development. It has won multiple awards and offers excellent reliability and stability, even in high-volume environments. Operating System: Windows, Linux, OS X
293. Riak
"Full of great stuff," Riak comes in two versions: KV is the distributed NoSQL database, and S2 provides object storage for the cloud. It's available in open source or commercial editions, with add-ons for Spark, Redis and Solr. Operating System: Linux, OS X.
294. Realm
With an impressive roster of users that includes Google, Amazon, Starbucks, eBay, Budweiser, SAP, BC, Intel, Intuit, McDonalds, Walmart and IBM, Realm boasts that hundreds of millions of people rely on its mobile database. iOS, Android and Java versions are available for freel; enterprise versions are available for a fee. Operating System: Windows, Linux, OS X, iOS, Android
295. Redis
Now sponsored by Pivotal, Redis is a key-value cache and store. Paid support is available. Note that while the project doesn't officially support Windows, Microsoft has a Windows fork on GitHub. Operating System: Linux.
296. WebScaleSQL
Based on MySQL, WebScaleSQL is a collaboration among Facebook, Google, LinkedIn and Twitter. Their goal is to create a SQL database that can offer the performance, reliability and scalability that these large Web companies need. Operating System: Windows, Linux, OS X.
Data Destruction
297. BleachBit
BleachBit can securely delete files from a standalone system. In addition, it can clean up systems and improve performance by erasing cached files, temporary files, logs and other unnecessary data. Operating System: Windows, Linux
298. Darik's Boot And Nuke
Also known as DBAN, Darik's Boot And Nuke can completely eliminate all data from a hard drive. The open source version is designed for personal use, and a commercial version that can erase RAID arrays is available through Blancco, the project owner. Operating System: OS Independent
299. Eraser
For Windows only, Eraser deletes data from hard drives and overwrites it multiple times so that it cannot be recovered. It can destroy data on an entire drive or wipe out specified files and folders, and it includes a customizable scheduler. Operating System: Windows
300. FileKiller
Another secure deletion tool, FileKiller gives users the option of specifying how many times deleted data is overwritten. It promises fast performance. Operating System: Windows
Data Integration
301. Apatar
Apatar aims to make it easy to move data between on-premise and cloud-based applications, and it includes connectors for Salesforce.com, SugarCRM, and Goldmine CRM. It also comes in an on-demand version that integrates data from Salesforce.com and QuickBooks. Operating System: OS Independent.
302. Clover ETL
The Community Version of this extract, transform, load (ETL) tool can handle "modest" data transformation and ETL jobs. It also comes in paid Designer, Server Standard, Server Corporate and Server Cluster versions. Operating System: OS Independent.
303. DataCleaner
This enterprise-class tool can monitor your data, verify it against internal or external reference data, analyze your data quality, and find and merge duplicate entries. In additional to the free Community version, the company also offers commercially supported Professional and Enterprise editions. Operating System: OS Independent
304. GeoKettle
Based on Kettle/Pentaho Data Integration, GeoKettle incorporates geospatial capabilities from a variety of other open source tools. It is owned by Spatialytics, which offers commercial versions of the tools. Operating System: Windows, Linux, OS X
305. Karma
Developed at the University of Southern California, Karma can integrate data from databases, spreadsheets, delimited text files, XML, JSON, KML and Web APIs. It aims to be easy to use, and the website includes a number of videos showing its capabilities in action. Operating System: OS Independent.
306. KETL
KETL calls itself "a premier, open source ETL tool" and boasts that its features "successfully compete with major commercial products." Commercial support is available through Kinetic Networks. Operating System: Linux, Unix.
307. Kettle
Also known as Pentaho Data Integration, Kettle promises "powerful Extraction, Transformation, and Loading (ETL) capabilities, using a groundbreaking, metadata-driven approach." It features a drag-and-drop interface and scalable architecture. Operating System: Windows, Linux, OS X
308. MailArchiva
MailArchiva stores enterprise e-mail messages, allowing companies to meet compliance requirements, to search old messages quickly, to monitor content and to save on storage costs. The link above will connect you with the enterprise and ISP versions of the software; for the open source version, see SourceForge. Operating System: Windows, Linux.
309. Talend Open Studio
Talend is managed by a for-profit company rather than a foundation. As a result, paid support is available. Talend offers a mix of free and paid products. Its free, open source solution is called Talend Open Studio, and it has been downloaded more than 2 million times.
Market research firm Gartner recently named Talend a "Leader" in data integration. The company boasts that it can help enterprises analyze their big data five times faster and at one-fifth the cost compared to competing solutions. Operating System: Windows, Linux, OS X.
Data Loss Prevention
310. MyDLP
This robust DLP solution can "monitor, discover and prevent data leakage on your company network and endpoints." In addition to the free community version, it also comes in a supported enterprise version. Operating System: Windows, Linux, VMware.
Data Mining
311. DataMelt
The successor to jHepWork, DataMelt can do mathematical computation, data mining, statistical analysis and data visualization. It supports Java and related programming languages including Jython, Groovy, JRuby and Beanshell. Operating System: OS Independent.
312. KEEL
Short for "Knowledge Extraction based on Evolutionary Learning," KEEL is a Java-based machine learning tool that provides algorithms for a variety of big data tasks. It's also helpful for assessing the effectiveness of algorithms for regression, classification, clustering, pattern mining and similar tasks. Operating System: OS Independent.
313. Mahout
An Apache Foundation project, Mahout is an open source machine learning framework. According to its website, it offers three major features: a programming environment for building scalable algorithms, premade algorithms for tools like Spark and H2O, and a vector-math experimentation environment called Samsara. Companies using Mahout include Adobe, Accenture, Foursquare, Intel, LinkedIn, Twitter, Yahoo and many others. Professional support is available through third parties listed on the website. Operating System: OS Independent.
314. Orange
Orange believes data mining should be "fruitful and fun," whether you have years of experience or are just getting started in the discipline. It offers visual programming and Python scripting tools for data visualizations and analysis. Operating System: Windows, Linux, OS X.
315. RapidMiner
RapidMiner claims to be the "#1 open source data science platform," and Gartner named it a leader in its Magic Quadrant report for advanced analytics. It enables self-service predictive analytics and promises lightning-fast performance. Its users include BMW, Lufthansa, Domino's Pizza, Sony, Ford, Salesforce, Amnesty International and GE.
The complete RadiMiner Platform includes three separate pieces: RapidMiner Studio, RapidMiner Server and RapidMiner Radoop. All three are available under open source or commercial licenses, and commercial prices depend on the number of users. Operating System: OS Independent.
316. Rattle
Rattle stands for "R Analytical Tool To Learn Easily." It provides a graphical interface for the R programming language, simplifying the processes of creating statistical or visual summaries of data, creating models and performing data transformations. Operating System: Windows, Linux, OS X.
317. SPMF
SPMF now includes 93 algorithms for sequential pattern mining, association rule mining, itemset mining, sequential rule mining and clustering. It can be used on its own or incorporated into other Java-based programs. Operating System: OS Independent.
318. Weka
The Waikato Environment for Knowledge Analysis, or Weka, is a set Java-based machine-learning algorithms for data mining. It can perform data pre-processing, classification, regression, clustering, association rules and visualization. Operating System: Windows, Linux, OS X.
Data Visualization
319. Aperture JS
This project was designed to help produce visualizations of data to aid analysts and other decision makers. It's a JavaScript library that takes a "unified layer based approach to visualization assembly" in order to create rich, powerful graphics. Operating System: OS Independent.
320. D3.js
Short for Data-Driven Documents, D3 makes full use of newer Web standards to help users create interesting graphs and diagrams of their data. It grew out of the older Protovis project, and has been gaining more attention in the last couple of years. Operating System: Windows, Linux, Mac, iPad
De-duplication
321. Opendedup
Opendedup performs inline de-duplication to reduce storage utilization by up to 95 percent. It's available as an appliance for simplified setup and deployment. Operating System: Windows, Linux