July 2012 Board reports (see ReportingSchedule).

This report is CLOSED.


This is the sixth Incubator report since we started putting more effort into
overseeing our podlings and keeping better track of their progress towards
graduation.

Over that time we had 51 podlings reporting twice on normal schedule. Of those
podlings, one was stuck at IP clearance, four continued without a release,
seven had a low level of activity for the whole quarter, and another seven
were in various other stages without significant progress towards graduation.
Many of these cases are already resolved, and the remaining are being looked
at in more detail.

Over the same six-month time period we saw thirteen projects graduate,
three retire, and five enter the Incubator as new podlings. There are
currently 49 podlings in the Incubator.

o Community

  Jörn Kottmann and Rich Bowen joined the Incubator PMC since our last report.

  The Lucene.Net podling is currently preparing for graduation.

  The following proposal for a new incubating project was accepted:

  - Apache Allura

  A proposal for a new project called Blur was brought up for discussion.

  Various inactive podlings are being retired. We decided to retire
  the Kato podling as mentioned in last month's report. This month
  both the AWF and Kitty podlings are being considered for retirement
  due to inactivity. See the relevant cursory reports for details.

  The proposed resolution to establish the Apache Steve project was briefly
  discussed, and the general consensus was that there is no specific need
  for the project to first go through incubation as the codebase and the
  development community around it already exists within the ASF.

  The role of a Sponsor, i.e. an existing TLP that plans to take a podling
  up as a subproject upon graduation, came up in discussion related to
  the EasyAnt podling. Such a role reflects the concept of an umbrella
  project and is thus no longer that relevant. Whether to discontinue or
  redefine the Sponsor role remains to be decided.

o Releases

  The following incubating releases were made since our last report:

  - June 21st, 2012: Apache Airavata 0.3-incubating
  - June 22nd, 2012: Apache Wookie 0.11.0-incubating
  - June 25th, 2012: Apache Syncope 1.0.0-RC2-incubating
  - June 27th, 2012: Apache Kafka 0.7.1-incubating
  - July  9th, 2012: Apache DirectMemory 0.1-incubating
  - July 10th, 2012: Apache Stanbol Entityhub 0.10.0-incubating

  In addition the release of Apache Any23 0.7.0-incubating is imminent,
  as the release vote has just passed.

o Legal / Trademarks

  The instructions on checking project names for suitability as trademarks are
  a bit inconsistent and partially outdated. We're working on fixing that.

o Infrastructure

  Some of the recently graduated podlings have had trouble clearly communicating
  to the infrastructure team everything that's needed for migrating project
  infrastructure to a TLP. We'll need to find a way to better instruct podlings
  on this as they prepare to graduate.

-------------------- Summary of podling reports --------------------

Still getting started at the Incubator (4 podlings)

  Allura, CloudStack, Crunch, cTAKES

  These projects are still getting started, so no immediate progress
  towards graduation is yet expected.

Not yet ready to graduate (12 podlings)

  No release:    Any23, Celix, JSPWiki, VXQuery
  Low activity:  AWF, DeviceMap, EasyAnt, Kitty, ODF Toolkit
  Low diversity: Chukwa, Mesos, Tashi

  We expect the next quarterly report of projects in this category to
  include a summary of their actions and progress in solving these issues.

Ready to graduate (3 podlings)

  DirectMemory, Kafka, Oozie

  We expect these projects to graduate within the next quarter.

--------------------
Allura

The Allura Project is forge software for the development of software projects,
including source control systems, issue tracking, discussion, wiki, and other
software project management tools. Allura entered incubation on 2012-06-25.

Allura only got its mailing lists a couple of hours ago. There is nothing
more to report yet.

Signed off by mentor:

--------------------
Any23

Anything To Triples (any23) is a library, a web service and a command line
tool that extracts structured data in RDF format from a variety of Web
documents.

Any23 was voted into the Incubator by the IPMC on October 1, 2011.

Three most important steps moving towards graduation

  - Grow the Any23 PPMC and community
  - Ensure at least one Any23 release
  - Identify and liase with other semantic web/linked data projects within
    the ASF to establish common goals and objectives.

The Any23 PPMC was delighted to extend an invitation to Peter Ansell during
June. The invitation was subsequently accepted and we are happy to say that
Peter is now on board.

During June/July the community has focused on constructing the 0.7.0-incubating
release candidate. We have experienced several problems along the way which
have delayed this significantly, however as of writing the community is now
VOTE'ing on the 0.7.0-incubating (release candidate #2) release thanks to
Simone Tripodi, we anticipate a release shortly. Since last reporting we have
integrated a number of commits to the project from a range of people. The
total number of commits sitting at >1174 since Any23 entered the Apache
Incubator.

The website at http://incubator.apache.org/any23/ is also up and functioning,
and appears to be in line with the Apache branding requirements.

Lewis John McGibbney worked with ASF infrastructure to get a zone VM up and
running to host the Any23 web service.

The community has again been in touch with members from Stanbol, Jena and
Clerezza and we have already discussed a committer meetup and liason with
the aforementioned sister projects at the forthcoming ApacheCon Europe.

Signed off by mentor: mattmann
Shepherd: Dave Fisher

--------------------
AWF

AWF is a non-blocking, asynchronous, event driven high performance web
framework running on the JVM. AWF, originally named Deft, entered incubation
on 2011-07-08.

  - AWF is about to retire from the Apache incubation
  - No community development since last report
  - No commits since last report.

A vote to retire the AWF podling is in progress.

Signed off by mentor:
Shepherd: Jukka Zitting

--------------------
Celix

Celix is an implementation of the OSGi Specification in C.

Celix entered incubation on November 2, 2010.

Over the last months we have started working on "Native-OSGi" which is an
effort to make a specification for a C and C++ bases OSGi implementation. As
detailed in the graduation plan [1] in the previous report we hope to attract
more community members with this move. Details for Native-OSGi can be found
at [2].

Also the donation of the Device Access code has been finished and committed
to the SVN repository. This has been a good experience from both a technical
and a process point of view.

Furthermore there has been a slight increase on the mailing list, hopefully
this is a trend that will continue in the coming months.

Most important issues to address before we can graduate:

  Make a first release and grow a larger community. It is also needed to
  attract some more committers, there are currently 2.

Any issues the Incubator PMC or ASF board not te be aware of:

  None at this time

How has the community developed since the last report:

  There has been an increase on the mailing list, furthermore with the start
  of the Native-OSGi project we hope this will increase in the coming months.

How has the project developed since the last report:

  From a technical point of view, we are working towards a first release.
  The build files have been updated to support a more modular project
  structure. Since the implementation is moving forward (al be it slowly),
  the community growth is the biggest concern for Celix.

[1]: http://incubator.apache.org/celix/community/boardreports/boardreports.html#2012-04
[2]: https://github.com/abroekhuis/NativeOSGi

Signed off by mentor: marrs
Shepherd: Matt Franklin

--------------------
Chukwa

Chukwa is an open source data collection system for monitoring large
distributed systems. Chukwa is built on top of the Hadoop Distributed
File System (HDFS), HBase and Map/Reduce framework and inherits Hadoop’s
scalability and robustness. Chukwa also includes a ?exible and powerful
toolkit for displaying, monitoring and analyzing results to make the best
use of the collected data.

  - Mailing list is picking up some activities.
  - 3 new contributors submitted patches since last report in April.

Most important issues to address:

  Growing the community, especially attracting new developers

Any issues that the Incubator PMC or ASF Board might wish/need to be aware of:

  None at this time.

How has the community developed since the last report:

  Plan to release version 0.6 to fix some of the major bugs in version 0.5.

How has the project developed since the last report:

  Prior to Hadoop summit, there was a thread discussion of retiring Chukwa
  because there does not seem to be much activity in the community.  All PPMC
  members seem to agree to this idea.  There were some free advertisement for
  Chukwa in Hadoop summit from word of mouth and speakers mentioned Chukwa in
  their talks.  There seems to be some contribution activities happening
  after Hadoop summit.  Discussion continued for retiring Chukwa because
  mentor does not think a community is developing.  We plan to open enrollment
  for Chukwa committers for people who are interested to continue development
  for the next quarter.  If activities generate a active community, then we
  will apply for becoming top level project.  If no activities happen, then
  we will retire Chukwa.

Signed off by mentor:
Shepherd: Benson Margulies

--------------------
CloudStack

CloudStack is an IaaS (“Infrastracture as a Service”) cloud orchestration
platform. CloudStack has been in incubation since 2012-04-16

The top 3 issues to address to move towards graduation:

  - CloudStack's source still contains works that are prohibited for release
    under ASF guidelines
  - Performing a release
  - Migration of infrastructure (bug tracker, CI, websites, etc)

How has the community developed since the last report:

  As of the time of this report at least one new committer has been added.
  Additionally, a number of previously unknown developers have begun making
  substantial quantities of bugfixes and even working on new functionality.

How has the project developed since the last report:

  The project itself continues to deal with process questions as well as
  learning how to operate in the new environment. A good deal of work has
  been happening on resolving the problems in the code base around licensing,
  though not currently close to finishing.

Signed off by mentor:

--------------------
Crunch

Crunch is a Java library for writing, testing, and running pipelines of
MapReduce jobs on Apache Hadoop.

Crunch entered incubation on May 27, 2012.

The most important steps towards graduation:

  - Infrastructure setup (JIRA, Confluence, etc.)
  - CCLA licensing of the existing Crunch code
  - Adding new contributors
  - Creating a release

Nothing that currently requires IPMC attention.

Community:

  The developer mailing list has been very active with bug fixes, new
  features, and discussions of infrastructure setup and project policies,
  both from the existing committers and other developers with an interest in
  the project. The first patch from a non-committer is currently being
  prepared for submission: the code is written, but we were blocking on
  getting JIRA setup so that the copyright on the code could cleanly be
  assigned to the ASF. The JIRA issues were resolved earlier this week.

  All ICLAs are in place. Cloudera has gathered all of the copyright
  assignments for the existing Crunch code from non-Cloudera developers
  and is preparing the CCLA to assign the copyrights on the existing Crunch
  code to the ASF.

Development:

  The 15 commits on the project this month were primarily for documentation
  and bug fixes, although we are evaluating two larger patches that bring
  additional functionality to the library: 1) adding map-side joins and
  2) supporting interactive pipeline creation and execution via the Scala REPL.

Signed off by mentor: phunt, tomwhite

--------------------
cTAKES

cTAKES (clinical Text Analysis and Knowledge Extraction System) is a natural
language processing (NLP) tool for information extraction from electronic
medical record clinical free-text.

cTAKES was voted into the Incubator by the IPMC on Monday, June 11, 2012.

Three most important steps moving towards graduation

  - Attract new contributors
  - Make at least one cTAKES release
  - Get everyone's ICLA on file and start developing code at Apache and using
    the infrastructure

Anything required IPMC attention?

  No

Community:

  We are still wrapping up getting a few ICLAs for Guergana Savova and for
  Sean Finan. So far, little mailing list discussion, but the project is
  getting bootstrapped. The next step will be to start developing on list
  and getting the project going.

  Jörn Kottmann who originally volunteered to Champion the project, was
  elected to the IPMC and can now officially be our Champion. Congrats Jörn!

  The light discussion on the mailing list is because there will be another
  release on the old sourceforge site before the migration to the apache svn
  and name space updates.

Development:

  So far, nothing much other than getting mailing lists set up, and other
  infrastructure tasks that can be tracked at:
  https://issues.apache.org/jira/browse/INFRA-4910

Signed off by mentor: mattmann

--------------------
DeviceMap

Apache DeviceMap is a data repository containing device information, images
and other relevant information for all sorts of mobile devices, e.g.
smartphones and tablets.

Entered incubation on January 3rd, 2012.

There are no issues that require the Incubator PMC's or the board's attention.

Little has happened since our last report in April, a prototype of client-side
device probes has been created [1], a few messages have been exchanged about
that but that's all.

At this point the future of the project is unclear, we should re-evaluate the
situation for the next report in October to see if activity has picked up.

[1] http://markmail.org/message/3bd63yqmuixn6co6

Signed off by mentor: bdelacretaz, kevan
Shepherd: Matt Hogstrom

--------------------
DirectMemory

(incubating since October 2011)

Apache DirectMemory is a multi layered cache implementation featuring
off-heap memory management (a-la BigMemory) to enable efficient handling
of a large number of java objects without affecting JVM garbage collection
performance.

There is only one important issue to address in the move towards graduation

  Understanding process/decision making guidelines (new committer process
  is undergoing testing, release process still yet to be worked out)

Any issues that the Incubator PMC or ASF Board might wish/need to be aware of

  None

How has the community developed since the last report

  Tasks and proposals contributed by non-committers

How has the project developed since the last report.

  - A first public release 0.1-incubating has been done.

  - A talk of DirectMemory has been presented at DevoXX France (thanks to
    Olivier Lamy and Benoit Perroud). Slides (in French) are available at
    http://www.slideshare.net/benoitperroud/direct-memory-3devoxxfr201220418-12607286

  - A talk of DirectMemory has been presented (thanks to Benoit Perroud) at
    Java User Group Lausanne. Slides are available at
    http://www.slideshare.net/benoitperroud/direct-memory-jugl20120308-12607297

  - A talk of Direct has been presented at BreizhCamp (thanks to OlivierLamy).
    Slides are available at http://www.slideshare.net/olamy/heapoff-wtf

Signed off by mentor: Olivier Lamy
Shepherd: Benson Margulies

--------------------
EasyAnt

EasyAnt is a build system based on Apache Ant and Apache Ivy.
Incubating since 2011-01-31.

Towards graduation, we need to:

  - Create a release
  - Build a community

Since the last report:

  There was very low activity on the project. There was actually no commit on
  the project. Some work has been done though a bug was found on a dependency,
  Apache Ant, and a bug was reported, and a patch committed since one of the
  EasyAnt committer happens to have commit rights there.

  After acknowledging this, a mail was sent on the dev mailing to remotivate
  people to focus on doing a release. A positive response and some commits
  were following.

Signed off by mentor: bodewig
Shepherd: Dave Fisher

--------------------
JSPWiki

JSPWiki has been incubating since September 2007.

JSPWiki is a JSP-based wiki program.

Still following the objetive of making the first Apache release:

  4 JIRA issues were fixed since last period, the project website has been
  revamped, and few other niceties & fixes have got into trunk (for example
  lucene library was upgraded to 3.6.0).

  2 Release candidates have been made, but they didn't succeed because of
  technical issues. Hopefully the third RC will become the first ASF release.

The main issues blocking graduation keep being the same as in the last report:
resolution acceptance, community and IPMC graduation vote.

Signed off by mentor:
Shepherd: Jukka Zitting

--------------------
Kafka

(introduced to Apache incubator on Jul 4, 2011)

Kafka provides an extremely high throughput distributed publish/subscribe
messaging system. Additionally, it supports relatively long term persistence
of messages to support a wide variety of consumers, partitioning of the
message stream across servers and consumers, and functionality for loading
data into Apache Hadoop for offline, batch processing.

Three most important issues to address in the move towards graduation:

  None.

  Previously, there was some concern about the diversity of the project
  that we believe has been addressed:

  - The project now has active committers from four companies
  - and continues to receive patches from other contributors from various
    institutions.

  We are considering whether to pursue graduation immediately or focus
  exclusively on the development of 0.8.

Any issues that the Incubator PMC or ASF Board might wish/need to be aware of:

  None.

How has the community developed since the last report:

  - The mailing list continues to be very active with bug reports, patch
    submissions, feature requests and use cases.
    [(Apr,May,Jun): kafka-users (151, 261, 251); kafka-dev (142, 195, 380)].

  - Added two new committers.

  - Conducted the first Kafka user group meeting with over 50 attendees
    (archived video link available here: https://cwiki.apache.org/confluence/display/KAFKA/Kafka+papers+and+presentations).
    This was a very successful event that gave users and contributors an
    opportunity to meet and discuss Kafka usage at various companies, ongoing
    development efforts, feature requests, etc.

  - Received and reviewed several major patches (including contributions from
    non-committers).

How has the project developed since the last report:

  - Intra-cluster replication (KAFKA-50)
    - Considerable progress has been made on the design and implementation of
      this much-anticipated feature.
    - Regular status updates are being sent out on the kafka-dev mailing list.

  - Another significant project that is under discussion and development is
    the consumer redesign (KAFKA-364) and consumer coordinator implementation
    (KAFKA-264).

  - Released Kafka 0.7.1 in June. This is the second Kafka release since its
    introduction to Apache incubator

Signed off by mentor: Alan Cabrera
Shepherd: Matt Franklin

--------------------
Kitty

Kitty is a lightweight, production focused, Java-based application server
performance diagnostic and management utility. Kitty entered incubation on
October 3rd, 2010.

A vote to retire the Kitty podling due to inactivity is in progress.

Signed off by mentor:
Shepherd: Mohammad Nour

--------------------
Mesos

Mesos is a scalable cluster manager that can dynamically share resources
between multiple computing frameworks, including Hadoop, Spark, and MPI.

Mesos entered incubation on December 23, 2010.

Progress since the last report:

  - Released 0.9.0!
  - Added support for Hadoop CDH3u3 and updated the tutorial.
  - Revamped MPI framework support (thanks Harvey Feng!).
  - New JS based web interface (thanks Jon Fuchs!).
  - Initial cgroups abstractions/interfaces for better resource isolation
    on Linux (thanks Jie Yu!).
  - Refactor of allocator for more sophisticated allocations
    (thanks Thomas Marshall!).
  - Numerous bug fixes (FD_CLOEXEC, JSON escaping, killtree.sh issues, etc).

Top priorities prior to graduation:

  - Same as before, finalize addition of new committers to the project!
  - Do another release.

Issues for Incubator PMC or ASF Board:

  - None at this time.

Signed off by mentor: tomwhite
Shepherd: Benson Margulies

--------------------
ODF Toolkit

The ODF Toolkit is a set of Java modules that allow programmatic creation,
scanning and manipulation of OpenDocument Format (ISO/IEC 26300 == ODF)
documents. Unlike other approaches which rely on runtime manipulation of
heavy-weight editors via an automation interface, the ODF Toolkit is
lightweight and ideal for server use.

ODF Toolkit entered incubation on Aug 1st, 2011.
Our last release was Januart 14th, 2012.
We voted in our most recent committer on November 16th, 2011.

Most important issues to address:

  Growing the community, especially attracting new developers

Any issues that the Incubator PMC or ASF Board might wish/need to be aware of:

  None at this time.

How has the community developed since the last report:

  - We are mentoring two GSoC students on ODF- Toolkit related projects

  - One of our leading developers has had a job change and is no longer
    involved with the project

  - Considering content-related track at ApacheCon, perhaps with other
    related projects like OpenOffice and POI

How has the project developed since the last report:

  Working on next release, to feature the new document encryption support

Signed off by mentor:
Shepherd: Ross Gardler

--------------------
Oozie

Oozie is a workflow management and scheduler primarily for Hadoop based jobs.

Oozie entered the incubation on July 11, 2011.

The three most important issues to address in the move towards graduation:

  - Improve the documentations:  user and development for quicker adoption
  - Automate the formal release process.
  - Diversify the dev and committer base.

Any issues that the Incubator PMC or ASF Board might wish/need to be aware of:

  No issues.

How has the community developed since the last report:

  - 88 JIRAs were resolved and 145 JIRAs were closed in last quarter.
  - 103 JIRAs were created since last report.
  - Oozie dev/users are active in the email lists (around 1000 emails in
    oozie-dev and 250+ in oozie-user).
  - Around 6 new developers are contributing to the process and coding.

How has the project developed since the last report.

  - The second Oozie release (3.2.0) from Apache incubator has been
    successfully accomplished.

  - A lot of new features are added. Fixed a lot of bugs too. Nearly 230
    JIRAs were closed/resolved in last three months.

  - Oozie contribution process in now more stream-lined. Further improvement
    is going on.

Signed off by mentor: ddas
Shepherd: Matt Hogstrom

--------------------
Tashi

Tashi has been incubating since September 2008.

The Tashi project aims to build a software infrastructure for cloud
computing on massive internet-scale datasets (what we call Big Data).
The idea is to build a cluster management system that enables the Big
Data that are stored in a cluster/data center to be accessed, shared,
manipulated, and computed on by remote users in a convenient, efficient,
and safe manner.

Tashi originally encompassed just the tools to manage virtual machines
using Xen and QEMU, but has been merged with Zoni, which manages the
physical aspects of a cluster like power control, network settings and
handing out physical machines.

In the period from April to July, the project did not ask to make
another incubating release, but is ready to start the process for a new
release incorporating the development efforts of this period.

Development efforts this period have included making user actions
display assurance messages via the client in case of successful
operations, extending the SQL database backend to support all Instance
and Host fields that are already recorded via the alternative "pickled"
backend.

The primitive scheduler gained additional resilience to refrain from
scheduling load on hosts that are down transiently. The node manager
service now tries to ensure that undelivered messages to the cluster
manager are resubmitted regularly. Virtual machine migration was revised
to ensure stale state wasn't being shadowed by new data, only to
reappear when the migrated VM was shut down.

The code underwent a complete automatic analysis, fixing several issues.
Furthermore, a few other minor additions, fixes and documentation
updates were made.

The project has received code contributions from two non-committers in
this period. MIMOS via Luke Jing Yuan have contributed "convertz" to the
code base, a utility to convert a VM image to an image deployable to a
physical machine provisioned by Zoni. Alexey Tumanov of CMU provided a
communications timeout wrapper to handle the problem of threads hanging
forever, trying to communicate over a broken network connection.

Upcoming software goals are to separate the client into an
administrative and a user interface, to investigate what is needed to
support IPv6, replace RPyC, and to provide the ability to hand out
server slices (operating system level virtualization). Besides CPU and
memory, disk storage should also be a schedulable resource.

The project has a user community, but it is small. Growth mostly has
happened by word of mouth. To show potential users at large the utility
of this project, the author of this report is creating web pages to
demonstrate how to accomplish distributed computing tasks. Base images
of (free) OS installs will be provided to allow new users to get started
quickly. Hopefully this will increase visibility of the project.

Items to be resolved before graduation:

  - Generate more publicity for the project.
  - Develop members of the user community to submit feature extensions.

Signed off by mentor:
Shepherd: Jukka Zitting

--------------------
VXQuery

The VXQuery Project implements a standard compliant XML Query
processor for parallel evaluation.
It has been in incubation since 2009-07-06.

VXQuery changed its focus from being a pure library implementation
of XQuery in Java to being an engine for parallel XQuery evaluation.
For the runtime VXQuery now leverages the Hyracks system which is
an Apache-licensed parallel execution engine developed at UCI.

So far the top issues were

  1) low activity
  2) small community
  3) create a release.

The activities on these issues are:

  ad 1)
  Since this change of focus - and the start of a GSoC project with
  this new focus - development activity has picked up significantly.
  This due to both activity of the GSoC student (Preston Carman) and
  of the existing committers.

  ad 2)
  Preston Carman a) has provided good patches for VXQuery and
  b) interacts very well with the existing community.
  Consequently he is a candidate to be the next committer for VXQuery.
  After the initial vote on vxquery-private failed (all committers
  voted +1, but there was no mentor/IPMC member vote), Jukka Zitting
  voted +1. The next step is to finalize the administrative part to
  make Preston a committer.

  Further, there is a plan to publish a  paper to show the projects
  utility and to increase the visibility of the incubation effort.

  ad 3)
  After the change of focus some things work, but not enough to
  build a release that is easily consumable. VXQuery should be
  released before a paper is published to enable readers or the
  paper to run the software.

Signed off by mentor:
Shepherd: Matt Franklin

--------------------

July2012 (last edited 2012-07-16 10:48:24 by JukkaZitting)