10869
Comment: added detailed explanation for pynac optimization project
|
14528
|
Deletions are marked like this. | Additions are marked like this. |
Line 26: | Line 26: |
and has been used daily by thousands of people over the last year. It's one of the main ''killer features'' of Sage. This project is about improving the notebook. |
and has been used daily by thousands of people over the last year. A number of universities use Sage notebook servers for significant student use. It's one of the main ''killer features'' of Sage. This project is about improving the notebook. |
Line 31: | Line 32: |
* Authentication backend (ldap, kerberos?) * Improvements to interact * Add support for making presentations * Enhance export capabilities create methods for well designed PDF, LaTeX (with or without SageTeX) or ODF output. * master-notebook, collection of other notebooks for a script or book. |
* Improvements to [[interact|@interact]] * a 2d locator widget (there is already a separate bounty for this, around $100 USD or so, I think) * flexible layout of controls, and interacts within interacts * master-worksheet, collection of other worksheets for a script or book. |
Line 40: | Line 40: |
* basic datasheets (simple Google Docs-like spreadsheets) that can be shared with worksheets in read-only or read-write mode, could be editor to SQLite tables. * content-editable divs for the code cells, so we can support javascript widgets for inputting code (for example, we could have a slider right in the code cell, representing a numeric value, or a javascript graph editor representing a graph, or a wysiwyg formula editor, or color syntax highlighting) * support various types of text cells, so that, at the user's option, we could have a ReST text cell, an HTML (TinyMCE) text cell, a plain text cell, a latex text cell, etc. |
|
Line 41: | Line 44: |
==== Authentication backend ==== (ldap, kerberos? [[http://trac.sagemath.org/sage_trac/ticket/4309|#4309]]) ==== Slideshow mode ==== This project would involve improving the slideshow mode for Sage worksheets, which does not work very well right now: [[http://trac.sagemath.org/sage_trac/ticket/6342|#6342]]. Since Sage worksheets run in any modern web browser and already support nice jsmath typesetting, TinyMCE text editing, and of course computation, an attractive and usable slideshow mode would be extremely useful for anyone giving a presentation that involves some kind of computing. ==== Export to something printable ==== Enhance export capabilities of Sage worksheets: create methods for well designed PDF, LaTeX (with or without SageTeX) or ODF output. This would be an inverse of sorts of Rob Beezer's [[http://bitbucket.org/rbeezer/tex2sws/|tex2sws]] work, which takes TeX files and converts them into Sage worksheets. ==== Community Tools ==== Related to enhanced publishing and export abilities, we would like the Sage notebook to have better community/social tools. Projects for this could involve: * Enhance publishing of Notebook documents (i.e. like on [[http://www.sagenb.org/pub]]). * Wiki-like platform for editing notebooks for publishing mathematical, physical, statistical and other content. * tagging support, listings by tags * efficiently exchange usage examples, tips and ideas. * a repository of useful examples, interacts, etc., distributed with Sage and hosted on a website, that users can query and easily use/adapt. |
|
Line 55: | Line 79: |
involved going through source code and translating each string | these translations involved going through source code and translating each string |
Line 82: | Line 106: |
=== Community Tools === * Enhance publishing of Notebook documents (i.e. like on [[http://www.sagenb.org/pub]]). * Wiki-like platform for editing notebooks for publishing mathematical, physical, statistical and other content. * tagging support, listings by tags * efficiently exchange usage examples, tips and ideas. |
Potential mentor: DanDrake |
Line 90: | Line 111: |
Make it easier to call Sage from other applications. |
|
Line 93: | Line 112: |
* libGAP | Make it easier to call Sage from other applications. * libGAP [[http://trac.sagemath.org/sage_trac/ticket/6391|#6391]] |
Line 101: | Line 122: |
Another possible thing to look at is [[http://lipforge.ens-lyon.fr/www/crlibm/index.html]]. |
|
Line 137: | Line 160: |
* automated doctesting for tickets marked "needs review" | The Sage development process ([[http://www.sagemath.org/doc/developer/index.html|detailed here) involves posting patches on our [[http://sagetrac.org|trac server]], getting them reviewed, and then merged into our source code. Then the code is compiled and tested. Right now, many of these steps are done manually and could be automated and improved. Possible ideas for projects in this area would involve: |
Line 139: | Line 163: |
* set up a code review system like [[http://code.google.com/p/rietveld/|Rietveld]]. | |
Line 142: | Line 167: |
Sage is mostly "native" to Linux and OS X. A port to 32-bit Solaris SPARC is mostly complete, and a Cygwin port is underway (it may be working by this summer). But porting to other systems is very useful, both to increase our potential user base, and to help eliminate bugs (see the section "Computational Technique; the Pentium Flaw" in [[http://www.trnicely.net/twins/twins.html|this paper]]). This project would involve assisting our ports to the following platforms: |
|
Line 147: | Line 174: |
* Improvements to the build system? | It could also involve improvements to the build system -- can we use the same system on different platforms, including Windows? Skills required: knowledge of porting software to new compilers and platforms. Another idea in this vein would be setting up a [[https://help.launchpad.net/Packaging/PPA|Launchpad PPA]] for Sage. The Ubuntu packages for Sage are hopelessly out of date, and because of the large and heterogeneous nature of Sage, packaging for various Linux distributions is quite difficult. Having Ubuntu packages available would be very useful and possibly help us get Sage packaged for other distributions. |
Google Summer of Code 2010
This is the main organization page for the Google Summer of Code efforts of the Sage project.
If you're a student interested in working on any of the projects described below, or if you want to suggest any other projects, please write a short message to [email protected] introducing yourself (background, skills) and your interests.
If you're a Sage developer, please take some time to organize the list below and add more ideas. The notes section contains some guidelines from the GSOC FAQ.
Important Dates
Here is the original timeline. Some highlights:
March 8 - 12: application window
March 18: accepted mentoring organizations announced
March 29 - April 9: student application window
- ...
GSoC Sage Projects
All #numbers below refer to trac tickets.
Notebook
The Sage notebook is an AJAX application similar to Google Documents that provides functionality for all mathematical software somewhat like Mathematica notebooks. It was written from scratch (in Javascript and Python) by the Sage development team, and has been used daily by thousands of people over the last year. A number of universities use Sage notebook servers for significant student use. It's one of the main killer features of Sage.
This project is about improving the notebook. No special mathematical knowledge is required. Knowledge of Javascript, jQuery, Python, and general AJAX techniques is needed.
Improvements to @interact
- a 2d locator widget (there is already a separate bounty for this, around $100 USD or so, I think)
- flexible layout of controls, and interacts within interacts
- master-worksheet, collection of other worksheets for a script or book.
- Permanent hyperlinks between worksheets, independent of worksheet numbering, to support multi-worksheet documents (ie books)
- enhance history and snapshot capabilities.
- concurrent editing of one single document: only altered cells are updated and "collision" warnings issued if more than one change happens with appropriate methods to solve it
- read/write permission management for groups with roles (teacher is able to read notebooks, but students are not able to read each others)
- basic datasheets (simple Google Docs-like spreadsheets) that can be shared with worksheets in read-only or read-write mode, could be editor to SQLite tables.
- content-editable divs for the code cells, so we can support javascript widgets for inputting code (for example, we could have a slider right in the code cell, representing a numeric value, or a javascript graph editor representing a graph, or a wysiwyg formula editor, or color syntax highlighting)
- support various types of text cells, so that, at the user's option, we could have a ReST text cell, an HTML (TinyMCE) text cell, a plain text cell, a latex text cell, etc.
- ...
Authentication backend
(ldap, kerberos? #4309)
Slideshow mode
This project would involve improving the slideshow mode for Sage worksheets, which does not work very well right now: #6342. Since Sage worksheets run in any modern web browser and already support nice jsmath typesetting, TinyMCE text editing, and of course computation, an attractive and usable slideshow mode would be extremely useful for anyone giving a presentation that involves some kind of computing.
Export to something printable
Enhance export capabilities of Sage worksheets: create methods for well designed PDF, LaTeX (with or without SageTeX) or ODF output. This would be an inverse of sorts of Rob Beezer's tex2sws work, which takes TeX files and converts them into Sage worksheets.
Community Tools
Related to enhanced publishing and export abilities, we would like the Sage notebook to have better community/social tools. Projects for this could involve:
Enhance publishing of Notebook documents (i.e. like on http://www.sagenb.org/pub).
- Wiki-like platform for editing notebooks for publishing mathematical, physical, statistical and other content.
- tagging support, listings by tags
- efficiently exchange usage examples, tips and ideas.
- a repository of useful examples, interacts, etc., distributed with Sage and hosted on a website, that users can query and easily use/adapt.
Internationalization of the notebook
This project would involve changing the Sage notebook so that the user interface language can be translated and changed on the fly. This project will require knowledge of Python, Mercurial, and basic web coding; knowing the GNU gettext utilities, Javascript, and the Jinja web templating system will be helpful. No knowledge of (human!) languages other than English is necessary.
Currently, the user interface for the Sage notebook is all in English. Several one-off translations have been done (Korean; Russian) these translations involved going through source code and translating each string individually. The goal of the Sage project is to produce a viable alternative to Maple, Mathematica, Magma, and Matlab; having the user interface available in non-English languages would have a tremendous impact on that goal and vastly increase the number of people who can benefit from Sage.
Proper internationalization (i18n) involves wrapping each string in a function that looks up the correct translation, depending on the current language selected.
Deliverables for this project would include:
- wrapping strings in the Python, Javascript, and templating code of the Sage notebook with appropriate translation calls;
- updating the Sage notebook so that the language can be set to a default language (so that, say, a French site could have everything default to French) and can be changed on a per-user basis (so each user can choose a preferred language);
- developing a workflow for adding new translations and updating existing translations when strings change.
It would also be nice to work on support for more significant localization, perhaps using the Python Babel tools; this would include more thorough localization abilities, such as proper pluralization, thousands/decimal separators, ordinals, date and time display, and so on.
This project will not involve any actual translation, just making it possible for the Sage notebook UI to be localized. This is probably a medium-difficulty project, and will not require any specialized knowledge of mathematics or mathematical programming.
Potential mentor: DanDrake
Interfaces to Sage
- Sage as a C library
- Make it easier to call Sage from other applications.
libGAP #6391
Portable C99 libm
Sage relies on a fairly complete C99 libm. In particular, it expects the "long double" and "complex" variants of most functions to be present. Not all these functions are present on Cygwin, FreeBSD or older Solaris, causing porting problems on those platforms. The objective of this task would be to either locate and port or write a libm that is sufficient to meet Sage's requirements.
One possible option would be to use glibc and only compile the libm bits. (Thought glibc is a bit dodgy on the precision side in some areas).
Another possible thing to look at is http://lipforge.ens-lyon.fr/www/crlibm/index.html.
pynac (optimizing data structures)
As the symbolics backend, Pynac is a fundamental component of Sage. With some work and optimization, it could also be used for arithmetic with other mathematical structures like generic polynomial rings. It's based on a solid library GiNaC, which has great documentation and very readable code.
Skills: C++, (the necessary Cython and Python can be picked up easily)
This project would have two steps, the first would be a major optimization for Pynac also a good introduction to the library, coding conventions, type hierarchy, etc. The second would involve replacing the basic datatypes (vectors with heaps), lot's of timings and experiments to improve performance.
- allow different printing orders
- GiNaC uses the Maple approach to printing symbolic expressions. Variable orders depend on their creation order at runtime. This is not acceptable for Sage, so we (=Burcin) added some code to do an approximation to a graded lexicographic order. This is the wrong design, and it slowed things down considerably. Doing it the right way, and allowing different orders would also let us use Pynac for other structures in Sage.
- consider heaps instead of vectors for storage of add and mul objects
Data structures for polynomials is a well studied topic. Geobuckets (used by Singular, or hash tables have been used successfully until now. Recently heaps, based on work by Stephen Johnson in the 70s, were shown to perform much better than the alternatives.
Pynac uses vectors to keep add and mul objects. Here is a detailed explanation of the data structure. Using heaps could lead to a much more efficient data structure allowing us to handle much larger expressions. This would mean a major restructuring of the basic data types in Pynac.
(Geobuckets) http://dx.doi.org/10.1006/jsco.1997.0176
(Johnson's paper) http://doi.acm.org/10.1145/1086837.1086847
Development Process
The Sage development process (detailed here) involves posting patches on our [[http://sagetrac.org, getting them reviewed, and then merged into our source code. Then the code is compiled and tested. Right now, many of these steps are done manually and could be automated and improved. Possible ideas for projects in this area would involve:
- setting up a buildbot with trac integration for tickets with patches (list failing doctests, ...)
set up a code review system like Rietveld.
- ...
Porting
Sage is mostly "native" to Linux and OS X. A port to 32-bit Solaris SPARC is mostly complete, and a Cygwin port is underway (it may be working by this summer). But porting to other systems is very useful, both to increase our potential user base, and to help eliminate bugs (see the section "Computational Technique; the Pentium Flaw" in this paper). This project would involve assisting our ports to the following platforms:
- Cygwin
- FreeBSD
- Solaris 10 64-bit (the 32-bit version is basically complete).
- Open Solaris on x64 hardware.
It could also involve improvements to the build system -- can we use the same system on different platforms, including Windows? Skills required: knowledge of porting software to new compilers and platforms.
Another idea in this vein would be setting up a Launchpad PPA for Sage. The Ubuntu packages for Sage are hopelessly out of date, and because of the large and heterogeneous nature of Sage, packaging for various Linux distributions is quite difficult. Having Ubuntu packages available would be very useful and possibly help us get Sage packaged for other distributions.
Others
Here are some other task lists:
Potential Mentors
- Burcin Erocal
- Peter Jeremy (FreeBSD port, possibly libm task)
- William Stein
Notes:
We should take care to define deliverables for the items below. These should be doable with less than 3 months of work.
Here is what the FAQ says for "Ideas" lists:
- An "Ideas" list should be a list of suggested student projects. This list is meant to introduce contributors to your project's needs and to provide inspiration to would-be student applicants. It is useful to classify each idea as specifically as possible, e.g. "must know Python" or "easier project; good for a student with more limited experience with C++." If your organization plans to provide an application template, you should include it on your Ideas list.
- Keep in mind that your Ideas list should be a starting point for student applications; we've heard from past mentoring organization participants that some of their best student projects are those that greatly expanded on a proposed idea or were blue-sky proposals not mentioned on the Ideas list at all.
And this is from the notes on organization selection criteria:
- 2) Do the projects on your ideas list look feasible for student developers? Is your ideas list thorough and well-organized? Your ideas list is the first place that student participants are going to look to get information on participating in GSoC, so putting a lot of effort into this list is a good thing(tm). One thing we noticed and really appreciate is how some organizations classified their ideas by easy, medium and difficult, and specifically listed the skills and background required to complete a given task. It might also be cool to expand on each idea with some places to get started research-wise (pointers to documentation or specific bugs), as well as the impact finishing a given idea will have for the organization.