Go Back   Forum Care Forums > Development Reference Area > MySQL Discussion

Reply
 
LinkBack Thread Tools Display Modes
a new statistical/data-crunching/ETL programming language
Old
  (#1)
Robert
Guest
 
Posts: n/a
Default a new statistical/data-crunching/ETL programming language - 06-04-2007, 07:49 AM


Vilno is a new data crunching programming language. It's available as a
file attachment at the August 31 blog-cast at
www.my.opera.com/datahelper . More information is at
www.xanga.com/datahelper and datahelper.blogspot.com .

The positive: The syntax of Vilno is a lot more innovative than that of
SAS or SPSS, which allows one to achieve more data crunching with less
code. This productivity gap between the Vilno data processing function
and the SAS datastep ( or SPSS data crunching ) will only get bigger
over time because the internal architecture of Vilno gives it a lot of
room to grow ( for versions 2.0, 3.0, etc.). Also, the source code for
Vilno is probably tiny compared to the accumulated source at SAS or
SPSS, which makes Vilno much easier to enhance and extend.

The negative: Not yet ported to Apple/Windows. Still needs a library of
mathematical functions and date/time functions(particularly important
for data crunching). Not yet extended and integrated with a library of
statistical functions( regression, ANOVA, etc.).

DATA ANALYSIS = DATA CRUNCHING + STATISTICAL ANALYSIS

Data crunching has many names: data cleansing, data preparation, data
munging. It is the least glamorous of the two halves, but far more time
consuming. You cannot do proper data analysis without it. It's also the
"T" part of "ETL" (extract, transform, load). There is no one agreed
name for "data crunching".

Statistical analysis is the application of mathematical procedures to
produce analysis statistics and p-values. The choice and interpretation
of these statistical procedures requires some knowledge of applied
mathematics ( i.e. statistics ). Many people find this activity to be
far more interesting than data crunching ( I however find data
crunching to be a fascinating challenge ).

S-Plus ( or R ) is good at statistical analysis, but not data
crunching.
Vilno is excellent at data crunching (date/time functions aside), but
does not yet do statistical analysis.

In data crunching/preparation, there has been a dramatic slowdown in
productivity growth over the last 20 years. This is because a software
monopoly causes a lack of competition, hence a slowdown in creativity
and innovation. (actually, that applies to the pharmaceutical industry,
where there is a monopoly).

All three major statistical programming languages ( S, SPSS, SAS ) are
at least three decades old. It's time to shake things up a bit.

   
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote

Re: a new statistical/data-crunching/ETL programming language
Old
  (#2)
Jerry Stuckle
Guest
 
Posts: n/a
Default Re: a new statistical/data-crunching/ETL programming language - 06-04-2007, 07:49 AM

Robert wrote:

<SPAM snipped>

And exactly what does this have to do with MySQL?

Sorry - any language which has to be spammed to unrelated newsgroups
must not be worth using.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
EMAIL REMOVED
==================
   
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On





Contact Us - Forum Care Forums - Archive - Top