Matches for diana_coman, 488 total results
Thu Jul 17 15:10:55 UTC 2014 <diana_coman> uhm, think that would be of interest?
Thu Jul 17 15:10:35 UTC 2014 <diana_coman> it's a bit of a hack so not terribly precise - I count non-words and then add 1 - I've used basic string processing rather than all the text mining shit
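A minimal sketch of the counting approach described in the message above (count the non-word runs, then add 1), assuming a regular expression is an acceptable stand-in for "basic string processing". The function name `count_words` and the `\W+` separator pattern are illustrative assumptions, not the code that was actually used.

```python
import re

def count_words(line: str) -> int:
    """Approximate the word count of a line as (separator runs + 1)."""
    text = line.strip()
    if not text:
        return 0
    # Assumption: any run of non-word characters counts as one separator.
    separators = re.findall(r"\W+", text)
    # N separators between words implies N + 1 words; trailing punctuation
    # inflates the result, which is one reason the count is imprecise.
    return len(separators) + 1

print(count_words("it counts urls"))
print(count_words("hello!"))
```

Under this assumption a URL is split at every run of punctuation and so counts as several words, and a trailing "!" adds a phantom word, which fits the "not terribly precise" caveat.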
Thu Jul 17 15:09:45 UTC 2014 <diana_coman> mircea_popescu yes, it counts urls, but that's not the top by wordlength, the title was misleading I guess; updated now
Thu Jul 17 14:50:24 UTC 2014 <diana_coman> mircea_popescu updated the post with users by words and mean wordlength per user (per line), is that what you meant?
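One possible reading of "users by words and mean wordlength per user", sketched under stated assumptions: the log is available as (nick, text) pairs, words are whitespace-separated, and the mean is taken over all of a user's words. The function name `user_word_stats` and the input shape are hypothetical; the actual post may have computed this differently (for instance per line).

```python
from collections import defaultdict

def user_word_stats(messages):
    """Return {nick: (total_words, mean_word_length)} for (nick, text) pairs."""
    totals = defaultdict(lambda: [0, 0])  # nick -> [word count, character count]
    for nick, text in messages:
        for word in text.split():  # assumption: whitespace-separated words
            totals[nick][0] += 1
            totals[nick][1] += len(word)
    return {
        nick: (words, chars / words)
        for nick, (words, chars) in totals.items()
        if words  # skip users with no words to avoid dividing by zero
    }

sample = [("alice", "one two"), ("bob", "hello"), ("alice", "three")]
stats = user_word_stats(sample)
# Rank users by total words, highest first.
for nick, (words, mean_len) in sorted(stats.items(), key=lambda kv: -kv[1][0]):
    print(nick, words, round(mean_len, 2))
```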
Thu Jul 17 14:24:16 UTC 2014 <diana_coman> as for words and mean length, I'll be back with it in a few moments
Thu Jul 17 14:23:33 UTC 2014 <diana_coman> mircea_popescu re mismatch in number of lines - there are some missing in my data (part is what kakobrekla said earlier that I had to throw out one day due to funny things with the timestamps)
Thu Jul 17 13:52:03 UTC 2014 <mircea_popescu> 's stats ; logs between 26 March 2013 and 12 June 2014. mircea_popescu 91479 according to diana_coman. well which is it ? cause these can't both be right
Thu Jul 17 13:48:18 UTC 2014 <mircea_popescu> diana_coman users-by-lines is already in http://stats.bitcoin-assets.com/ users-by-words and per-user-mean-wordlength'd have been moar interesting.
Thu Jul 17 13:29:39 UTC 2014 <diana_coman> ThickAsThieves that's what the computer said!
Thu Jul 17 13:25:02 UTC 2014 <diana_coman> ThickAsThieves curiosity mainly; I've been around for a while, but I don't talk much
Thu Jul 17 13:24:20 UTC 2014 <ThickAsThieves> what brings you here diana_coman?
Thu Jul 17 13:23:54 UTC 2014 <assbot> 6 results for 'diana_coman' : http://search.bitcoin-assets.com/?q=diana_coman
Thu Jul 17 13:23:53 UTC 2014 <ThickAsThieves> !s diana_coman
Thu Jul 17 13:21:40 UTC 2014 <diana_coman> mircea_popescu uhm, the structures did not seem so uniform, but it could be worth a try
Thu Jul 17 13:17:31 UTC 2014 <diana_coman> ThickAsThieves it clearly is driven mainly by a few people
Thu Jul 17 13:15:23 UTC 2014 <diana_coman> simply because of the fact that the discussions really are pretty much about everything and in all forms and shapes
Thu Jul 17 13:14:39 UTC 2014 <diana_coman> obviously, excluding some more could make it a bit better, but from what I saw, it doesn't get it too far still
Thu Jul 17 13:14:12 UTC 2014 <diana_coman> I excluded stuff like "the" "a" etc
Thu Jul 17 13:14:02 UTC 2014 <diana_coman> I guess it depends on what you consider common
Thu Jul 17 13:13:53 UTC 2014 <diana_coman> pankkake, I excluded the most common ones
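A minimal sketch of the stop-word exclusion mentioned above, assuming a simple frequency count over lowercased, whitespace-separated words. The stop-word set here holds only the two examples quoted in the log ("the", "a"); the full list that was used is not given, and `top_words` is a hypothetical name.

```python
from collections import Counter

# Assumption: only the two examples from the log; the real list was longer.
STOP_WORDS = {"the", "a"}

def top_words(lines, n=10, stop_words=STOP_WORDS):
    """Return the n most frequent words in lines, ignoring stop words."""
    counts = Counter()
    for line in lines:
        for word in line.lower().split():
            word = word.strip(".,;:!?\"'()")  # drop surrounding punctuation
            if word and word not in stop_words:
                counts[word] += 1
    return counts.most_common(n)

print(top_words(["the cat and the hat", "a cat"], n=1))
```

Widening the stop-word set changes which words surface, so the outcome depends on what is considered common.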