[05:46:35] <earendel> hi. i've just created a dump of a db via mongodump -h <host> -o <dir> .. now i'm a little bit perplexed about the size. the total dump is ~ 250MB, while the original db files are like 1.7GB (4 files ending in .0 to .3) .. how come there's such a big difference?
[06:55:52] <joannac> you might want to re-evaluate if you really need text search on those fields
[06:56:13] <earendel> also i'm starting to realize a document db was not the right choice here.. the use case is simply logging irc chat data, so i have the same structure on each entry.. yet the field names are stored in every entry anyway..
[06:56:36] <earendel> yeah. without the search it would be totally useless.
[06:57:23] <earendel> i think even with search, a simple csv file would be faster to search through
[12:03:02] <adrian_lc> hi, does anyone know why the update result in the mongo shell is in the format {"acknowledged" : true, "matchedCount" : 1.0, "modifiedCount" : 0.0} while pymongo gives {'updatedExisting': True, u'nModified': 1, u'ok': 1, u'n': 1}? I wanted to implement some logic based on modifiedCount, but nModified doesn't seem to match that behaviour
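A minimal shell sketch of the distinction being asked about, using a hypothetical "test" collection: an update that matches a document but leaves it unchanged reports matchedCount 1 / modifiedCount 0, and pymongo exposes the same counts as UpdateResult.matched_count / UpdateResult.modified_count (derived from n / nModified in the raw reply).

    // hypothetical collection, for illustration only
    db.test.insertOne({ _id: 1, status: "new" })
    // setting a field to the value it already has: matched, but not modified
    db.test.updateOne({ _id: 1 }, { $set: { status: "new" } })
    // { "acknowledged" : true, "matchedCount" : 1, "modifiedCount" : 0 }
    // in pymongo: result.matched_count == 1, result.modified_count == 0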
[12:51:45] <lenochka> how can I search for all documents which have a list of embedded documents, where none of the embedded docs has a specific year - 2016?
[13:01:42] <StephenLynx> and you need to query for a range.
[13:02:08] <StephenLynx> have you read all query operators?
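One way to express that query in the shell, sketched against a hypothetical "movies" collection whose documents carry an embedded "releases" array with a "year" field: $elemMatch matches documents where some array element has year 2016, and wrapping it in $not inverts the match.

    // documents whose "releases" array contains NO element with year 2016
    db.movies.find({
        releases: { $not: { $elemMatch: { year: 2016 } } }
    })
    // note: this also matches documents with no "releases" array at all;
    // add { releases: { $exists: true } } to the filter if that matters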
[14:45:40] <Shokora> hi i have a question about projection in the MongoDB PHP Library
[14:46:21] <Shokora> I have pretty big documents, and for an index page I'm making I would like only the id and a few other fields from each document, because of memory issues
[14:46:43] <Shokora> I specified an array and passed it with $options['projection'] to my find method, but it seems to use more memory after doing that than before
[14:47:02] <Shokora> are the projections applied in memory? is there a way to do the projection so that it is less memory intensive?
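For what it's worth, projections are applied by the server, so only the projected fields travel over the wire; if memory use goes up, the projection is probably not reaching the server in the shape it expects. A minimal sketch of the intent in the shell, with invented field names (the PHP library's ['projection' => [...]] option takes the same document):

    // return only _id, title and createdAt; everything else is stripped server-side
    db.articles.find(
        {},
        { title: 1, createdAt: 1 }   // _id is included unless excluded with _id: 0
    )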
[15:50:41] <Doyle> If one of three config servers is offline, the cluster availability isn't impacted, just the metadata, right? Or does the metadata have to be committed against all three for a chunk to be accessible?
[15:51:13] <Derick> you're correct - it's only that the metadata can't be changed
[15:56:04] <Doyle> If the first config server in your mongos connection string goes down, is there a timeout that mongos waits for before trying the second server in the string?
[15:59:43] <wrkrcoop> so i just created a table called users with a field called username
[15:59:52] <wrkrcoop> i then ran db.users.createIndex({"username"})
[16:00:11] <wrkrcoop> 2016-09-19T08:49:48.348-0700 E QUERY [thread1] SyntaxError: missing : after property id @(shell):1:32
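The shell error is because {"username"} is not a valid object literal; createIndex expects a document mapping field names to sort directions. A minimal sketch:

    // 1 = ascending, -1 = descending
    db.users.createIndex({ username: 1 })
    // optionally enforce uniqueness:
    // db.users.createIndex({ username: 1 }, { unique: true })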
[16:20:17] <StephenLynx> 127.0.0.1 means it will be only reachable from localhost.
[16:20:30] <StephenLynx> 0.0.0.0 means from all interfaces.
[16:20:35] <wrkrcoop> because mongod outputs a line that says 'connection accepted from 127.0.0.1'
[16:20:58] <wrkrcoop> so i have to provide a host for this other db i'm trying out: mongo: ./cayley init --db=mongo --dbpath="<HOSTNAME>:<PORT>" -- where HOSTNAME and PORT point to your Mongo instance.
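For reference on the 127.0.0.1 vs 0.0.0.0 point above: the interfaces mongod accepts connections on are controlled by its bind address. A sketch with example values only (27017 is the default port):

    # listen on localhost only:
    mongod --bind_ip 127.0.0.1 --port 27017
    # listen on all interfaces:
    mongod --bind_ip 0.0.0.0 --port 27017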
[16:25:27] <spacecrab> if anyone is bored and feels like chiming in on this issue i'd be happy to discuss https://github.com/gravcat/mongodb/issues/1 -- when i was trying to make a replica set as fast as possible for the fun of it, i found that rs.add() worked exactly how i needed, but rs.initiate(), which i attempted first, did not.
[16:26:10] <spacecrab> if no takers, i'll just dig through the docs at some point and resolve the issue on my own :p thought it might be fun to talk about hypothetical scenarios
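On the rs.initiate() vs rs.add() point: rs.initiate() also accepts a full configuration document, so a whole replica set can be brought up in one call instead of initiating one node and then adding the rest. A sketch with placeholder hostnames:

    rs.initiate({
        _id: "rs0",                          // must match the --replSet name
        members: [
            { _id: 0, host: "host1:27017" },
            { _id: 1, host: "host2:27017" },
            { _id: 2, host: "host3:27017" }
        ]
    })
    // or initiate with defaults on one member and grow the set:
    // rs.initiate(); rs.add("host2:27017"); rs.add("host3:27017")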
[18:12:13] <kuku1g> hi guys, can someone tell me if splitting up collections makes sense? for example, i have sensor data where some records contain geospatial data and some don't. should I split them into separate collections?
[18:13:14] <kuku1g> i feel like reading the data again would be more complicated. but would it speed up queries that only go to one or the other collection?
[18:14:59] <kuku1g> with "reading the data again" I mean reading both geospatial and "normal" data at the same time
[18:15:22] <StephenLynx> on your current model, both of these go into the same document?
[18:15:44] <kuku1g> they go to the same collection yes. they are different documents however
[18:16:19] <kuku1g> i have a lot of "small" documents atm
[18:16:27] <StephenLynx> do they refer to the same pivoting point?
[18:16:53] <kuku1g> Yeah, the data logically belongs together
[18:17:41] <StephenLynx> what's the pivoting point?
[18:19:08] <GothAlice> kuku1g: The note about "speeding up queries" encourages me to link http://www.devsmash.com/blog/mongodb-ad-hoc-analytics-aggregation-framework — an excellent article on pivoting time series data (their example is sensor buoys) in different ways, comparing the performance and storage costs. This is against mmapv1, not WiredTiger, of course; however, pre-aggregation would allow for O(1) reporting.
[18:19:44] <GothAlice> (O(1) in the sense that for a given period of time, say, a report covering 24h, with hourly time-slices, only 24 records would ever be evaluated to answer the report query. Constant time.)
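A sketch of what such an hourly pre-aggregated bucket might look like, with invented field names: one document per sensor per hour, its counters maintained as events arrive, so a 24-hour report only ever touches 24 documents.

    // hypothetical hourly bucket in a "stats_hourly" collection
    {
        sensor: "buoy-17",
        h: ISODate("2016-09-19T18:00:00Z"),   // timestamp snapped to the hour
        count: 3600,                          // readings received this hour
        sum: 72514.2,
        min: 18.1,
        max: 22.7
    }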
[18:22:11] <kuku1g> The thing is that I do not want to create reports or aggregate any data.
[18:22:30] <kuku1g> I am logging sensor data to use that data and run machine learning algorithms on it after pulling it out of mongodb
[18:22:58] <kuku1g> I think aggregating my data per minute, he calls it "Document Per Buoy Per Hour" in your link, might be fine because I usually read out large chunks of data. I can see how that speeds up my queries a lot.
[18:23:30] <GothAlice> kuku1g: Yikes. Without pre-aggregation, your queries are guaranteed to have variable performance, mostly dependent on good index coverage and the RAM state of those indexes.
[18:24:22] <kuku1g> My use case is read only. I write data once and read it out afterwards. Some corrections (= updates) might happen, like once every few months for a single log file (a single log file is 800MB to 8GB of json - so there's a lot of overhead)
[18:25:08] <kuku1g> as I mentioned, I have TBs of this data. I will always query minutes or tens of minutes of it.
[18:25:44] <GothAlice> Note that we perform aggregation into hourly buckets at work _and_ store the original events, too. The original events are preserved for a period of time for auditing purposes using capped collections, and using capped collections also lets other processes "listen live". The events (and pre-aggregated stats for reports, dashboard widgets, etc.) are certainly read-only.
[18:26:16] <GothAlice> kuku1g: In your case, per-minute buckets would potentially _greatly_ reduce the amount of data. If you're getting a sensor reading at 100Hz, for example, you're saving 100:1 every minute with pre-aggregation.
[18:26:51] <kuku1g> GothAlice: With aggregation you mean that you put events together in a bucket right?
[18:27:27] <kuku1g> Oh I see. Yeah I definitely have to use this then. That's a great starting point. Thanks
[18:27:46] <GothAlice> Aggregation is one thing; pre-aggregation basically performs any calculations early and stores them instead of, or in addition to, the original data, benefitting from the constant time querying later.
[18:28:43] <GothAlice> And sorry, always get orders of magnitude wrong. 100Hz sample rate would result in a 6000:1 savings using per-minute buckets.
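A sketch of the write path for per-minute pre-aggregation, again with invented names: each incoming reading upserts its minute bucket with $inc/$min/$max, so a minute's worth of 100Hz samples collapses into a single document.

    // 'ts' is the reading's timestamp, 'value' its measurement (both hypothetical)
    var minute = new Date(ts);
    minute.setSeconds(0, 0);                  // snap to the start of the minute
    db.stats_minute.updateOne(
        { sensor: "buoy-17", m: minute },
        {
            $inc: { count: 1, sum: value },
            $min: { min: value },
            $max: { max: value }
        },
        { upsert: true }
    )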
[18:28:57] <kuku1g> GothAlice: Would you de-normalize even further and create more "views" on the data? Like splitting the data up by minutes as well as hours?
[18:30:05] <GothAlice> You create "bucket sizes" that match how you will want to present the data. If your charts have three zoom or granularity levels, e.g. per-minute, per-hour, and per-day, you can avoid the 24x more expensive processing of all the hourly bucket data for daily reports by pre-aggregating into daily buckets, too.
[18:30:20] <GothAlice> Note however, that this is all a trade-off.
[18:30:35] <kuku1g> GothAlice: What about mixing geospatial queries with time-range queries? In my use case, I will have to match the geolocation of my sensor data to bounding boxes (one at a time obviously) and then query +15 minutes and -15 minutes of the timestamp of the geospatial records found
[18:30:37] <GothAlice> Slightly more work on every event (to update one or more buckets) vs. potentially huge amounts of work all at once later.
[18:30:37] <StephenLynx> yeah, more work on inserts.
[18:30:49] <StephenLynx> in general, it really pays off in some cases.
[18:32:14] <GothAlice> kuku1g: Also possible. You could have a "locations" field in each bucket which uses $addToSet to add each distinct location. You can then easily perform two queries: one to find by geospatial, the second to find the ±15 minute results from there?
[18:34:30] <GothAlice> Your request is clearly two queries, regardless of underlying storage: one geospatial, the other time-based. (Though some approaches can make this worse, i.e. two queries per sensor, etc.)
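A sketch of the two-step query described above, assuming invented collection and field names, per-minute buckets keyed on "m", and a "locations" array of legacy [lng, lat] pairs maintained with $addToSet: first find buckets whose locations fall inside the bounding box, then pull everything within ±15 minutes of each hit.

    // step 1: buckets with at least one recorded location inside the bounding box
    var hits = db.buckets.find({
        locations: { $geoWithin: { $box: [ [ 8.40, 49.00 ], [ 8.50, 49.10 ] ] } }   // example box
    }).toArray();

    // step 2: for each hit, fetch the surrounding +-15 minutes of buckets
    hits.forEach(function (hit) {
        var from = new Date(hit.m.getTime() - 15 * 60 * 1000);
        var to   = new Date(hit.m.getTime() + 15 * 60 * 1000);
        var window = db.buckets.find({ sensor: hit.sensor, m: { $gte: from, $lte: to } });
        // ... hand 'window' off to the ML pipeline
    });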
[18:39:18] <kuku1g> I thought I'd have buckets of, for example, 0..59 in my document, and each of them represents a minute within an hour. so when I query for timestamp "19-09-2016 - 20:28" + 15 minutes, what do i do? how do i treat the subdocument that contains the buckets as one big chunk of data?
[18:39:23] <GothAlice> kuku1g: If you only had buckets of one minute, you can easily query 10 minutes at once. You're processing 10x as many records, however. My previous example was assuming buckets of one hour, but queries covering a granularity of 24 hours. 24x as many records. Using only buckets of per-minute, but desired granularity of one day would result in a 1440x increase in the number of records processed to answer queries of that granularity.
[18:40:14] <GothAlice> https://gist.github.com/amcgregor/1ca13e5a74b2ac318017 is an example from some older code at work.
[18:40:32] <GothAlice> The second file, sample.py, represents a pre-aggregated bucket record.
[18:41:30] <GothAlice> You should be able to see lots of different counters (in this case, click data, so browser, platform, etc.) and the "h" hourly period field, which is a date/time with the minutes/seconds set to zero / snapped to the hour.
[18:42:55] <GothAlice> Note that there isn't one document per hour per metric in my latest example, but one document for all metrics per hour per distinct job. No arrays of sub-documents to worry about.
[18:43:08] <kuku1g> GothAlice: I need a granularity of hours. My log data does not span a whole day. Each log file contains sensor data that was logged at most ~8 hours each.
[18:43:25] <GothAlice> In my case, I can literally query the "h" field for $gte and $lt the target time.
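Concretely, with buckets keyed on an "h" field like in the gist, a time window is just a range query over "h"; a sketch with example values:

    // all hourly buckets for one job between 00:00 and 08:00 UTC on 2016-09-19
    db.stats_hourly.find({
        job: jobId,   // hypothetical reference, defined elsewhere
        h: {
            $gte: ISODate("2016-09-19T00:00:00Z"),
            $lt:  ISODate("2016-09-19T08:00:00Z")
        }
    })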
[18:44:43] <kuku1g> GothAlice: I see. that's actually sweet. I might adapt that to what I need. http://pastebin.com/ERbETVP1 this ain't exactly what you mean right?
[18:47:50] <GothAlice> Refer back to the original blog post I linked.
[18:49:39] <kuku1g> lmao. now I get the whole concept.
[18:50:49] <kuku1g> GothAlice: one more concern though. you said you would add all distinct locations (they are all distinct because they are taken from a GPS trace of a moving object) to a list. Why so? Why not keep the original GPS events in the "events" field?
[18:52:12] <GothAlice> kuku1g: Could you gist an example of your data, with, say, three measurements included?
[18:53:05] <GothAlice> Not pre-aggregated or anything, but what you consider to be the data relevant for three specific events and their surrounding context. The last pastebin is too abstract. ;P
[18:55:09] <kuku1g> gonna give you a coffee for your stolen 15 minutes lol :)
[18:56:14] <GothAlice> While I do have a Patreon, I tend not to point at it in channels I help support. Somewhat ironic lack of self-marketing, there, but I'm not here to profit, I'm here to help. ;P
[19:07:59] <kuku1g> I can't give away any of our data really
[19:08:14] <GothAlice> kuku1g: Fake the numbers if you have to. XP
[19:08:22] <kuku1g> No, just a description of our data layout really. Not MongoDB related
[19:10:13] <GothAlice> As you are using "table" terminology, and listing fields like a CSV file (instead of using JSON notation), I'll also link you http://www.javaworld.com/article/2088406/enterprise-java/how-to-screw-up-your-mongodb-schema-design.html
[19:31:15] <kuku1g> I can't see what you're missing from the latest pastebin except the descriptions, though?
[19:31:22] <kuku1g> my data do not have arrays or the like
[19:32:02] <kuku1g> there are just hundreds of thousands, up to xx million events of the same structure i posted in the pastebin before.
[19:32:27] <kuku1g> Remember, that's the "event by event" view of course. All events one by one.
[19:33:17] <GothAlice> If you can give me 40 minutes, I have a bit of a deadline for something at work, but then I can dive into helping you properly. :)
[19:39:40] <GothAlice> StephenLynx: I need to better randomize the list or something to keep enough funny / tongue-in-cheek ones mixed in to keep readers interested, apparently. ;^P
[19:41:47] <StephenLynx> not having a philosophy might as well be a philosophy in itself ¯\_(ツ)_/¯
[19:42:05] <StephenLynx> one of the reasons I go with free software and not open source.
[19:42:17] <StephenLynx> open source is a dozen rules that no one ever remembers
[19:42:28] <StephenLynx> free software is 4 simple things.
[19:43:21] <StephenLynx> >Github private repositories for hosting of Marrow related services, such as package index, documentation site, wiki, etc.
[19:43:31] <StephenLynx> you can do that on any system running gitlab for free btw.
[19:44:08] <GothAlice> StephenLynx: For various definitions of "free".
[19:44:37] <StephenLynx> you talking about free software or the private repository feature?
[19:44:47] <GothAlice> However, while I appreciate the critique of my Patreon milestones, feel free to PM them to me. One reason I've avoided linking it in the past is to avoid making it the topic of discussion.
[19:45:13] <StephenLynx> not like we're disrupting anything going on
[20:06:52] <kuku1g> GothAlice, StephenLynx: so i think that's how you want me to model the data: http://pastebin.com/D47FrbUi how far off am I?
[20:07:11] <synthmeat> no, this totally disrupts my bi-yearly mongodb question that ends up being a mongoose mongoosery
[20:07:56] <kuku1g> is the second model superior? if yes, why?