pmxbot IRC Log Viewer

[00:18:48] <rkgarcia_> niftylettuce_: what's up?

[00:19:00] <rkgarcia_> please explain your question

[00:20:07] <niftylettuce_> Right now, per that link to StackOverflow, you have to do `docs.filter` in order to filter out any tags that were not populated (e.g had null or undefined values). What I want to do, is instead of having to do this extra step with `docs.filter`, I want to add something like `.where({ tags: { $ne: undefined })` or something to the main query before `.exec`

[00:20:08] <niftylettuce_> is called. However it appears this feature is not supported or something.

[00:20:10] <niftylettuce_> rkgarcia: ^

[00:20:37] <niftylettuce_> at least thats what the people said in that S.O. post...

[00:23:26] <niftylettuce_> basically it would be otherwise fine to just use this .filters thing

[00:23:29] <niftylettuce_> but this means i then can't do .limit and .sort

[00:23:40] <niftylettuce_> so it results in longggg queries that take 10s of seconds instead of ms

[00:24:00] <niftylettuce_> e.g. by having .limit and .sort in the main query, not at/after the .filters part as shown in S.O., it makes it fast

[00:26:33] <rkgarcia> niftylettuce_: i don't know mongoose :(

[00:26:59] <niftylettuce_> bah

[00:26:59] <niftylettuce_> :(

[00:27:19] <rkgarcia> :(

[00:27:23] <rkgarcia> sorry

[01:12:04] <Asterfield> Is this channel being logged in a mongo DB, by any chance? :P

[01:13:30] <Asterfield> Anywho, question time. I _really_ love the idea of mongo, but my workplace uses MSSQL exclusively, so my brain has a lot of table-based habits

[01:14:04] <Asterfield> I'm trying to build a bit of a messenging service, for fun

[01:16:05] <Asterfield> I have a `users` collection and a `conversations` collection. Right now, I'm trying to query all the conversations that have a certain user's `_id` as `userA` or `userB`.

[01:17:13] <Asterfield> This seems like it's going against the spirit of mongo, but I don't know the "correct" way of structuring things

[01:24:48] <Asterfield> Also, I'm having technical problems with the query using nodejs and monk :P

[01:33:53] <Asterfield> What is the appropriate way to structure conversations and users in that a conversation can be queried by the user inside it?

[01:43:36] <Asterfield> Is monk's querying ability crippled? I can't get a simple `or` or `in` to work

[01:46:15] <Asterfield> Bleh, dead community

[01:47:32] <BurtyB> ut oh the community is dead... best shut up shop and go home :(

[02:20:48] <rkgarcia> what's happening BurtyB ?

[02:22:44] <BurtyB> sleep for me

[02:27:03] <alsotest> why does mongoimport not want to import this? https://raw.github.com/shuw/ies-twitter/master/data/crunchbase/service-providers.js

[02:27:31] <rkgarcia> alsotest: test without []

[02:27:38] <rkgarcia> at start and end

[02:34:16] <rkgarcia> alsotest: you need a document per line, without "," separated

[02:38:24] <alsotest> rkgarcia: that did the trick! thank you

[02:39:25] <rkgarcia> alsotest: or you can put db.collectionName.insert( at start

[02:39:31] <rkgarcia> and ); at the end

[02:40:00] <rkgarcia> and run in shell "mongo databaseName < service-providers.js "

[02:41:41] <alsotest> rkgarcia: marry me. don't think about it. just go for it

[02:41:50] <alsotest> that works exactly as i was hoping as possible

[03:04:29] <rkgarcia> see you

[03:44:57] <MacWinner> does mongodb have the concept of "Query Cache"?

[03:45:13] <MacWinner> like in mysql..

[03:45:38] <MacWinner> nm.. found the answer:)

[04:34:07] <Fandekasp> hi there

[04:34:19] <Fandekasp> I have a problem connecting to my mongodb set from heroku

[04:35:31] <Fandekasp> normally, clicking on the addon redirects to the proper interface, but today I am redirected to partners.mongohq.com and I have to login with unknown credentials (those setup by heroku are username&password, partners.mongohq.com asks for email&password

[04:35:57] <Fandekasp> Any idea why this only happened today? and what email am I supposed to use there?

[04:36:53] <ranman> Hi Fandekasp: this is a mongodb channel not a mongohq channel but have you tried just using your existing credentials?

[04:38:32] <Fandekasp> ranman: oh right, sorry. and yes, I tried to fill the email field with the username "heroku", without success. I then tried to register with the same email I use in heroku, but it didn't find my heroku mongodb

[04:39:25] <ranman> try to use the username&password that heroku setup?

[04:39:28] <ranman> even though it says email?

[04:42:00] <Fandekasp> ranman: that's what I said. I tried and could not login

[04:42:56] <ranman> I thought you tried to use the username "heroku" which was not your personal username I assumed

[04:42:57] <ranman> anyawy

[04:43:06] <ranman> *anyway that's something to take up with the mongohq or heroku guys

[04:43:12] <ranman> we can't really do anything here sorry :/

[04:43:29] <Fandekasp> no worries, thx

[06:02:40] <safani> Im getting an error Failed to load c++ bson extension, using pure JS version , any ideas

[08:17:19] <Repox> Hi guys. I'm having an issue with updating some members. I'm trying the following command which reports an error (which can be seen in the same pastie). Could you help me figure out how I can achieve what I'm trying to do? http://pastie.org/private/fyyehbo0ifstlappxvevw

[08:47:03] <kali> Repox: i assume what you want is to use $set. the second update argument can be either - a full document that will replace the matched document, - or a combination of $xxx modifiers (like $set, $addToSet, $push, etc)

[09:26:02] <Striki> we need to change the format of the shard key. we must do it manually by deleting the documents, apply the hashing in client code and insert them again. the number of documents that we need to change are around 100-200 million in a 530 million document collection. It's distributed over 4 shards. Should we stop balancing while running the program that takes care of the changes?

[09:26:31] <Striki> the thing is, we started out sharding the collection without hashing. then we changed it to be hashed but we haven't changed the "leftovers"

[09:26:34] <Striki> it's a bit of a hack

[09:26:49] <Striki> now we need to change the leftovers because we are experiencing hot shard because of this mess

[09:27:03] <Striki> even though "only" 100-200 million out of 530 million of those documents do not have a hashed shard key

[09:27:21] <Striki> currently the shards are moving stuff around

[09:27:30] <Striki> have been since yesterday, just automatic balancing

[09:27:40] <Striki> just wondering if I should turn off the balancing while making these big changes

[09:28:11] <Striki> running that program will take some days as well and the system will be kept running as well. the collection is increasing by around 3-4 million documents a day IIRC

[10:05:59] <ahawkins> Hello everyone, I have an array of strings. How can I query documents where an array element matches that regex?

[10:06:19] <ron> pray.

[10:11:58] <kali> ahawkins: have you tried the obvous way ? :)

[10:12:25] <ahawkins> in + regex worked according to my test, don't know if that's the best way

[10:12:26] <Nodex> regex works on arrays

[10:44:43] <maisapride> Hello

[11:14:55] <Rhaven> Hi, is there a way to add an arbiter in mms console? I tried this solution : https://jira.mongodb.org/browse/MMS-1158, but the mms agent throw an error : "ERROR NonBlockingMongoStatsThread-xxx:27016 Problem collecting non-blocking data: xxx:27016 is an arbiter"

[11:15:33] <Guest65335> hello all...

[11:16:05] <Guest65335> i'm having problems installing latest mongodb release through apt

[11:16:19] <Guest65335> the connection is very slow...

[11:16:34] <Guest65335> checked ping and it seems normal...

[11:16:54] <Guest65335> any ideas what might be the cause?

[11:39:56] <muraliv> hi group, I'm trying to enable authentication in a replication setup. I've --keyFile enabled. after creating admin and normal user, restarting mongodb with --auth option, replication by itself is not happening.

[11:40:33] <muraliv> it's in recovering mode only. and error log says, "command denied"

[11:40:44] <muraliv> any help would be appreciated.

[11:45:14] <muraliv> do I have to create users and set password in every mongodb instance, to enable authentication in a replication environment?

[11:56:50] <_boot> muraliv: my understanding is that all the replicas have the same keyFile

[11:57:10] <_boot> muraliv: and then you add the users on the primary and you're good

[11:57:25] <muraliv> _boot: yes, all the replicas have the same file infact, all the repl instances are running in my machine only.

[11:57:43] <muraliv> _boot: yes. that's what I've done.

[11:58:03] <_boot> ah I just scrolled up and saw the rest :)

[11:59:02] <_boot> I've no idea, was the replica set all happy before?

[11:59:28] <_boot> and do they all have auth enabled?

[12:00:33] <muraliv> _boot: yes. before enabling auth, I logged in and created admin user in admin db and a normal user in my db. only after that I enabled --auth in command prompt

[12:01:26] <muraliv> no. not on all replicas. only on the primary I created both the users. because, if I create a user in primary, the collection and document is replicated in the secondary too.

[12:01:59] <_boot> perhaps add a third replica or just an arbiter?

[12:02:36] <_boot> though I guess that's difficult to do if none of them want to be primaries

[12:03:56] <muraliv> ohhh.. but is my approach correct? i.e. I've first created replicas. and then added users in primary.

[12:04:34] <muraliv> or should I create users individually in every instance, before replication. then restart mongodb with -auth and then enable replication?

[12:04:57] <_boot> no it should be fine as is

[12:05:08] <_boot> i've never had to individually manage users on any of the replicas i've used

[12:05:20] <muraliv> yes.. but stil I'm facing this strange error.

[12:05:32] <muraliv> yes. that's I've read on docs.

[12:06:24] <_boot> hmm, I don't know :|

[12:07:30] <muraliv> :) anyways, thanks a lot for your time. if I came to find it out, will update you . thanks.

[12:08:13] <_boot> okay, cool

[13:56:44] <OliverJAsh> i'm console logging my mongodb queries, however they are converted to JSON which means object IDs just become strings, which means i can’t copy the query straight into the mongoDB client/REPL for inspection.

[13:56:59] <OliverJAsh> i have to go through and manually convert strings to ObjectIDs

[13:57:12] <OliverJAsh> is there anything i can do about this?

[14:03:58] <_boot> will {$oid:"objectidstring"} work in the shell?

[14:19:04] <OliverJAsh> i get this

[14:19:04] <OliverJAsh> error: { "$err" : "bad query: BadValue unknown operator: $oid", "code" : 16810

[14:37:15] <_boot> i guess not then :|

[15:04:31] <ryansb> Hey, I have a question about MMS in multi-tenant deployments. It says here http://bit.ly/1cuaKQV that a single mms agent should be able to access all of your mongodb nodes to monitor them

[15:04:46] <ryansb> and that a second agent should only be installed for redundancy

[15:06:04] <ryansb> In my case I want to have one mms agent per cluster (separate config servers, etc)

[15:06:14] <ryansb> and the clusters are isolated from each other

[15:07:32] <ryansb> Is there a setting to do that, or is there a string 1 MMS per deployment limit?

[15:11:05] <cheeser> isn't done by creating separate entities in mms and then attaching agents with those entity IDs?

[15:14:09] <ryansb> Do you mean creating an entirely separate group (e.g. myco.com1 and myco.com2) or within the group registering separate clusters?

[15:14:36] <cheeser> probably a separate group. i haven't used mms much it seems like that's the approach used.

[15:15:11] <joannac> separate group

[15:15:18] <joannac> only one MMS agent is active at a time

[15:15:26] <joannac> (in a group)

[15:15:31] <cheeser> there it is. thanks joannac. :)

[15:15:41] <cheeser> now, time to board the plane. heading home finally.

[15:16:38] <ryansb> joannac: thanks. I see there's a ticket to change that https://jira.mongodb.org/browse/MMS-1551

[15:16:42] <odc> Hi there. Question: let's say I have a collection with plenty of small text fields and one big blob field. To get the best read performance, should I stay like this or store the big blobs in a separate collection or database?

[15:17:42] <joannac> are you going to access both the small fields and the large blob at the same time?

[15:18:07] <odc> joannac, usually no, i will use projections

[15:18:24] <joannac> then separate them so you don't have to fetch the whole thing

[15:19:29] <odc> that's what i thought, it's impossible to load only one part of a document right?

[15:21:03] <Nodex> odc, you can project anything you like

[15:26:31] <odc> Nodex, but even with projection, mongodb will first have to load the full document, and then only copy the selected fields in the response buffer

[15:26:37] <odc> i think?

[15:38:05] <Nodex> odc, I don't understand what you're asking. Are you saying it's quicker to do 2 queries or somehting?

[15:38:44] <joannac> it's not quicker to do 2 queries

[15:39:13] <joannac> but if you have a small field and a large field, and the large field is rarely needed, you don't want to incur the cost of reading the large field in all the time

[15:39:53] <Nodex> I know it's not quicker, I was asking for confirmation of what he / she was asking :)

[15:40:07] <joannac> okay, don't mind me :)

[15:40:31] <Nodex> haha

[15:41:19] <odc> yeah, i'm wondering if mongodb internally reads/loads the whole document, even if i only selected a few fields

[16:00:54] <joannac> odc: yes

[16:45:20] <dsdsf> hi all ! I have mongodb in my server with 3 replication. I delete by accident one collection (remove documents) from 1 replica. How can I know that there is backup in other replica? And how do I backup from another replica?

[16:45:56] <hydrozen> Hi there. I’m getting the following error on a query sometimes: https://gist.github.com/hydrozen/9370933. I do have an index defined on the updated_at field. Any ideas what I can try to improve things?

[16:47:01] <joannac> hydrozen: gist your list of indexes?

[16:47:25] <joannac> dsdsf: do you already have backups? replication is for HA, not backups...

[16:47:34] <hydrozen> @joannac: yeah, gimme a minute

[16:47:58] <dsdsf> joannac: no :( I don't have

[16:50:55] <hydrozen> @joannac: updated the gist. Thanks.

[16:51:38] <hydrozen> @joannac: umm, I don’t see updated_at in the index oddly

[16:51:56] <hydrozen> so mongoid must be lying somehow

[16:55:14] <joannac> dsdsf: delayed secondary?

[16:55:59] <joannac> dsdsf: filesystem backups?

[16:56:55] <joannac> dsdsf: taken writes to the database since dropping the collection?

[16:57:05] <joannac> dsdsf: did you drop a collection or delete documents?

[16:58:08] <dsdsf> joannac: delayed secondary - very small

[16:58:22] <dsdsf> joannac: filesystem backups - idk

[16:58:36] <dsdsf> taken writes to the database since dropping the collection - yes

[16:58:48] <dsdsf> did you drop a collection or delete documents? - delete all docs

[16:59:06] <joannac> you're probably out of luck

[16:59:19] <joannac> if you had some kind of filesystem backup you could mongodump from that

[16:59:36] <joannac> your extents have probably been reused by now

[16:59:43] <joannac> take backups in future

[17:02:16] <joannac> I guess *in theory* you could go into your db files and look for BSON and see if you can recover anything?

[17:39:16] <sgfny> I have three collection A, B, C with multiple Cs having a field with DBRef to B and multiple Bs having a field with DBRef to A. Is there a way to do a mongo aggregate query on C using a reference to A?

[18:02:47] <mboman> Has anyone used Amazon Glacier to backup MongoDB? How do you do it?

[18:06:20] <joannac> I thought glacier was more for archiving than backups

[18:21:12] <joannac> mboman: do you run something sufficiently non-critical that you can afford to use glacier?

[18:22:16] <mboman> joannac, It is my own research on my home computer running under the stairs

[18:22:33] <mboman> joannac, so it is more important that I can get the data back then getting it back quickly

[18:22:45] <mboman> joannac, and I need it on the cheap

[18:23:51] <mboman> joannac, to be fair, it will probably take me longer to replace the drives then suck the data back from Amazon...

[18:27:24] <kali> mboman: it's very simple actually. put your backups in an s3 bucket, and you setup a few rules with the aws console to get them to the glavier vault

[18:38:25] <mboman> kali, cool

[19:16:44] <nukulb> hey , quick question about Mongo, I have a product Schema, that has a details object. Each Detail is simply a name value pairs, like price: 25. size: 7. Would it better to store details as an array or simply objects. As in is this better http://pastebin.com/HKcRd8xg or http://pastebin.com/i2acpxBW. Remember that details are different between different products and not a standard list

[19:17:09] <nukulb> joannac: ^^ any help would be appreciated

[19:17:50] <nukulb> Sorry second link should be this - http://pastebin.com/nbMZfZSe

[19:18:02] <nukulb> so is this better http://pastebin.com/HKcRd8xg or http://pastebin.com/nbMZfZSe

[19:31:03] <andrewrk> what's the point of an objectid? why not use a uuid instead?

[19:35:12] <mboman> How can I see how large (storage requirements MB/GB) a collection is? 'show dbs' tells me the whole DB, but not the individual collections....

[19:43:41] <joannac> db.coll.stats() ?

[19:45:16] <joannac> nukulb: how many properties ?

[19:46:07] <nukulb> joannac: 20 to 100

[19:46:45] <nukulb> average would be closer to 100

[19:46:54] <nukulb> joannac: ^

[19:47:37] <joannac> nukulb: are they often updated / queried?

[19:47:55] <nukulb> read very often

[19:48:00] <nukulb> update not as often

[19:48:07] <nukulb> new attributes might be added quite often

[19:48:27] <nukulb> joannac: ^

[19:48:49] <joannac> hmmm

[19:50:08] <joannac> I would do {details: [{k:'name', v: 'nuklb'}, {k:'type', v:'irc user'}}

[19:50:09] <joannac> etc

[19:50:18] <joannac> and then index on details.k, details.v

[19:52:22] <nukulb> joannac: in my current testing so far that seems to be significantly slower than [ {'k': 'nukulb'}, {'type':'irc user'}]

[19:52:53] <nukulb> for both read and write

[19:53:16] <nukulb> my writing test isn't great though, since I am writing the entire product, not individual items in the array

[19:54:06] <nukulb> joannac: what do you like about the pattern you suggested?

[20:13:54] <joannac> nukulb: it's indexable

[20:14:13] <joannac> nukulb: how does yours work with multiple types?

[20:14:29] <joannac> sorry, multiple details

[20:14:48] <joannac> oh, sorry

[20:14:55] <joannac> okay, how are you going to index yours?

[20:15:05] <joannac> there's a limit on the number of indexes

[20:15:19] <nukulb> so I just finished a test with 100K products, 100 attributes each product

[20:15:22] <nukulb> for both styles

[20:15:33] <nukulb> with an index like the one you described

[20:16:21] <nukulb> and despite the index , its slower get product details with this style - {details: [{k:'name', v: 'nuklb'}, {k:'type', v:'irc user'}}

[20:16:42] <nukulb> joannac: ^

[20:18:01] <nukulb> now the question is there a real reason for me index on the details

[20:18:15] <nukulb> I only need details for each product at a time

[20:18:27] <nukulb> so do I really care? joannac ^

[20:18:34] <joannac> I don't know, do you?

[20:18:40] <joannac> I don't know what your use case is

[20:19:18] <nukulb> joannac: well what does an index on details.k, details.v actually provide me?

[20:20:19] <joannac> indexed searching on any detail

[20:20:49] <nukulb> joannac: got it

[20:20:53] <nukulb> I will design another test

[20:20:54] <nukulb> thanks

[20:21:16] <nukulb> joannac that was super helpful. Appreciate your responses today.

[21:07:03] <Chepra> Hi. Is there a way to reclaim space after deleting collections?

[21:07:28] <Chepra> db.repairDatabase() cannot be run, there is no more disk space

[21:08:13] <module000> Chepra: if you can take down your DB, you can dump/restore to reclaim the space

[21:08:22] <module000> Chepra: but if you're at 100% on the disk you have other problems

[21:09:09] <Chepra> I am not at 100% disk space :) I some gbs free. I just deleted a bunch of collections and want to free this space up :)

[21:09:50] <kali> http://docs.mongodb.org/manual/faq/storage/ start there

[21:10:08] <kali> compact may actually be a solution in your case

[21:10:53] <Chepra> how? I already deleted these collections ;)

[21:11:18] <kali> ha !

[21:11:22] <kali> maybe not then :)

[21:11:52] <Chepra> the actual working set of the database is about 250mb, but the disk size is about 15gb :(

[21:13:56] <cheeser> nice

[21:15:29] <Chepra> nice? :( noooo :D

[21:16:44] <Chepra> jesus

[21:16:55] <Chepra> there must be a way to do that o_o

[21:27:11] <cheeser> check for external files and clear up some space enough to allow repairDatabase to run

[21:27:24] <cheeser> check /tmp, /var/log, etc. for files that can be pruned

[21:28:34] <Chepra> there are so such files.

[21:28:39] <Chepra> I will dump the database and recreate it

[21:35:28] <nicholaswyoung> I'm designing a geoanalytics system on top of Mongo, and could use input from someone more knowledgable. Any insight on this thread would be greatly appreciated: https://groups.google.com/forum/#!topic/mongodb-user/rs4pgTS_bQI

[21:36:32] <nicholaswyoung> I've worked out how to use $addToSet, but incrementing the embedded counters... I'm still lost. Maybe I'm going about this entirely wrong, too. So rebuke I'm going astray!

[21:40:23] <michael_swe> nicholas: If you're going to enumerate all countries and regions your document will get really big. Why don't you create one document per counter?

[21:42:31] <nicholaswyoung> michael_swe: That's a very valid point. I'm already doing some document partitioning based on URL and day of the month, and creating collections by month number.

[21:43:38] <nicholaswyoung> michael_swe: I think with those partitions already in place, it will scale well enough, unless you see something else wrong. Essentially, in the current draft, there's a JSON structure (like I showed in the gist) for each 24-hour day.

[21:46:16] <nicholaswyoung> michael_swe: But then again, I'm no expert. What do you think? Am I relatively safe?

[21:46:57] <michael_swe> I'm new to noSQL (comming from SQL Server), so please take my words lightly :) I'm still wondering though.. how many countries/regions will be enumerated. You can't index that kind of structure.

[21:49:01] <nicholaswyoung> I'd say, at max, 150. That's not terrible, given the performance I've seen from Mongo thus far.

[21:49:31] <nicholaswyoung> The biggest issue is incrementing those counters, and so far, the documentation hasn't helped much.

[21:50:28] <michael_swe> Ah, ok. Thought it would be thousands

[21:50:30] <nicholaswyoung> I've tried to comprehend $elemMatch, which I think I need to select the appropriate title="" key, but I don't understand it yet. I'm hoping someone else does!

[21:51:38] <nicholaswyoung> and given that I'll be pulling everything back one document at a time (since documents are already created one for each day), indexing shouldn't be too horrid.

[21:52:56] <michael_swe> You could loose one depth level by naming your keys "countrycode.region", i.e. "US.California". Wouldn't that make incrementing the counters easier?

[21:53:47] <BurtyB> michael_swe, that's what I was thinking too

[21:55:23] <nicholaswyoung> I did try that, but encountered some issues with international naming schemes. (I have a trial of that system running here: http://api.machine.fm/episodes/5313a16a3230350a20010000/accesses)

[21:55:55] <nicholaswyoung> Not every region maps cleanly to a key => value.

[21:56:56] <nicholaswyoung> I guess I should also mention that I'm not concerned with the complexity of queries needed to achieve the initial schema design, because I can handle 'em in a queue. This isn't a realtime system.

[21:57:10] <nicholaswyoung> BurtyB: I wish it was that easy. :-/

[22:03:13] <nukulb> nicholaswyoung: not to interrupt your conversation , but I am currently using $elemMatch, like this - http://docs.mongodb.org/manual/faq/indexes/

[22:03:37] <michael_swe> How many days will you save in one document? You can't go on forever (mind the 16MB limit). I'm sorry, but I have no clue of how to increase those counters in an effective way. Why don't you make a "days" collection and reference the book/media or whatever from each document? You'll have to make some bucketing anyway when more days goes by.

[22:03:54] <nicholaswyoung> michael_swe: One day per document. One collection per month.

[22:04:32] <michael_swe> But the document in your demo api has five days..

[22:04:35] <nicholaswyoung> nukulb: you're not interrupting at all. Let me take a peek at that link, because I don't think I've seen it before.

[22:05:01] <nicholaswyoung> michael_swe: It's only been up for a few days. That's the test environment, where I'm prototyping this idea.

[22:06:10] <michael_swe> One collection per month.. That's a new concept to me. I that legit in the noSQL world? Would be a no-no in SQL Server.

[22:07:03] <nukulb> nicholaswyoung: look under this heading "How can I effectively use indexes strategy for attribute lookups?"

[22:07:05] <nicholaswyoung> I don't know about the larger world of nontraditional databases, but it's totally cool in Mongo, since collections can be created on the fly without a real performance penalty.

[22:08:08] <sflint> are you just trying to collect hits?

[22:08:30] <nicholaswyoung> nukulb: This helps a lot. Is there any documentation on incrementing counters for the matched keys?

[22:08:37] <nicholaswyoung> sflint: That's the gist of it.

[22:09:05] <nicholaswyoung> sflint: Here's the schema I would like to have, on a per-document basis: https://gist.github.com/nicholaswyoung/475eeaa49800db1d2f75

[22:09:16] <sflint> do you can about knowing the time of the hit or just total?

[22:09:20] <sflint> care*

[22:09:48] <nicholaswyoung> just total. Each day has a new document, and that's as specific as I get.

[22:10:16] <sflint> k...then you should have a day collection for each

[22:10:21] <sflint> month database for each month

[22:10:36] <sflint> unless you want to keep it for all time? you will need to archive some how

[22:10:53] <sflint> you document should be 1 document for each of the states listed under region

[22:10:53] <nicholaswyoung> That seems a little extreme. I plan on expiring the monthly generated collections, as I go.

[22:11:23] <sflint> i am sorry ... 1 document starting with lowest region (city)

[22:11:43] <nicholaswyoung> sflint: Won't that force me to use aggregate queries to retrieve data

[22:11:45] <sflint> right but if you don't use 1 DB for each month you will fragment and not get the space back

[22:11:45] <nicholaswyoung> ?

[22:12:14] <sflint> no ... you can include everything you need in the _id for the queries

[22:13:08] <sflint> _id : usa california cupertino

[22:13:16] <nicholaswyoung> FYI: the whole "collection per month" thing came from one of the MongoDB early adopters, John Nunemaker of http://gaug.es. That's who started me down this path.

[22:13:22] <sflint> if you want all usa documents you can do find on ^usa

[22:13:44] <sflint> call california find on ^usa california

[22:13:48] <nicholaswyoung> I tried that approach too, but the international naming for regions etc were a massive pain to include in keys of any kind.

[22:14:18] <sflint> then you will have to include them on the document and use secondary index

[22:14:27] <nicholaswyoung> You get weird characters and stuff, like characters we don't see in the normal alphabet.

[22:14:34] <sflint> you don't want you document to grow EVER if you can

[22:14:56] <sflint> if it does grow you want some type of bound on it

[22:15:06] <nicholaswyoung> Gotcha. Well, I'm going to do some more digging and see what I come up with. Thanks for the different perspective.

[22:15:15] <sflint> the collection per month is okay but the OS doesn't get the space back if you remove the collection

[22:15:29] <sflint> if you use a seperate db for the month when you drop it the OS gets the space back

[22:15:38] <nicholaswyoung> I wonder how everyone else has been getting away with the technique then.

[22:15:57] <sflint> the repair their databases

[22:16:08] <nukulb> nicholaswyoung: I haven't seen any where you are incrementing counters for matched kets

[22:16:11] <nukulb> keys*

[22:16:17] <sflint> or they don't take on a lot of data....i am used to having a lot of datat coming in

[22:16:17] <nukulb> it would be nice find some though

[22:36:42] <pomke> Hello!

[22:36:52] <pomke> Sorry if this is a little off topic

[22:37:27] <pomke> Would anyone know where I might be able to buy some mongo stickers for my laptop?

[22:38:04] <Joeskyyy> http://www.mongodb.com/swag

[22:38:39] <pomke> ty :)

[22:38:47] <Joeskyyy> not sure the legality of redbubble's use here but you can buy some here apparently? http://www.redbubble.com/people/bape/works/11148916-mongodb-2?country_code=US&p=sticker&utm_campaign=shopping&utm_medium=google_products&utm_source=google&gclid=CJv105C6_LwCFYuXOgodJHsAIw

[22:39:40] <cheeser> pretty sure that's a trademark violation :)

[22:39:41] <pomke> hah

[22:39:44] <Joeskyyy> me too

[22:39:45] <Joeskyyy> haha

[22:39:53] <Joeskyyy> unless they have some partnership or something

[22:39:59] <pomke> redbubble is a print-on-demand service

[22:40:05] <pomke> so some user has listed that

[22:40:14] <pomke> not the company

[22:40:23] <cheeser> but that user seems to have a whole portfolio of others' trademarked items

[22:40:42] <joannac> cheeser: we should forward that to legal

[22:40:47] <cheeser> indeed

[22:40:57] <Joeskyyy> you're welcome? :D

[22:41:00] <Joeskyyy> can i have a sticker now?

[22:41:07] <joannac> come visit an office

[22:41:25] <cheeser> Joeskyyy: you win a free copy of mongodb shipped directly to your computer.

[22:41:29] <Joeskyyy> might actually come to mongo world actually, y'all sent me a tshirt already so I'm just begging :P

[22:41:46] <cheeser> well, there *will* be more shirts there...

[22:41:51] <cheeser> just saying..

[22:42:16] <Joeskyyy> pretty expensive shirt tho baha

[22:42:36] <Joeskyyy> Nah, I work at Rackspace, so we host some mongo events with you guys sporadically with objectrocket and stuff

[22:42:37] <joannac> lame

[22:42:40] <joannac> I want a tshirt

[22:42:50] <cheeser> nice

[23:06:53] <Fallout2man> So I'm working with node JS and the ISODate's for my fields are all wacky

[23:07:59] <Fallout2man> I'm setting a date of Febuary 18th at 12:00 P.M. PST and getting ISODate("2014-03-18T19: 00: 00.0Z") when it should be ISODate("2014-03-18T4: 00: 00.0Z") what with PST being eight hours behind UTC

[23:29:43] <rafaelhbarros> I asked this before

[23:30:08] <rafaelhbarros> I'm looking into migrating from baremetal ec2 into elastic beanstalk

[23:30:16] <rafaelhbarros> but I have a replica set with 3 nodes

[23:30:28] <rafaelhbarros> how would I go about and do this on elastic beanstalk?

[23:30:58] <rafaelhbarros> should I run the app on the beanstalk and the db on 3 ec2? any suggestions?

[23:32:32] <MacWinner> hi, I have imported a document into a collection.. one of the fields is something like this {"timestamp" : "2013/08/26 17:36:22" } can I make mongo treat this as a normal datetime field?

[23:32:50] <MacWinner> assuming I don't have control on how the data gets into the mongodb

[23:47:40] <Fallout2man> Using NodeJS and I'm getting problems with setting/retrieving dates to my mongo server. I set a date at for instance Feb 18th 12:00 P.M. PST in javascript. I record the date as a date object, I get ISODate("2014-03-18T19: 00: 00.0Z") when it should be ISODate("2014-03-18T4: 00: 00.0Z")

[23:47:48] <Fallout2man> Any idea why this is happening?

[23:47:58] <pasichnyk> hey guys, i'm working with aggregation framework and curious if i can insert a property value from the result documents into a proeprty name of the projection. Basically i'm getting information back grouped by users, and i want to build a document that has a schema like { date: {some date here}, summary.$user.total: $total, summary.$user.sales: $sales, ... }. is that possible, or am i always

[23:47:58] <pasichnyk> goign to end up with a document per "result" that comes back from the aggregation?

[23:48:44] <changerOfSea> hey all, got an aggregation problem here. in my $group clause I have 3 vars that I want to group by, they are all part of the same sub-object, an id, name, and slug. However, only the id is showing up, any ideas?

Log file Viewer

Help | Karma | Search:

#mongodb logs for Wednesday the 5th of March, 2014