[00:39:41] <LurkAshFlake> when i db.naissance.drop() in my ubuntu terminal it return false
[00:42:15] <LurkAshFlake> i can't drop any of my collection
[01:25:02] <joannac> LurkAshFlake: pastebin exact command and output
[08:15:19] <KekSi> quick question since i'm not entirely sure about this right now: is it possible to add more shards to an already sharded collection? it looks to me like that wouldn't be a problem and the only problematic thing would be to change the shard key later
[08:28:14] <Boomtime> KekSi: you can add shard any time you like, note that shards are added to the cluster as a whole, sharded collections merely make use of shards as necessary
[08:29:03] <Boomtime> also, changing the shard key is a difficult problem any time, regardless of how many shards you have - you effectively must dump and re-import your entire collection to change the shard key
[08:39:44] <KekSi> thanks boomtime, thats what i thought
[10:51:22] <djlee> Hi all, if i have a list of object id's represented as strings, is there a way of doing an IN query against _id without converting them ll to ObjectIds, like : find({_id:{"$in":[ "stringid", "stringid2" ]}})
[11:10:54] <mah0-deh0> I have a visits collection that I would like to "group", i.e.: a user just can have a single visit in a certain day so I need to get all its visits by using the min in_timestamp and max out_timestamp and summarizing each visit duration (which may be available or not..). Which is the best way to achieve it?
[12:27:21] <allaire> Hi. We're using the MongoDB Amazon AMI image for our app. we just upgraded from 2.4 to 2.6.9. sudo sevice mongodb start is not working but sudo mongod --config /etc/mongod.conf is working, any ideas?
[12:27:50] <allaire> Only getting Starting mongod: [FAILED], nothing is logged.
[12:30:34] <gabrielsch> hey, is there any way to group datetime by day in mongodb queries?
[12:32:35] <gabrielsch> for example, I want to find how many products where created for each user in the last 30 days...
[12:32:39] <gabrielsch> I have the following schema: https://gist.github.com/gabrielsch/1d8a0d8125a21378f35c
[13:43:44] <amitprakash> Hi, how can I sync my collection data from mongo to EMR? I was looking at mongodb hadoop connector, but the documentation doesn't seem to indicate anything about how information is continuously updated from the latter to hdfs?
[13:44:13] <cheeser> because mongo-hadoop doesn't provide a sync facility
[13:44:22] <cheeser> you can build one with it, though.
[14:07:45] <GothAlice> Oh, HL7 encoding, whoever invented you needs to be taken out back and shot. (Speaking of EMR in the "electronic medical records" sense.)
[14:26:40] <GothAlice> Connector uses a lot of deep tricks to do what it does, including tailing the oplog. If there are _any_ differences in the status information (there are) or operations themselves (there shouldn't be, but could…) then Mongo Connector will have to be updated to recognize it.
[14:28:39] <cheeser> only two issues against wired tiger: https://github.com/10gen-labs/mongo-connector/issues?utf8=%E2%9C%93&q=is%3Aissue+wired+tiger
[14:32:41] <saml> if i can reproduce at smaller testcase, will create ticket
[14:35:30] <pamp> Anyone here with Csharp driver experience?
[14:49:41] <dberry> is there a good study guide out there for the mongodb DBA exam(C100DBA)?
[14:51:26] <saml> wasn't mongodb created so that you don't need dba?
[14:52:27] <StephenLynx> who gives out this certificate? 10gen?
[15:04:38] <saml> mongodump, take a node out of replication, change storage engine, restart the node (not joined in replicaset), mongorestore, let it join. repeat?
[15:04:55] <saml> or do I not have to worry about taking it out of replication?
[15:04:58] <cheeser> if you use mms, we can change the storage engine for you
[15:17:03] <cheeser> mms uses WT on some of its node. when you provision via mms, it uses the default engine which is mmapv1
[15:17:15] <cheeser> but you can tell it to configure your nodes with WT if you want.
[15:18:02] <GothAlice> Indeed; I tested out my dataset that way. It was fun to watch the elected primary symbol rotate around the nodes during my testing. :3
[15:19:13] <GothAlice> The primary literally couldn't stay up for longer than 30 seconds under my 1K ops/sec benchmark. :(
[15:31:46] <saml> 1k ops/sec is web scale depending on the size of each op
[15:40:56] <GothAlice> saml: It was ~700MB of data, initially loading the first 500MB then querying that data to generate and upsert the rest (combined reads, getMores, and updates).
[15:41:40] <GothAlice> A typical pre-aggregation workload; the "benchmark" is actually my at-work project's data bootstrap and migration run, part of which re-calculates aggregate click stats.
[15:42:33] <saml> so complicated. i just want to start mongod, no configuration and everything works well up to web scale
[15:42:53] <saml> i guess mongod conf yaml is still shorter than postgres conf
[15:43:28] <GothAlice> saml: Slap the MMS agent on a few machines, and yeah, it basically is that simple. (I'm just using somewhat under-sized VMs with tight memory constraints, thus running into multiple issues with WiredTiger.)
[15:45:40] <GothAlice> ^_^ I'm guessing if I threw 64GiB of RAM at my VMs, my test suite won't make them crash. 2GiB, though, and yeah, there be problems.
[15:47:01] <saml> oh i see. you should downlaod more ram
[16:32:06] <Croves> I'm learning MongoDB and I have a question. My app have two collection: products and tickets. One ticket can store infinite products. Imagine a restaurant that assign many orders to a specific table
[16:32:48] <Croves> I need to relate these two tables. As a mysql user, I'm about to create a products_in_ticket collection and relate them in this new object.
[16:33:04] <Croves> Is there a better aproach? If so, what is it? Is there foreign keys in MongoDB?
[16:34:21] <StephenLynx> no, there arent foreign keys in mongo, and you never create collections for n*n relations.
[16:34:40] <StephenLynx> you should have an array in a field in the ticket
[16:34:48] <StephenLynx> this array would hold the products it contains.
[16:35:20] <Croves> But what if I need to change some information in products?
[16:35:30] <Croves> I have to update all tickets with that product?
[16:35:34] <StephenLynx> so you just keep track of its unique id.
[16:36:02] <StephenLynx> I would use a readable name as its unique id.
[16:36:15] <StephenLynx> so you don't need to duplicate the name and keep track of it.
[16:47:21] <StephenLynx> for you to list all available products without a products collection
[16:47:40] <StephenLynx> you would have to get all tickets, which are way more than available products, unwind it, group
[16:48:05] <mah0-deh0> Croves, I'm not the best at mongo, but one of your first steps should figure out if u really need mongoDB or a SQL one as your data is relational
[16:48:44] <StephenLynx> well, you can do with a few relations in mongo. but if you need a join, then you really can't use mongo.
[16:48:46] <Croves> StephenLynx: This part I get it...
[16:49:09] <StephenLynx> if your amount of queries increase with the amount of results, you can't use mongo.
[16:49:34] <Croves> Would you use Mongo for a restaurant software?
[16:53:28] <StephenLynx> not without, but few relational cases.
[16:53:47] <StephenLynx> as I said, you can go with fake relations, if you understand how that limits you.
[16:55:16] <Croves> So, I guess I'll have to use MySQL then
[16:55:22] <StephenLynx> if you can use fake relations without performing additional queries to the db, you don't really lose much in performance, or anything at all. and if you would need to perform an unwind on a large set of data to perform a sort, you would even gain, IMO.
[16:55:25] <mah0-deh0> Croves: you should google "mongodb vs mysql" to figure out what do you need, there are A LOT of articles covering this topic already
[17:01:30] <StephenLynx> options like this would be sqlite.
[17:01:37] <cheeser> mongo isn't officially supported on that hardware at any rate.
[17:01:37] <Croves> StephenLynx: I have a software that manages a entire restaurant. Now I'm building a new version of it, so the waiters will use iPads to note customer's orders and so on...
[17:01:50] <cheeser> there are ARM builds out there but they're community driven so caveat emptor
[17:02:06] <Croves> But for many reasons, I can't keep this software on the web. I really need a physical server into my customer's restaurant
[18:25:25] <juliofreitas> I've another problem. I'm using: mongodbimport -d testParser -c myCollection --type tsv --file myfile.tsv --headerline and my collections there is a key value like this "" : "" . How can I remove it?
[19:40:23] <StephenLynx> hey, if a json object has a key with ":", how do I read it? object['lala:lolol']?
[19:59:49] <GothAlice> StephenLynx: Yup, that'd be the approach.
[21:45:15] <miceiken> Hello guys. Do I just store password hashes in a StringField?
[21:57:04] <GothAlice> miceiken: Unless you are aware of the crypto security issues involved, I recommend storing scrypt hashes rather than rolling your own (i.e. "just sha it").
[21:57:09] <GothAlice> miceiken: Which language are you using?
[22:01:14] <GothAlice> Still; scrypt handles things like salting for you, it's also designed purposefully to be slow. (And memory hard in addition to being CPU hard, to prevent parallelization attacks. Literally the result of every previous loop iteration, internally, is needed to produce the next iteration.)
[22:02:47] <GothAlice> This differs from bcrypt in that bcrypt uses symmetric encryption to make itself slow, and doesn't have that type of feedback setup; increasing the number of needed iterations increases CPU difficulty, but not memory.
[22:05:16] <GothAlice> miceiken: https://github.com/bravecollective/core/blob/develop/brave/core/util/field.py#L14-L48 < I use the MongoEngine ODM on top of pymongo; if you do, too, you can borrow my PasswordField implementation. (MIT license.)
[22:06:58] <miceiken> yeah, I heard there was something going on. Anyway, I am literally writing a microservices thing now because I got so inspired by the brave collective IT
[22:09:02] <GothAlice> cheeser: The result of that, BTW, was doxxing and some rather tedious conversations with my employers. (They agreed that gamers getting that upset over volunteer services going away are, themselves, pants-on-head.)
[22:10:48] <cheeser> probably from the BOFH heyday
[22:11:29] <GothAlice> cheeser: I use lines similar to that fairly frequently. http://s.webcore.io/image/3A313C1q1A0y < side effect of camping ##webdev with a female nick.
[22:22:47] <Feadurn> Is it possible to query a database to find all the documents that have a specific value to a field and doing it again with the next different one
[22:23:08] <Feadurn> I have seen that distinct can output all the distinct value but the dataset contains millions of row
[22:23:29] <Feadurn> so how can I do that without knowing in advance all the distinct value and not parsing them several times