[10:08:50] <phira> if I know I'm going to restart a replicaset member, ie for a system update or something, is it good practice to remove it from the RS first?
[10:43:23] <NodeX> quick RS question .... is there anyway to have a secondary not put indexes into RAM?
[10:43:52] <NodeX> I need a secondary purely as a backup and never intend to switch to it - just want the files
[10:44:39] <kali> well, mongodb does not do anything clever to control the way the RAM is used. it delegates everything to the kernel, so quick answer is no
[10:45:00] <kali> but anyway, you need your replica to update the index to keep track of its primary. and for that, you need them in RAM anyway
[10:47:30] <kali> if it does not work just with slaveOk(), i'm pretty sure it will work if you stop the instance, restart it without the replset on a different port and drop the indexes and restart with the normal setup
[10:48:02] <NodeX> aggregation is killing my primary CPU so I have to move it to another box
[10:48:40] <NodeX> but that box also has a mongo instance on it so I need a low level instance that doesn't use much RAM
[11:40:13] <elias_> Anyone aware of a comparison betwen MongoDB and Cassandra with some examples and detailed explanation why one is better than the other in certain circumstances? We are in the final round of a DB evaluation exercise of a project we need to decide between the two. Briefly, within our team cassandra is considered better as it regards to non-functional requirements such as High Availability, Clustering etc while MongoDB is considered better as
[11:40:13] <elias_> it regards to ease of use during development, data modeling and query flexibility. Since we need both, its hard to make the decision. Any hint from your side or references to external material would be greatly appreciated.
[11:43:00] <NodeX> each comparison is meaningless as it's probably not related to your specific app/requirements
[11:43:42] <NodeX> perhaps explain what you're using it for / the types of queries and we can advise what Mongo will be like in terms of performance and scalability regarding
[11:49:32] <elias_> Yes, I understand that. That's why I am interested in examples than strict feature comparison
[11:50:20] <elias_> its hard to go into example use cases, but some of our objects seem to have a depth of 4 or more levels
[11:50:50] <elias_> which seems to be trouble in cassandra - not blocking though
[11:51:48] <elias_> maybe let me start with a fundamental question on demineralization and its implications
[11:52:04] <elias_> say you have twitter with a user having 100.000 followers
[11:52:56] <elias_> if you store tweets in to a user_timeline collection, a tweet from this user will be copied 100,000 times
[11:54:02] <unsleep> i have a tag list for a suggestion box... the tags are an array index.... if i have the word "word" and i can find it i need to search tag:"word".... how i find it with looking for "w" only?
[11:58:59] <NodeX> looking up those 100k followers will be messy and unperformant so that needs to be in a graph DB probably
[11:59:43] <elias_> any alternative to suggest - not for twitter per se - but for similar problem
[11:59:53] <elias_> how would you do it in mongo though
[11:59:56] <NodeX> Mongo docs currently have a 16mb per document limit on them and in order to keep things fast you would need to store the followers + a little info about them maybe (twitter name or w/e) - this may exceed the doc size
[12:00:20] <unsleep> i preffer to do 10 querys than 10 copies...
[12:00:27] <NodeX> which would mean you needing to shard the user' followers
[12:00:40] <NodeX> unsleep : that's bad performance
[12:16:02] <kali> today news... france forces at war in mali, three kurds activists killed in paris this week, one french hostage probably killed in somalia during its attempted rescue, about one million religious zealot marching in paris tomorrow against gay marriage
[16:16:55] <warrick> i have never used mongodb, but find the api meets my needs very well. however, i am unsure that mongodb is suited for my use case.
[16:17:37] <warrick> can mongodb maintain 1000-1500 upserts/second
[16:18:27] <warrick> small document size (< 1KB), and two daily collections, which will be cycled out of mongodb
[17:47:27] <Mortah> hullo. We just added an index (background true) to a replica set. primary server added it fine. secondaries have got 20% through (and still going) but all queries to secondaries have suddenly started hanging
[17:48:24] <nemothekid> What would be the best way to drop an entire collection on a single shard without effecting the other shards? I don't think a remove is possible either
[17:53:16] <kali> Mortah: you can workaround by building it offline: stop a replica, start it without the replset option and on a different port, build the index manually, set the replica back in line
[17:53:20] <Mortah> (looking at how long we've had issues vs how long the index has been running)
[17:53:26] <kali> Mortah: and iterate over your replica
[21:30:42] <borntyping> I'm working on a project using MongoEngine, and am wondering whether it is better to use embedded documents or not for storing a set of related objects (that are unique to the parent object)_
[21:41:00] <borntyping> That's pretty much the same answer I've got in other places, but in a little more detail
[23:09:18] <owen1> let's say i have 3 hosts in my replica set. 1 can't see the other two. will he become a primary? what if the other 2 are ok, it's just a network issue?