
java-user.lucene.apache.org



  • Moderate traffic list: up to 30 messages per day
  • This list contains about 41,883 messages, beginning Mar 2005
  • 10 messages added yesterday

December 2005 - page 1
Jack Xue 113339578201 Dec 2005* Hello list, I am working on boosting query performance by writing my own Similarity Class (Rewrite tf(), idf(), boost(), lengthNorm()). I've gone...
Daniel Pfeifer 113340685201 Dec 2005* Hi, I am working for a major Application Service Provider in Europe and we have been using Lucene 1.4 very successfully for a couple of months. We are...
reverse sort (7 Replies)
Michael Pow 113341663501 Dec 2005* I'm trying to reverse sort a result set by its date field (YYYYMMDDhhmm). pseudocode: Boolean order = {false | true}; Sort sorter = ne...
Andy Hind 113342716701 Dec 2005 Hi I would like some pointers for causes of the following error, using lucene 1.4.3. I have not really got much to go on at the moment other than the ...
George Abraham 113357035903 Dec 2005* All, I have created a Lucene index from data in a SQL Server db. When I conduct a Lucene search, I get back in the hits the primary key (WorkID) and t...
Gaston 113365577004 Dec 2005* Hello, JTidy is a very good HTMLParser but for HTML Websites made with the help of Microsoft Office Products like Word for example it is not optimal....
Distributed sort (2 Replies)
Jeff Rodenburg 113373391104 Dec 2005* In one of the Google Labs whitepapers ( http://labs.google.com/papers/mapreduce-osdi0...), a programming construct known as MapReduce is used in a var...
Paul.Illingworth 113376479005 Dec 2005* A simple question and I guess it may have been asked before. Does the order of Querys in a BooleanQuery affect search speed? By this I mean if the fir...
Harini Raghavan 113378615005 Dec 2005* Hi, I have a requirement to highlight search keywords in the results and display the matching fragment of the text with the results. I am using the Hi...
Combs, Craig 113380347805 Dec 2005* This is very mysterious. I have checked my parser and I'm returned body:<token>. My analyzer during indexing returns <token> in the toke...
Combs, Craig 113380819205 Dec 2005* I'm able to see the documents that were indexed but not the tokens associated with the document in Luke. I'm using the multifield query pars...
Combs, Craig 113381167405 Dec 2005 I hate to admit this but I must. My error was caused by a simple offset on a counter. Luke was very helpful in helping me determine this error. Althou...
Alan Chandler 113386441806 Dec 2005* I added a date field to a document with doc.add(Field.keyword("A Date",myDate)); How do I get it back out again as a date?...
Dan Climan 113386609306 Dec 2005* Do stopfilters create non-contiguous token positions?I was interested in experimenting with the highlighter and using the TokenSources.getTokenStream(...
Amir Hosein Jadidi Nejad 113387476406 Dec 2005* Hi All, does anybody have a "persian (farsi) analyzer" for lucene search engine. if "arabic" is available, i can customize it to far...
Robert Selvaraj 113388857606 Dec 2005 SearchBlox Software has released Version 3.1 of its J2EE Content Search Software. SearchBlox delivers out-of-the-box search functionality for quick an...
Zhang, Lisheng 113389283106 Dec 2005* Hi, We are using lucene v1.4.3 for some time, in general it is working well. We often try to search multiple collections at the same time, so we are u...
Dan Funk 113389312706 Dec 2005* In the sandbox at http://lucene.apache.org/java/docs/lucene-san... There is a link to the WordNet repository: http://svn.apache.org/repos/asf/lucene/j...
Mordo, Aviran (EXP N-NANNATEK) 113389318306 Dec 2005 Lucene is thread safe, it is recommended that you only have one IndexSearcher instance. No problems with multiple searches on the same IndexSearcher. ...
Alan Chandler 113391236606 Dec 2005* I have implemented a "Database" on top of lucene and am currently testing it. When I manually step through my test suite in the debugger, it...
victorn 113394423907 Dec 2005 kbforge.com is pleased to announce the first public release of "kbforge", a new, completely free, desktop search application specifically de...
Eugene Ezekiel 113396885007 Dec 2005* Hi, Is there any way to get the similarity scores for each document in the index? I can iterate thru each doc in the index using the IndexReader but n...
repeating fields (5 Replies)
Reza Ghaffaripour 113397222407 Dec 2005* hi all, im new to lucene. i have an xml with repeating tags.something like : <a> <p>x</p> <p>xx</p> <p>xxx</p...
Ching-Pei Hsing 113401442608 Dec 2005* Has anyone solved the following problem, or have good suggestions? Each document is assigned to one or more category nodes in a hierarchy. For example...
Alan Chandler 113403065908 Dec 2005* I am trying to construct, via individual query api, a query to search for documents with a field name of "Category" and a value of either ...
Vanlerberghe, Luc 113403282508 Dec 2005* Since 'byte' is signed in Java, can't the first test be simply written as if (b>0) return b; Doing an 'and' of two bytes a...
Paul.Illingworth 113404350608 Dec 2005 Hello all, Whilst merging one index into another using IndexWriter.addIndexes(IndexReader[]) I got the following error. (index _file_path)\_5z.fnm (Th...
Top n Searches (2 Replies)
Paul Williams 113405396908 Dec 2005* Hi, I've been asked whether we can do a Top n Searches functionality where we record the most common searched for phrases on a daily basis. I...
Dalton, Jeffery 113405478308 Dec 2005 Andrzej, I think you did a great job elucidating my thoughts as well. IAgreed!> What I found out is that "usable" depends a lot on how yo...
Dan Liu 113405881908 Dec 2005 Hi, What is the difference between following approaches? Approach1 1) open IndexWriter and index documents 2) optimize the indexWriter and close the i...
Beady Geraghty 113406372208 Dec 2005* I am back to doing something with Lucene after a short break from it. I am trying to index/search hyphenated words, and retrieve them from a token str...
Mordo, Aviran (EXP N-NANNATEK) 113406577808 Dec 2005 Well the best way in my opinion is to: 1) open the IndexReader and delete some documents from the same index 2) close the IndexReader 3) open IndexWri...
J.J. Larrea 113406720108 Dec 2005* My application needs to simultaneously process record additions and updates with one pass through a database. That's not in itself a problem: I o...
Dan Liu 113406817508 Dec 2005 The document is indexed first. This is required by the application. Based on "Lucene in Action", "Optimization" is to merge multiple in...
Mordo, Aviran (EXP N-NANNATEK) 113407068408 Dec 2005* Optimization also purges the deleted documents, thus reduces the size (in bytes) of the index. Until you optimize documents stay in the index only mar...
Dan Liu 113407236008 Dec 2005 The document is marked as "deleted" when reader.delete(i) is called. It is actually deleted from index when reader.close(). The deleted dou...
pdf and highlighting (6 Replies)
Sonja Löhr 113407253208 Dec 2005* Hi, all! I have a question concerning analysis and highlighting. I'm indexing multiple document formats (up to now, only html and pdf occurred, an...
Dan Quaroni 113407315608 Dec 2005 I'm confused by what you mean - there is no difference between something being marked as deleted and deleted. (Since it's not removed from t...
Dan Liu 113407426008 Dec 2005 There IS a difference between something being marked as deleted and something being actually deleted, as documents marked as deleted can be undeleted. The docu...
JVM Crash in Lucene (2 Replies)
Dan Gould 113409763409 Dec 2005* Hi-- I'm relatively new to Lucene. When I run my app, I get a JVM error. This gets called a lot, but only fails every once in a while (maybe 1 in ...
Dan Hanley 113414284109 Dec 2005 MaxPermSize is a not-always-documented jvm flag similar to Xmx. You set it like this... java -Xmx200M -XX:MaxPermSize=100m Myclass It sets the maximum...
Lucene Tutorial (3 Replies)
Srinivas Jadcharla 113415142809 Dec 2005* Hi there, is there any online tutorial which explains how to use Lucene, that is, starting from installing Lucene to develop a simple application th...
Dan Funk 113417261609 Dec 2005* We build indexes, then share those indexes (along with files and database records) with our client installations. We now have multiple clients, and th...
Alan Chandler 113419673010 Dec 2005* I am slowly making my way through lucene, as witnessed by earlier threads to this mailing list. But I am stuck again, going round in circles with the...
index databases (2 Replies)
Vasudeva Rao 113420496910 Dec 2005* hi there are there any APIs which will index mysql databases and run periodically ? i have one more query: if i choose to search on multiple fields do...
Releases (1 Reply)
Lawrence 113426517511 Dec 2005* Hello, Is there a projected date for the next release? I am using the code from SVN repos and I am hoping the next release will be before I go to prod...
Cheolgoo Kang 113428650911 Dec 2005* Hi, I'm running an index on FSDirectory with 0.4M documents with each of 7 fields. When I open an IndexReader and an IndexSearcher, the average s...
Keegan Callin 113430022611 Dec 2005* Hello and Good Day, In my application of Lucene, I must search through some fields that contain numbers with very large ranges on the order of 1500...
Dan Gould 113431691311 Dec 2005* First, thank you Chris, Yonik, and Dan for your ideas as to what might be causing this problem. I tried moving things around so that the IndexReader i...
Raul Raja Martinez 113431921811 Dec 2005* What would be the best practice for storing the index in a webapp? I mean, in which folder? Thanks. Raul....
