ArchiveOrangemail archive

java-user.lucene.apache.org


(List home) (Recent threads) (34 other Apache Lucene lists)

Subscription Options

  • RSS or Atom: Read-only subscription using a browser or aggregator. This is the recommended way if you don't need to send messages to the list. You can learn more about feed syndication and clients here.
  • Conventional: All messages are delivered to your mail address, and you can reply. To subscribe, send an email to the list's subscribe address with "subscribe" in the subject line, or visit the list's homepage here.
  • Moderate traffic list: up to 30 messages per day
  • This list contains about 41,903 messages, beginning Mar 2005
  • 2 messages added yesterday

java-user.lucene.apache.org

September 2009 - page 1
marquinhocb 125179488601 Sep 2009* I would like to create a scorer that applies a score based on a value that is calculated during a query. More specifically, to apply a score based on ...
Chris Bamford 125180779101 Sep 2009* Hi Mike, Thanks for the suggestions, very useful. I would like to adopt a combination of setUseCompoundFile on the IndexReader and perform an open/clo...
Chris Bamford 125181313601 Sep 2009 Thanks Mike, I get what you mean now :-) BTW I have tested the code with 1 open/close per search (rather than keeping the IndexReader open between sea...
Alex 125182503501 Sep 2009* Hi, I am new to Lucene so excuse me if this is a trivial question .. I have data that I Index in a given language (English). My users will come from d...
Christian 125183522901 Sep 2009 Hi, I am putting some text into a field which we set to Field.Store.NO & Field.Index.NOT_ANALYZED. We are then doing exact & fuzzy matches aga...
Chris Bamford 125183985101 Sep 2009* Hi Erick,I have often wondered about this - I hope you can help me understand it better in the context of our app, which is an email client: When one ...
cemerick 125185892302 Sep 2009* I've built a Lucene Directory implementation for jdbm, an embedded Java database. Part of the Directory API are two methods related to "file...
<Fang_Li125185990302 Sep 2009* I met a problem to open an index bigger than 8GB and the following exception was thrown. There is a segment which is bigger than 4GB already. After se...
Konstantyn Smirnov 125188002802 Sep 2009* Hi all I have following pieces of code (simplified for the simplicity sake): 1: new TermsFilter( termsList:[ new Term( 'id', '111...
Sahi 125188608302 Sep 2009* Hi, I'm new to this site. My question is: Articles in wikipedia can be edited by everyone and may or may not be accurate. If any contributor writ...
lucene on amazon s3 (2 Replies)
prasenjit 125188906102 Sep 2009* I am exploring the possibility of creating large lucene indices via ec2/s3. Till now I have got only teh following url : http://www.kimchy.org/lucene-...
Christian 125201526303 Sep 2009* Hi, I am putting some text into a field which we set to Field.Store.NO & Field.Index.NOT_ANALYZED. We are then doing exact & fuzzy matches aga...
Michael Thomsen 125201557103 Sep 2009* Is it possible to translate this sort of Perl regex into a lucene query: /goth(am|ic)/ Where the only results that would be returned would be got...
Ganesh 125204899904 Sep 2009* Hello all, I want to retrieve the first result in the group. How to acheive this? Currently i am parsing all the results, using a hash and avoiding du...
David Causse 125205296104 Sep 2009* Hi, If I use tika for parsing HTML code and inject parsed String to a lucene analyzer. What about the offset information for KWIC and return to text (...
Francisco Borges 125211181705 Sep 2009* Hello everyone, As I understood it, merging indexes will lead to the deletion of the original indexes. Is there a way to merge indexes while keeping t...
Marc Sturlese 125217159405 Sep 2009* Hey there, I am iterating over a DocSet and for every id I neew to get the value of a field wich is analyzed with KeyworddAnalyzer and is not sored. I...
Erik Hatcher 125220866506 Sep 2009 Excuse the cross-posted announcement. Next week we'll be having a Lucene/Solr meetup around the JavaZone conference in Oslo, Norway. Before, duri...
Ganesh 125231565807 Sep 2009* Hello all, In my linux pc, there are too many fd counts for lucene database. /proc/<processid>/fd shows very big list. I have provided sample be...
Anshum 125232574707 Sep 2009 Wrote a post about 'how to use lucid gaze for lucene'. Thought I'd share here as I didn't find anything elsewhere on the informati...
Paul Taylor 125233580407 Sep 2009* I submitted this https://issues.apache.org/jira/browse/LUCENE-... patch to StandardTokenizerImpl, understandably it hasn't been incoroprated into...
関 磊 125241137408 Sep 2009* Hello dears, I an studying the index format of lucene. But, I really cannot understand the format in "*.frq" file. Please you help me. I thi...
Paul Taylor 125246785909 Sep 2009* Is there way to get complete start end matches to be first in the list We use Lucene to search song albums titles typically one to ten words long. If ...
m.harig 125248112409 Sep 2009* hello all , is there any way to get all tokens from my index ? please anyone suggest me...
Chris Bamford 125249589609 Sep 2009* Hi, Since moving our app to Java 6 and Tomcat 6, we have started getting occasional exceptions of the form: java.io.IOException: Stream closed at sun....
Ian Vink 125250285109 Sep 2009* I have created an index and each document has a contents field and a language field. contents has the flags: Indexed Tokenized Stored Vector language ...
Mark Miller 125250678809 Sep 2009* Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the secon...
関 磊 125251032909 Sep 2009 Hello, I want to know how to calculate the DGaps value in *.del file? For example, if there are 8000 bits and only bits 10,12,32 are set, DGaps would ...
Mark Miller 125251504209 Sep 2009 Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the third...
Bill Au 125258024310 Sep 2009* Has anyone done anything regarding the support of PayloadTermQuery in MoreLikeThis? I took a quick look at the code and it seems to be simply a matter...
Uwe Schindler 125258388010 Sep 2009* Hi, I cross-post this here, Isabel Drost is managing the meetup. This time it is more about Hadoop, but there is also a talk about the new Lucene 2.9 ...
Patricio Galeas 125258400210 Sep 2009* Hi all, I get the TooManyClauses exception by some wildcard queries like : (a) de* (b) country AND de* (c) ma?s* AND de* I'm not sure how to appl...
AHMET ARSLAN 125258587410 Sep 2009 Which analyzer/tokenizer are you using on that field? I am assuming it is a tokenized field.Is it returning 1's as well as -1's? - is a spec...
maryam ma'danipour 125258888810 Sep 2009 Hello every . I have a problem with MultiSearcherThread.hits in ParallelMultiSearcher.java . Some times when I want to search via paralleMultiSearcher...
vibhuti 125259345710 Sep 2009* Hello I am new to Lucene and facing a problem while performing searches. I am using lucene 2.2.0. My application indexes documents on "keyword...
Nick Bailey 125259438210 Sep 2009* Looking for some help figuring out a problem with the IndexReader.isCurrent() method and cached indexes. We have a number of lucene indexes that we at...
Shai Erera 125260216610 Sep 2009* Hi I'd like to extend Lucene's FieldCache such that it will read native values from a different place (in my case, payloads). That is, inste...
asitag 125260543510 Sep 2009* Hi, We are trying to index html files which have japanese / korean / chinese content using the CJK analyser. But while indexing we are getting Lexical...
Dvora 125261283910 Sep 2009* Hello, I'm using Lucene2.4. I'm developing a web application that using Lucene (via compass) to do the searches. I'm intending to deplo...
rr04 125263902311 Sep 2009 I am a MIT student doing a project on schema-less database usage and would greatly appreciate if you guys can fill out a quick survey on this (should ...
Jason Rutherglen 125265844411 Sep 2009* I'm seeing a strange exception when indexing using the latest Solr rev on EC2. org.apache.solr.client.solrj.SolrServerException: org.apache.solr....
Ariel 125268326511 Sep 2009 Hi every body: I am using lucene version 2.3.2 to index and search my documents. The problem is that I have a remote search server implemented this wa...
Ian Vink 125278621312 Sep 2009* My index has a field <religion> with the source of the document. In luke I can see that religion has baha'i or islam or Tao etc.... The pro...
Alexy Khrabrov 125279795812 Sep 2009* Given that I have a field for which term vector was computed and stored, and that field is the text of a document, I'd like to rank a subset of s...
Paul Taylor 125280318813 Sep 2009* Is it possible to filter before tokenize, or is that not a good idea. I want to convert '&' to 'and' , so they are dealt with ...
Mark Miller 125287334813 Sep 2009* Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the fourt...
Stephen Greene 125289290314 Sep 2009* Hello, I am new to lucene and building an application which requires documents with many fields to be searched. A "project" id is being stor...
Phil Whelan 125294483314 Sep 2009* Hi, I've used NumericField to store my "hour" field. Example... doc.add(new NumericField("hour").setIntValue(Integer.parseInt...
Mark Miller 125294976114 Sep 2009* Hey all, Just wanted to alert you to a new free offering we just released. Combine Lucene with a little aspect programming and you can do some pretty ...
Phil Whelan 125296797714 Sep 2009* Hi, I'm not sure why my IndexReader.reopen() call is not working. The latest results are not coming back, meaning the reader / searcher has not b...

Next page

Home | About | Privacy