java-user.lucene.apache.org


  • This list contains about 41,857 messages, beginning Mar 2005

October 2010 - page 1
Sahin Buyrukbilen 128604536902 Oct 2010* Hi all, I need to retrieve the score of a term in a document? I dont want to play different scoring schemes. I just checked my index with Luke and it ...
Federico Fissore 128618935404 Oct 2010* Hello all We are migrating our apps from 2.4.1 to 2.9.3, removing deprecated calls so to fast switch to 3.0.2 asap At startup, our apps look for the m...
Koji Sekiguchi 128621267004 Oct 2010* Hello, I'd like to know which field got hit in each doc in the hit results. To implement it, I thought I could use Scorer.freq() which was introdu...
Justin 128621642704 Oct 2010* Hi all, The JavaDocs do not appear to mention that only stored fields persist IndexWriter.updateDocument. When opening new readers, from either IndexW...
Altaf Vasi 128627030105 Oct 2010* Hi, i am doing the following in java. I am searching through a lucene index and getting a Collection of Documents. Below is the code that i have writt...
Naveen Kumar 128627460605 Oct 2010* Hi I have a Very large number (say 3 million) of frequently changing Small indexes. 90% of these indexes contain about 50 documents, while a few 2-3% ...
Jason Rutherglen 128628233905 Oct 2010* Lets say the segment infos file is missing, and I'm aware of CheckIndex, however is there a tool to recreate a segment infos file?...
Jordon Saardchit 128628647005 Oct 2010* In lucene 3, is there an equivalent to obtaining a BitSet of documents from an Index as there was in version 2.x? I'm trying to put together an up...
Mindaugas Žakšauskas 128629447905 Oct 2010* Hi, I am keeping a ConcurrentMap of o.a.l.index.IndexReader which I use in my system. These readers are retrieved by multiple threads and I have no kn...
liat oren 128631160305 Oct 2010* Hi, Is it possible to use WhitespaceAnalyzer in one field and another analyzer in a different field? If it is, how should it be written? Many Thanks, L...
Philippe Thomas 128636105306 Oct 2010* Hi, I was indexing some documents, but my program crashed after several days of work. If I reopen this index it is empty. I guess the reason is that a...
Mark Harwood 128637871106 Oct 2010* Having upgraded a live system from 2.4 to 2.9.3 the client is reporting a change in merge behaviour that is causing some issues with their update moni...
Todd Nine 128639255106 Oct 2010* Hi all, I'm having some issues with Numeric Range queries not working as expected. My underlying storage medium is the Lucandra index reader and w...
Sergey 128642814607 Oct 2010* When running application on Windows XP 32 bit machine the search time is 0.5 second. JVM is IBM Java 5 for 32 bit. But when running the same applicati...
Shailendra Mudgal 128646636507 Oct 2010* Hi Everyone, Recently we have migrated from lucene 2.2 to lucene 2.9.3. We are having some issues in search. During the load, searchers are getting hu...
Sirish Vadala 128650490908 Oct 2010* Hello All: Can any one suggest me the best way to implement both sentence specific and non sentence specific phrase search? The user is going to have ...
Jan Engler 128653262108 Oct 2010* Hi everybody, I am currently working on a system which stores objects in a database. For searching these objects I use a lucene indexer which adds all...
detect Lucene version (2 Replies)
Ivan Vasilev 128654279908 Oct 2010* Hi Guys, Is there way to detect org.apache.lucene.util.Version of an index having IndexReader or just FSDirectory? I know I can open segments file and...
Build environment (2 Replies)
Pradeep Singh 128681229011 Oct 2010* Hi all, I am having issues building Lucene and Solr from svn checkout. I had this problem earlier but I was able to figure out the combination of ant ...
flushing index (9 Replies)
Yakob 128682994711 Oct 2010* hello all, I am here to ask about lucene in flushing indexes. below is a pseudocode I get from the book lucene in action. FSDirectory fsDir = FSDirect...
Paolo Castagna 128687207412 Oct 2010* Hi, I am curious. Do you know why the book Lucene in Action, Second Edition is not available on sale (as new) on Amazon UK? http://www.amazon.co.uk/Lu...
Update lucene index (4 Replies)
Nilesh Vijaywargiay 128690943412 Oct 2010* Hi Group, I understand that the process of updating a document in lucene index is to delete the document and add it again. But I do not want to delete...
Hyun Joo Noh 128694267813 Oct 2010* Hi, how would you make Lucene leave a search log of who searched what, when, etc (i.e. cookie, query, timestamp, etc)? Is there a built-in Lucene func...
Jeff Zhang 128698145913 Oct 2010* Hi all, I only want to index the latest one week's data, the previous data can be deleted. So I'd like to know about lucene's delete perfo...
Pawlak Michel (DCTI) 128698698213 Oct 2010* Hello, We have an application which is using lucene and we have strong performance issues (on bad days, some searches take more than 2 minutes). I...
Dan OConnor 128700331413 Oct 2010* Jeff, I would suggest not deleting documents off the back of the index unless you can optimize your index regularly. (Depending on your volume, this c...
Ching 128703439814 Oct 2010* Hi All, Can anyone help with this issue? I have about 2000 pdf files that I use PDFBox to extract its text, then index them using for loop. The indexi...
Christoph Hermann 128705221214 Oct 2010* Hi, is there a way to store additional metadata with fields? My Problem is as follows: I'm extracting extended html with tika. This extended html ...
ParallelReader (4 Replies)
Nilesh Vijaywargiay 128710698515 Oct 2010* I have two index, A and B. Can two documents doc1[in index A] and doc2[in index B] have a common field? doc1 and doc2 have same document Id...
subwayne 128713337915 Oct 2010* Hi, I'am facing some problems in using Lucene. The index I am using is constructed like this: try { Analyzer analyzer = new SnowballAnalyzer(Versi...
Zaharije Pasalic 128714927015 Oct 2010* Hi my original problem is to index large number of documents which contains 360 integers in range from 0-90K. Searching it's a little bit complicat...
Sykes, Derek 128715295115 Oct 2010* Hi there, I'm currently trying to work out how I can determine the type (string/number/date/etc)of a term. I've not seen any off the shelf way...
appy74 128716661715 Oct 2010* Hello I would like to store data retrieved hourly from RSS feeds in a database or in Lucene so that the text can be easily indexed for word frequencie...
Tokenizing XML (1 Reply)
Christoph Hermann 128716690515 Oct 2010* Hi, is there a Tokenizer in Lucene, that tokenizes XML correctly? I.e. that one gets from the following XML: <span>this is <span attr=...
Christoph Hermann 128717058715 Oct 2010* Am Donnerstag, 14. Oktober 2010, 12:29:43 schrieben Sie: Hello,Thanks for the hint. Looks good. Although i have some additional questions. It seems Pl...
APOLO_11 128722421516 Oct 2010* hey - is there an API that return the number of term indexed? I found the API return the amount of document indexed (IndexWriter.docCount) but cant fi...
Christoph Hermann 128724266316 Oct 2010* Hello, i'm trying to introduce an additional weighting using payloads. Therefore i implemented my own Similarity class (see below) and implemented...
Yakob 128724768316 Oct 2010* well actually I am doing a kind of a thesis regarding information retrieval.and my tutor wanted me to be able to create a program that firstly index a...
MultiFieldQueryParser (3 Replies)
Lev Bronshtein 128731199417 Oct 2010* Hi Group, I have an issue when using MultiFieldQueryParser, I would like to use one query against a number of fields however I get a java.lang.Illegal...
Christoph Hermann 128733861417 Oct 2010* Hello, how can i copy the Payload from the current token to the following token in a TokenFilter? I have implemented a TokenFilter and thought, that i...
Mike Sokolov 128734064217 Oct 2010* Background: I've been trying to enable hit highlighting of XML documents in such a way that the highlighting preserves the well-formedness of the ...
Mahmoud Abdelkader 128734750517 Oct 2010* Hello, We're currently evaluating utilizing Lucene to index a large English corpus and we are optimizing for space. We're basically conce...
Antony Joseph 128739492718 Oct 2010* Hi all, I got this java.lang.OutOfMemoryError: Java heap space error on searching and also it crashed my application [Python.exe]. Its an long running...
Max Jakob 128741242118 Oct 2010* Hi, I would like to change the IDF value of the Lucene similarity computation to "inverse document frequency inside category". Not the compl...
Grant Ingersoll 128743952818 Oct 2010* Is there interest in having a Meetup at ApacheCon? Who's going? Would anyone like to present? We could do something less formal, too, and just hav...
Nilesh Vijaywargiay 128744068218 Oct 2010* I have many fields in my document and want to parse my query including each of them QueryParser parser = new QueryParser(Version.LUCENE_29, "Fiel...
Andy Yang 128747659419 Oct 2010* I would like to use MultiFieldQueryParser to search multiple fields, then in each field, I want to use fuzzy search. How can that be done? Any example...
Nilesh Vijaywargiay 128750973719 Oct 2010 I am trying to find a work around for updating fields and in turn the documents in the original index. I am using parallel reader and providing it two...
Jasper de Barbanson 128756551120 Oct 2010* I'm currently working on building a Geocoder. The purpose of a Geocoder is to find the coordinates belonging to any given input address. I have a ...
Richard Marr 128758586720 Oct 2010* Hi all, We've booked a London Search Social for Thursday the 28th Sept. Come along if you fancy geeking out about search and related technology ov...